Statistical modelling lies at the heart of modern data analysis, helping us to describe and predict the real world. Statistical inference is the way that we use data and other information to learn about and apply statistical models. In this module, you will learn the theory underpinning modern statistical methods such as fitting normal linear models, evaluating how well they fit the data and taking inferences from it. You will apply the theory using statistical software such as R to analyse and draw conclusions from a range of real-world data sets. Topics covered in the module range from estimators, confidence intervals, design of experiments and hypothesis testing to statistical modelling, regression, inference and comparison of models. Skills developed in the module are taken further in modules such as MTH3012 Advanced Statistical Modelling.
Prerequisite module: MTH1004 or equivalent.
This module aims to develop understanding and competence in statistical modelling by introducing you to the Normal linear model from a modern perspective. It will provide you with the ability to formulate and apply these models in a range of practical settings, to carry out associated inference appreciating how this relates to the general likelihood inferential framework, and to perform appropriate model selection and model checking procedures. Use will be made of a suitable statistical computer language for practical work.
INTENDED LEARNING OUTCOMES (ILOs) (see assessment section below for how ILOs will be assessed)
On successful completion of this module, you should be able to:
Module Specific Skills and Knowledge:
1 demonstrate knowledge and understanding of inferential procedures, including point estimation, interval estimation and hypothesis testing;
2 apply these inferential procedures to draw correct inferences from data;
3 derive properties of basic inferential procedures;
4 formulate simple and multiple regression models and analyse their properties, including polynomial regression and models which involve categorical explanatory variables (i.e. factors) and understand how the latter relate to classical analysis of variance techniques;
5 demonstrate an awareness of the range of practical situations where it is, and is not, appropriate to employ Normal linear models;
6 demonstrate understanding of the theory and practice of estimation and inference for the Normal linear model and be able to apply this to fit models and carry out model selection and checking procedures in a range of practical situations;
7 carry out data analysis using multiple regression and related models in conjunction with a suitable computer language.
Discipline Specific Skills and Knowledge:
8 demonstrate understanding and appreciation of the mathematical modelling of stochastic phenomena and its usefulness;
9 demonstrate sufficient knowledge of fundamental ideas central to modern model-based statistics which are necessary to be able to progress to, and succeed in, further studies in statistical inference, statistical modelling of data and of stochastic modelling more generally.
Personal and Key Transferable/ Employment Skills and Knowledge:
10 demonstrate general data analysis skills and communicate associated reasoning and interpretations effectively in writing;
11 use relevant computer software competently;
12 demonstrate appropriate use of learning resources;
13 demonstrate self management and time management skills.
SYLLABUS PLAN - summary of the structure and academic content of the module
1 Introduction and revision
2 Likelihood inference
-
-The likelihood function
-
-Maximum likelihood estimates
-
-Numerical optimization in R
-
-Properties of estimators
-
-Properties of maximum likelihood estimators
-
-Likelihood ratio test
3 Normal Linear Model
-
-Model specification
-
-Parameter estimation and inference
-
-Model evaluation and selection
-
-ANOVA models
-
-Further topics: Gauss-Markov theorem, collinearity, variance stabilisation
-
-Out-of-sample predictive performance
4 Design of experiments
-
-Experimental designs
-
-Interactions
-
-Quantitative explanatory variables
-
-Simultaneous inference
-
-Robust or resistant statistical methods
-
-Sample size
-
-Manipulating levels
-
-Missing data
5 Nonparametric statistics
-
-Kernel density estimation
-
-Nonparametric tests
-
-Permuation and randomisation tests