Applications of Data Science and Statistics - 2024 entry
MODULE TITLE | Applications of Data Science and Statistics | CREDIT VALUE | 15 |
---|---|---|---|
MODULE CODE | MTHM503 | MODULE CONVENER | Dr Victoria Volodina (Coordinator) |
DURATION: TERM | 1 | 2 | 3 |
---|---|---|---|
DURATION: WEEKS | 0 | 11 | 0 |
Number of Students Taking Module (anticipated) | 15 |
---|
This module will enable you to learn new Data Science and Statistical methods, and to use the techniques learnt in other modules, by working on analyses of real data examples. There will be a strong emphasis throughout on understanding the practical application of statistical and machine learning methods including clustering, data reduction, methods for handling missing data, study design and introductory methods for time series data. Theory and ideas will be developed to allow the implementation of methods in examples drawn from industry, medicine, finance, public health and environmental challenges, including climate change and air pollution.
Pre-requisites: None
The aim of this module is to practise the use of Data Science and Statistical modelling by working through a series of case studies. The case studies will be based on real-life problems and will start with a description of the setting of the problem and the intended outcomes. An important part of any statistical analysis is understanding the background to the problem, so each case study will include a review of the field in which it is set. Analyses will start with raw data that will have to be sense-checked and manipulated into a form that is suitable for the intended analyses. Deciding on the exact form of the analyses in each case will be a central focus of this module. An important aim is to develop the skills to make these decisions, drawing on information from the setting, the exact nature of the problem being assessed and knowledge of the techniques and methods that are available. In each case study, the results of the chosen analyses will be interpreted, with particular attention given to the best way of communicating the results to a variety of technical and non-technical audiences.
Activities will include problem formulation, knowledge discovery, regression modelling, machine learning, report writing and presentation. Assessment will be based on an examination and practical analyses of real-world data.
On successful completion of this module you should be able to:
Module Specific Skills and Knowledge
2. Apply new techniques learnt through case studies to other datasets to answer questions in other applications
Discipline Specific Skills and Knowledge
Personal and Key Transferable / Employment Skills and Knowledge
8. Use R/RStudio and other software to manipulate and summarise data
Data Science and Statistical modelling topics will be introduced through their application in a series of case studies. Case studies may change each year, but the initial selection will include:
· Case study: modelling environmental hazards
· Case study: clustering and segmentation of customers
· Case study: forecasting electricity demands
· Case study: modelling the effects of air pollution on health
· Case study: mapping rates of disease
· Case study: exploring physical activity data for health
· Case study: using local sources of data to address local challenges
Scheduled Learning & Teaching Activities | 36 | Guided Independent Study | 114 | Placement / Study Abroad | 0 |
---|
Category | Hours of study time | Description |
Scheduled learning and teaching | 24 | Lectures |
Scheduled learning and teaching | 12 | Hands-on practical sessions |
Guided Independent Study | 50 | Self study & background reading |
Guided Independent Study | 64 | Assessed data analyses, report writing. |
Form of Assessment | Size of Assessment (e.g. duration/length) | ILOs Assessed | Feedback Method |
---|---|---|---|
Feedback on unassessed data analysis examples (which will include report writing) | 24 | All | Oral |
Coursework | 80 | Written Exams | 20 | Practical Exams | 0 |
---|
Form of Assessment | % of Credit | Size of Assessment (e.g. duration/length) | ILOs Assessed | Feedback Method |
---|---|---|---|---|
Coursework – extended piece of data analysis involving data collection, analysis and reporting | 80 | Max 10 pages (plus appendixes) | All | Oral & Written |
Class test | 20 | 1 hour | All | Oral & Written |
Original Form of Assessment | Form of Re-assessment | ILOs Re-assessed | Time Scale for Re-assessment |
---|---|---|---|
Extended data analysis* | Extended data analysis (80%) | All | August Ref/Def Period |
Class test* | Class test (20%) | All | August Ref/Def Period |
*Please refer to reassessment notes for details on deferral vs. referral reassessment.
Deferrals: Reassessment will be by coursework and/or exam in the deferred element only. For deferred candidates, the module mark will be uncapped.
Referrals: Reassessment will be by a single piece of coursework worth 100% of the module only. As it is a referral, the mark will be capped at 50%.
information that you are expected to consult. Further guidance will be provided by the Module Convener
Reading list for this module:
Type | Author | Title | Edition | Publisher | Year | ISBN |
---|---|---|---|---|---|---|
Set | James, G., Witten, D., Hastie, T., Tibshirani, R. | An Introduction to Statistical Learning: with Applications in R | | Springer | 2013 | 978-1461471370 |
Set | Lantz, B. | Machine Learning with R: Expert Techniques for Predictive Modeling | 3rd | Packt | 2019 | 978-1788295864 |
CREDIT VALUE | 15 | ECTS VALUE | 7.5 |
---|---|---|---|
PRE-REQUISITE MODULES | None |
---|---|
CO-REQUISITE MODULES | None |
NQF LEVEL (FHEQ) | 7 | AVAILABLE AS DISTANCE LEARNING | No |
---|---|---|---|
ORIGIN DATE | Tuesday 12th March 2024 | LAST REVISION DATE | Tuesday 12th March 2024 |
KEY WORDS SEARCH | None Defined |
---|
Please note that all modules are subject to change. Please get in touch if you have any questions about this module.