Skip to main content
 

MATH42515: Data Exploration, Visualization, and Unsupervised Learning

It is possible that changes to modules or programmes might need to be made during the academic year, in response to the impact of Covid-19 and/or any further changes in public health advice.

Type Tied
Level 4
Credits 15
Availability Available in 2024/2025
Module Cap None.
Location Durham and Queen's Campus Stockton
Department Mathematical Sciences

Prerequisites

  • None

Corequisites

  • None

Excluded Combinations of Modules

  • None

Aims

  • To introduce the concepts and methods of exploratory data analysis, data visualization, and unsupervised learning.

Content

  • Advanced exploratory data analysis.
  • Density estimation and data visualization.
  • Unsupervised learning and clustering.
  • Principal component analysis (PCA) and dimension reduction.
  • Data visualization and statistical computing with R.
  • Methods for non-numerical data: e.g. categorical, spatial and temporal, text, images, networks, graphs.
  • Further topics: e.g. anomaly detection, treatment of missing values, association rules.

Learning Outcomes

Subject-specific Knowledge:

  • By the end of the module students will:
  • Have a systematic and coherent understanding of the theory, computation, and application of the topics studied;
  • Have mastered advanced exploratory data analysis, data visualization, and statistical computing with R;
  • Have acquired a coherent body of applicable knowledge on density estimation, unsupervised learning, clustering, PCA and dimension reduction.

Subject-specific Skills:

  • In addition, students will have acquired:
  • The ability to use statistical software R to conduct synthesis of data, data analysis, and date visualization;
  • Programming skills generally used in advanced methods such as clustering and unsupervised learning;
  • The ability to identify and apply appropriate unsupervised learning methods to modern real-world problems.

Key Skills:

  • Sufficient mastery of advanced data analysis, data visualization and unsupervised learning methods and ability to apply them appropriately to real-world applications.
  • The ability to clearly communicate statistical methods and relevant conclusions through writing.
  • The ability to organise prioritise, and manage time effectively.
  • The ability to advance and extend their knowledge through significant independent learning and research.

Modes of Teaching, Learning and Assessment and how these contribute to the learning outcomes of the module

  • This module will be delivered by the Department of Mathematical Sciences.
  • Teaching will be delivered primarily by workshops, lectures, and practicals.
  • Lectures demonstrate what is required to be learned and the application of the theory to practical examples.
  • Workshops describe theory and its application to concrete examples, enable students to test and develop their understanding of the material by applying it to practical problems, and provide feedback and encourage active engagement.
  • Workshops are a combination of taught material, practicals, problem classes, tutorials and guided group work.
  • Practicals consolidate the studied material, explore theoretical ideas in practice, enhance practical understanding, and develop practical data analysis skills.
  • Lectures, workshops, and computer practicas will be supported by the distribution of materials such as video content, directed reading, e-assessments, reflective activities, opportunities for self-assessment, and peer-to-peer learning within a tutor-facilitated discussion board.
  • Students will be able to obtain further help in their studies via scheduled office hours as well as by approaching their lecturers by email.
  • Students will be expected to work in between workshops and lectures, and to discuss their own work during the workshops. This work will be guided by the module leader, but will be organised by the students themselves, thereby enabling them to demonstrate their time management skills.
  • Students will undertake independent research to further their knowledge of the topic and self-directed learning to further their technical and transferable skills.
  • The workshops also provide opportunities for module leaders to monitor progress and to provide feedback and guidance on the development of ideas for the project, and for students to gauge their progress throughout the duration of the module.
  • Student performance will be assessed through two individual assignments.
  • The assignments will provide the means for students to demonstrate their acquisition of subject knowledge and the development of their problem-solving skills.

Teaching Methods and Learning Hours

ActivityNumberFrequencyDurationTotalMonitored
Lectures162 times per week (Term 2, weeks 1-4, 6-9)1 hour16 
Workshops81 time per week (Term 2, weeks 1-4, 6-9)1 hour8 
Practicals81 time per week (Term 2, weeks 1-4, 6-9)1 hour8Yes
Preparation, exercises, and reading118 
Total150 

Summative Assessment

Component: AssignmentComponent Weighting: 100%
ElementLength / DurationElement WeightingResit Opportunity
Assignment 130Yes
Assignment 270Yes

Formative Assessment

Workshop discussion of students' ideas and experiences; informal discussions of student progress with module leader when necessary.

More information

If you have a question about Durham's modular degree programmes, please visit our Help page. If you have a question about modular programmes that is not covered by the Help page, or a query about the on-line Postgraduate Module Handbook, please contact us.

Prospective Students: If you have a query about a specific module or degree programme, please Ask Us.

Current Students: Please contact your department.