John M. Noble
Mathematical Statistics
Institute of Applied Mathematics
University of Warsaw
October 2023 - January 2024
Multivariate Statistics
Course Information
Language: English
Type of course: elective
Place and Time
There will be 14 lectures and 14 tutorials, all held on Mondays: the lecture runs 08.30 - 10.00 (room 5060) and the tutorial 10.15 - 11.45 (room 2044, computer lab). The dates are:
October 2023 2nd, 9th, 16th, 23rd
November 2023 6th, 13th, 20th, 27th
December 2023 4th, 11th, 18th
January 2024 8th, 15th, 22nd
Note: NO CLASS ON MONDAY 30th OCTOBER: THIS DAY RUNS ACCORDING TO THE SCHEDULE FOR EVEN FRIDAYS
Description
The course ‘Multivariate Statistics’ is a Master's-level course that presents some statistical theory, with applications in R.
The topics covered are:
- Nonparametric Density Estimation (histograms, kernel methods, projection pursuit)
- Multiple regression: model assessment and selection, shrinkage methods (e.g. LASSO)
- Linear Dimensionality Reduction: Principal Component Analysis, Canonical Correlation Analysis, Projection Pursuit
- Linear Discriminant Analysis
- Recursive Partitioning and Tree-based Methods
- Artificial Neural Networks
- Support Vector Machines
- Clustering techniques: hierarchical and non-hierarchical partitioning methods, self-organising maps (SOM), clustering variables, clustering based on mixture models (the EM algorithm as a tool for clustering and semi-supervised learning)
- Multidimensional Scaling and Distance Geometry
- Committee Machines, bagging and boosting, random forests
- Latent Variable Models for Blind Source Separation
- Nonlinear Dimensionality Reduction and Manifold Learning
- Correspondence Analysis
- The multivariate Gaussian distribution, parameter estimation, the Wishart distribution
Introduction to R
You should learn some R programming during the course.
A reasonable introduction may be found here.
Assessment
Assessment is based on:
- two data analysis assignments
- a take-home written exam
- tutorial participation
Lecture and Tutorial Notes
These will be placed here throughout the course.
Data Files
Click here for the data directory.
(Last updated: 15th January 2024 by John M. Noble)