In this module we will study how linear regression can be used to describe and analyse the relationship between explanatory variables $x_1, \ldots, x_n$ (input) and a response variable $y$ (output). The models we will consider are of the form
There will be 27 lectures (L1 to L27) and 6 example classes (E1 - E6). The schedule is given in the following table.
The following links contain pdf copies of the handouts from the lectures.
Paper copies of the handouts are ususally available from the blue drawers in front of the taught students office on level 8 of the maths building.
For the module we will use the statistical computing package R. This program is free software, and you can find the program and documentation at the R project homepage.
My recommendation would be to install the RStudio environment, which includes R, on your own computer and use this for the project. (Choose the open source version, "RStudio Desktop", on the download page.) Alternatively you can use RStudio or plain R on the university computers.
Below you can find the RStudio notebook files from the tutorials. I would recommend to download the "RStudio Notebook" to your own computer and to experiment with it in RStudio yourself (right click on the link and choose "Save link as …"); there is also a non-interactive "HTML version" which you can look at.
Useful resources for learning R include to following:
The following data sets were used in the module.