This lesson plan will help you to teach Introductory Predictive Analysis through an Exploratory Data Analysis with Linear Regression assignment. The lesson plan includes a hands-on computer-based classroom activity to be conducted on a dataset of the annual temperature records of Mumbai – a coastal city in Western India, for the span of 1842 to 2019. This activity includes hands-on Python code, and a set of inquiry-based questions that will enable your students to apply their understanding of scatter plots, trendlines, moving averages, heatmaps, correlation coefficients, linear regression, and regression equations.
Thus, the use of this lesson plan allows you to integrate the teaching of a climate science topic with a core topic in Mathematics, Statistics, and Data Science.
The tools in this lesson plan will enable students to:
Grade Level | Undergraduate |
Discipline | Mathematics and Statistics, Earth Sciences |
Topic(s) in Discipline |
Introduction to Statistics, Data Science, Trend Analysis, Computer Programming, Recent Climate Change, Scatter Plots, Correlation Coefficients, Regression Equations, Linear Regression, OLS and LOWESS Trendlines, Heatmap |
Climate Topic |
Climate Variability Record |
Location | Global, Asia, India |
Language(s) | English |
Access | Online / Offline |
Approximate Time Required | 60-90 min |
Share | |
Resource Download |
Here is a step-by-step guide to using this lesson plan in the classroom/laboratory. We have suggested these steps as a possible plan of action. You may customize the lesson plan according to your preferences and requirements.
Teaching Module (25 min)
Video micro-lectures(15 mins)
Use the video micro-lecture, ‘Introduction to Simple Linear Regression by dataminingincae, INCAE Business School for a basic introduction to Simple Linear Regression and terms like dependant variable, independent variable, regression line, and regression coefficients.
Classroom/ Laboratory activity(30 min)
Part 1: Load and prepare the data for use
Part 2: Exploratory Data Analysis: Know your data
Part 3: Heatmaps for correlations
Now let the students create some visualizations to see feature correlations i.e. a heatmap. The first heatmap is of all 12 months and another one is the correlation between moving average columns of all seasons. This will help us determine the difference that will be made by taking moving averages.- The first heatmap gives us the interpretation that the months- February, May, September, October, November, and December contribute the most to the annual column and have a high positive correlation. Whereas the second heatmap shows a very high positive correlation in October-December moving average with the Annual temperature column. Thus oct-dec column contributes the most to the annual column which means that the rise in overall temperature in the annual column can be seen more because of the oct-dec column i.e. winter season rather than the summer season months. None of the columns have a negative correlation amongst themselves which means no inverse correlation exists amongst any of the columns.
Part 4:Trendlines
Part 5:Linear Regression for Predictions
4. Encourage your students to answer topical questions by applying their understanding of scatter plots, correlation coefficients, heatmap, moving averages, trendlines, and linear regression.
5. Use the regression analyses performed to initiate a discussion on the increase in temperatures from 1980 to 2020 due to anthropogenic activities, which is one major reason behind global climate change.
Use this lesson plan to help your students find answers to:
5Python codeAshan Virdikar and Naveen Anupoju, India
1 | Teaching Module, “Introduction- Linear Regression and Correlation” | Provided by OpenStaxTM, Rice University |
2 | Teaching Module, “Chapter 3: Linear Regression” | Provided by Ramesh Sridharan, MIT from ‘Statistics for Research Projects’ |
3 | Video micro-lecture, ‘Introduction to Simple Linear Regression’’ | by dataminingincae, INCAE Business School |
4 | Dataset of Mumbai temperature records’ | Colaba Meteorological Station, Mumbai, India |
Here is a step-by-step guide to using this lesson plan in the classroom/laboratory. We have suggested these steps as a possible plan of action. You may customize the lesson plan according to your preferences and requirements.
Teaching Module (25 min)
Video micro-lectures(15 mins)
Use the video micro-lecture, ‘Introduction to Simple Linear Regression by dataminingincae, INCAE Business School for a basic introduction to Simple Linear Regression and terms like dependant variable, independent variable, regression line, and regression coefficients.
Classroom/ Laboratory activity(30 min)
Part 1: Load and prepare the data for use
Part 2: Exploratory Data Analysis: Know your data
Part 3: Heatmaps for correlations
Now let the students create some visualizations to see feature correlations i.e. a heatmap. The first heatmap is of all 12 months and another one is the correlation between moving average columns of all seasons. This will help us determine the difference that will be made by taking moving averages.- The first heatmap gives us the interpretation that the months- February, May, September, October, November, and December contribute the most to the annual column and have a high positive correlation. Whereas the second heatmap shows a very high positive correlation in October-December moving average with the Annual temperature column. Thus oct-dec column contributes the most to the annual column which means that the rise in overall temperature in the annual column can be seen more because of the oct-dec column i.e. winter season rather than the summer season months. None of the columns have a negative correlation amongst themselves which means no inverse correlation exists amongst any of the columns.
Part 4:Trendlines
Part 5:Linear Regression for Predictions
4. Encourage your students to answer topical questions by applying their understanding of scatter plots, correlation coefficients, heatmap, moving averages, trendlines, and linear regression.
5. Use the regression analyses performed to initiate a discussion on the increase in temperatures from 1980 to 2020 due to anthropogenic activities, which is one major reason behind global climate change.
Use this lesson plan to help your students find answers to:
5Python codeAshan Virdikar and Naveen Anupoju, India
1 | Teaching Module, “Introduction- Linear Regression and Correlation” | Provided by OpenStaxTM, Rice University |
2 | Teaching Module, “Chapter 3: Linear Regression” | Provided by Ramesh Sridharan, MIT from ‘Statistics for Research Projects’ |
3 | Video micro-lecture, ‘Introduction to Simple Linear Regression’’ | by dataminingincae, INCAE Business School |
4 | Dataset of Mumbai temperature records’ | Colaba Meteorological Station, Mumbai, India |
All maps & pedagogical tools are owned by the corresponding creators, authors or organizations as listed on their websites. Please view the individual copyright and ownership details for each tool using the links provided. We do not claim ownership of or responsibility or liability for any of these tools. Images copyrights remain with the respective owners.
TROP ICSU is a project of the International Union of Biological Sciences and Centre for Sustainability, Environment and Climate Change, FLAME University.