Course Overview
This course will take a practical approach to solving challenges in the public and private sectors using data analytics. A number of different themes will be explored as case studies in order to demonstrate how data-driven decision-making has widespread applications. The course will examine how the question being posed, the available data and the selected modelling approach all come together to arrive at a feasible solution. A range of quantitative techniques, involving both linear and nonlinear methods will be presented for dealing with numerical structured datasets. Substantial emphasis will be placed on the process of delivering data analytics via a dashboard to facilitate decision-making and policy-making. The course content will be structured to provide a roadmap for carrying out the necessary procedures and will be illustrated using case studies, reading material and previously published models. Participants will obtain hands-on experience by working on specific challenges with real-world data through a carefully structured set of assignments.
Course Syllabus
The full course syllabus contains additional information and can be found in PDF format HERE.Outcomes
After completing this course, students should be able to:
- Design a data analytics project in response to a specific challenge
- Download and organize data for addressing the challenge
- Explore the dataset using visualization techniques
- Apply a range of quantitative techniques
- Discuss the advantages and disadvantages of different models
- Select an approach that is optimal for meeting the objective
- Present conclusions and recommendations
- Communicate model output to decision-makers
Textbooks
There will be no required textbooks, though we suggest the following to help you to study (all available online):- (ESL): Elements of Statistical Learning Trevor Hastie, Robert Tibshirani and Jerome Friedman.
- (TM): Machine Learning, Tom Mitchell.
- (PRML): Pattern Recognition and Machine Learning, Christopher M. Bishop
- (MJ): Machine Learning Algorithms, The SAS Data Science Blog
Piazza
We will use Piazza for class discussions. Please go to the course Piazza site to join the course forum (note: you must use a cmu.edu email account to join the forum). We strongly encourage students to post on this forum rather than emailing the course staff directly (this will be more efficient for both students and staff). Students should use Piazza to:
- Ask clarifying questions about the course material.
- Share useful resources with classmates (so long as they do not contain homework solutions).
- Look for students to form study groups.
- Answer questions posted by other students to solidify your own understanding of the material.
Academic Integrity
The course Academic Integrity Policy must be followed when doing assignments and on the message boards at all times. Details on ECE's Academic Integrity Policy can be found in the course syllabus and HERE.Grading
The grades for this course will be based on students’ performance on seven homework assignments, a final exam and class participation. Homework assignments will be done individually and turned in via Canvas by the designated due date. Late work will be acceptable until 24 hours past the deadline, but it will lose 10%. The assignments will be graded based on both a writing report and code used to achieve results presented in the report. Class participation will be evaluated based on student’s contribution to discussions both in-class and on the Piazza Discussion Board. When posting or reacting to online discussion threads, students are expected to use their own words and the post should be relevant to the topic under discussion. Make sure to introduce, summarize and explain the article in your own words to enlighten the audience on the point the article is making.
The following is the weight distribution of the grades:
Class participation 5%
Kahoot Quizzes 2.5%
Piazza Participation 2.5%
Homework Assignment 1 20%
Homework Assignment 2 25%
Homework Assignment 3 30%
Final Exam (Multiple Choice) 15%
Teaching Assistants Contact Info
Name |
Office Hours |
Zoom Meeting ID |
|
Christian Iradukunda |
Tuesday 6:00PM - 8:00PM CAT |
||
Isaac Coffie |
Wednesday 6:00PM - 8:00PM CAT |
||
Akem Aristide Tanyi-Jong |
Thursday 8:00PM - 10:00PM CAT |
||
Eliphaz Niyodusenga |
Friday 8:00PM - 10:00PM CAT |
||
Innocent Mukoki |
Sunday 8:00PM - 10:00PM CAT |
Schedule (Subject to Change)
Date | Topics | Slides | Assignment |
---|---|---|---|
Tues, Feb 02 | Data Analytics | [Week 1A] | Assignment 1 released |
Thurs, Feb 04 | Weather forecasting | [Week 1] | |
Fri, Feb 05 | Recitation | ||
Tues, Feb 09 | Renewable energy | [Week 2A] | |
Thurs, Feb 11 | Wind energy | [Week 2] | |
Fri, Feb 12 | Recitation | ||
Mon, Feb 15 | Assignment 1 due | ||
Tues, Feb 16 | Solar energy | [Week 3A] | Assignment 2 released |
Thurs, Feb 18 | Demand forecasting | [Week 3] | |
Fri, Feb 19 | Risk | [Week 4A] | |
Tues, Feb 23 | Recitation | Assignment 1 grades released | |
Thurs, Feb 25 | Extreme events | [Week 4] | |
Fri, Feb 26 | Recitation | ||
Mon, Mar 01 | Assignment 2 due | ||
Tues, Mar 02 | Health | [Week 5A] | Assignment 3 released |
Thurs, Mar 04 | Biomedicine | [Week 5] | |
Fri, Mar 05 | Recitation | ||
Tues, Mar 09 | Early warning systems | [Week 6A] | Assignment 2 grades released |
Thurs, Mar 11 | Economic forecasting | [Week 6] | |
Fri, Mar 12 | Recitation | ||
Mon, Mar 15 | Assignment 3 due | ||
Tues, Mar 16 | Final Exam | Final exam grades released | |
Fri, Mar 19 | Assignment 3 grades released |