95-869 Big Data and Large Scale Computing
Spring 2018

Assignments

COURSEWORK:

Coursework consists of 5 homework assignments, 1 final exam, and in-class pop-up quizzes that determine your class participation grade (grading weights are listed under IMPORTANT DATES).

IMPORTANT DATES:

| Assignment | Note | Out | Due | Weight |
|---|---|---|---|---|
| Homework 0 | Installation, Set up | 3/26 | -- | 0% |
| Homework 1 | pySpark and RDDs | 3/28 | 4/8 | 7% |
| Homework 2 | Regression in Spark | 4/8 | 4/15 | 12% |
| Homework 3 | Classification in Spark | 4/15 | 4/22 | 12% |
| Homework 4 | Data Analysis with PCA in Spark | 4/22 | 4/29 | 12% |
| Homework 5 | Hands-on with ML-lib and SparkSQL | 4/30 | 5/3 | 12% |
| Final Exam | You are allowed to bring a single, A4-size 'cheat' sheet with your notes | See here | -- | 35% |
| Class participation | Pop-quizzes (any time during class) | -- | -- | 10% |
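
As a quick sanity check on the weights listed above, the following minimal Python sketch confirms that they add up to 100% and shows one way a weighted final grade could be computed. The function name `final_grade` and the input format (each component scored as a fraction between 0 and 1) are assumptions made for illustration; this is not the instructors' official grading script.

```python
# Weights (percent of the final grade) copied from the table above.
WEIGHTS = {
    "Homework 0": 0,
    "Homework 1": 7,
    "Homework 2": 12,
    "Homework 3": 12,
    "Homework 4": 12,
    "Homework 5": 12,
    "Final Exam": 35,
    "Class participation": 10,
}


def final_grade(scores):
    """Return the weighted grade in percent.

    `scores` maps a component name to the fraction earned in [0, 1].
    This input format is hypothetical; missing components count as 0.
    """
    return sum(weight * scores.get(name, 0.0) for name, weight in WEIGHTS.items())


# The listed weights cover the whole grade.
assert sum(WEIGHTS.values()) == 100
```

For example, a student who earns full marks on every component except a half-credit final exam would end up at 100 - 35 * 0.5 = 82.5 under this sketch.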

HOMEWORK:

The goal of the homework is to enable the students to practice the concepts learned in class using real-world datasets.

  • ASSIGNMENTS ARE DUE AT 5 PM ON THE DUE DATE.
  • All assignments are to be done individually. Please see the collaboration policy.
  • Submission (only electronically):
    • Submit all of your source files on Canvas.
    • Also submit your printout/PDF with answers on Canvas.
    • Make sure that your answers are legible and your code is clear.
    • See course policies for assignment questions, late submissions, and graded homework pick-up.


EXAM:

There will be a final exam. It will be fully closed: no books, slides, computers, etc. You may bring only one A4-size sheet of your own notes (a 'cheat sheet'); you may use both sides. The final exam date will be announced during the semester.

CLASS PARTICIPATION:

Attendance will be quantified via pop-up quizzes in class. Each quiz will be a short-answer question, with 2-3 minutes to answer. A quiz can be given at any time during class: at the beginning, at the end, or anywhere in between.
There will be 15-18 such quizzes, worth 1 point each. For each student, we will count the 10 highest-scoring quizzes, for a total of 10% of the final grade. Students who attend class will get 0.5 points even if they return a blank answer. Partial answers will earn 0.75 points, and correct answers will earn 1 point. Students who miss class, and hence the quiz, will receive 0 points.
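
The best-10 rule can be illustrated with a short Python sketch. The function name `participation_points` and the example scores are hypothetical, chosen only to show the arithmetic; the per-quiz values (0, 0.5, 0.75, 1) come from the policy above.

```python
def participation_points(quiz_scores, best_n=10):
    """Sum the `best_n` highest quiz scores.

    Each score is assumed to be one of:
      0.0  (missed class), 0.5 (attended, blank answer),
      0.75 (partial answer), 1.0 (correct answer).
    """
    top_scores = sorted(quiz_scores, reverse=True)[:best_n]
    return sum(top_scores)


# Hypothetical student over 16 quizzes:
# 8 correct, 4 partial, 2 blank-but-present, 2 missed.
example_scores = [1.0] * 8 + [0.75] * 4 + [0.5] * 2 + [0.0] * 2

# The 10 best are the 8 correct answers plus 2 partial answers.
print(participation_points(example_scores))  # 8 * 1.0 + 2 * 0.75 = 9.5
```

Because only the 10 best quizzes count, this hypothetical student's two missed classes and two blank answers do not lower the result; the 9.5 points translate to 9.5 of the 10 percentage points available for participation.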