skip to page content
95-869 Big Data and Large Scale Computing
Spring 2024

Home
Syllabus
Assignments
Notes

Assignments

COURSEWORK:

Coursework consist of 5 homework assignments, 1 final exam, and after-class quizzes that will determine your class participation (grading in parentheses):

IMPORTANT DATES:

Assignment Note Out Due Weight
Homework 0
Installation, Set up
Week 1, Tue
--
0%
Homework 1
pySpark and RDDs
Week 2, Thu
Week 3, Thu
11%
Homework 2
Regression in Spark
Week 3, Thu
Week 4, Thu
12%
Homework 3
Classification in Spark
Week 4, Thu
Week 5, Thu
12%
Homework 4
Data Analysis with PCA in Spark
Week 5, Thu
Week 6, Thu
12%
Homework 5
Hands-on with ML-lib and SparkSQL
Week 6, Thu
Week 7, Thu
13%
Progress
Quizzes (posted on Canvas)
after each class
within 2 days
15%
Final Exam
You are allowed to bring a single, A4-size 'cheat' sheet with your notes
See here for date,
time,
place
--
25%

HOMEWORK:

The goal of the homework is to enable the students to practice the concepts learned in class using real-world datasets.

  • ASSIGNMENTS ARE DUE AT 11:59 PM OF THE DUE DATE.
  • All assignments are to be done individually. Please see the collaboration policy.
  • Submission (only electronically):
    • Submit all of your source files on Gradescope.
    • Also submit your print out/pdf with answers on Gradescope.
    • Make sure that your answers are legible and coding is clear.
    • See course policies for assignment questions, late submissions, graded homework pick-up.


EXAM:

There will be a final exam. It will be closed everything -- no books, slides, computers, etc. You will only be allowed to bring along with you one (1) A4-size paper of your own notes ('cheat sheet'), you can use both sides. The final date will be announced during the semester.

CLASS PARTICIPATION:

Student progress and participation will be quantified via quizzes. Each quiz will be a list of multiple-choice questions, to be posted on Canvas after each class -- with a due date and time.

There will be a total of 10-12 such quizzes during the semester, 1 point each. We will select the highest-scoring 10 out of those for each student, for a total of 15% of the final grade. Students who attempt all the questions in the quiz will get 0.5 point even if they answer all these questions wrong. Students who do not attempt to answer any questions in a quiz will receive 0 points.