Big Data Systems

2023-2024, Semester A

School of Computer Science
Tel Aviv University

Course Information

Course Staff

Course Goal

The course covers the architecture and design of modern big data systems from a data modeling and data management perspectives. Topics includes centralized vs distributed data systems, NoSQL and in particular wide-column database systems and Cassandra, storage strategies, denormalization modeling, stream processing, data warehouse and more.
The goal of the course is to provide the theoretical as well as the practical hands on knowledge required for designing and developing internet scale based data applications.

Course Format

The class meets once a week for a 3 hours lecture.
There will be 2-3 homework assignments (in pairs, some of which will involve programming) - 45% of the final grade.
Final exam - 55% of the final grade.

Course Requirements

  • Data structures (0368-2158) or Data structures and Algorithms (0512-2510)

Course Schedule and Slides

* tentative due to Iron Swords war

# Date Topics Material Notes
1 02.01.2024 Introduction Hello, World!
Introduction to Big Data
Introduction to Relational DB
2 09.01.2024 Relational DB SQL
Relational Data Integrity
MySQL CLI
3 16.01.2024 Relational data modeling Relational modeling
MySQL workbench
HW#1 distributed
- 23.01.2024 Reserve duty week
4 30.01.2024 Distributed DB, CAP theorem, NoSQL Introducrtion to Distributed DB
CAP theorm
NoSQL
5 06.02.2024 Dynamo Dynamo
Dynamo (Extra)
HW#1 due
6 13.02.2024 Bigtable Bigtable
7 20.02.2024 Cassandra - Intro Cassandra - Intro
- 27.02.2024 No class this week (local elections)
8 05.03.2024 Cassandra - Advanced Cassandra - CQL
Cassandra - Advanced
9 ---------- Cassandra - Hands on Astra DB
Cassandra - Java Driver
HW#2 distributed
- ----------
HW#2 due
10 12.03.2024 Data modeling in NoSQL Denormalization
Data Modeling in NoSQL - Intro
11 19.03.2024 Data modeling in NoSQL - Advanced
(Make-up class)
Data Modeling in NoSQL - Advanced
Data Modeling in NoSQL - Examples
HW#3 distributed
12 ---------- Data warehouse (BigQuery)
- 02.04.2024
HW#3 due
- 14.04.2024 Final Test