The course covers the architecture and design of modern big data systems from a data modeling and data management perspectives. Topics includes centralized vs distributed data systems, NoSQL and in particular wide-column database systems and Cassandra, storage strategies, denormalization modeling, stream processing, data warehouse and more.
The goal of the course is to provide the theoretical as well as the practical hands on knowledge required for designing and developing internet scale based data applications.
The class meets once a week for a 3 hours lecture.
There will be 2-3 homework assignments (in pairs, some of which will involve programming) - 45% of the final grade.
Final exam - 55% of the final grade.
* tentative due to Iron Swords war
# | Date | Topics | Material | Notes |
---|---|---|---|---|
1 | 02.01.2024 | Introduction |
Hello, World! Introduction to Big Data Introduction to Relational DB |
|
2 | 09.01.2024 | Relational DB |
SQL Relational Data Integrity MySQL CLI |
|
3 | 16.01.2024 | Relational data modeling |
Relational modeling MySQL workbench |
HW#1 distributed |
- | 23.01.2024 | Reserve duty week | ||
4 | 30.01.2024 | Distributed DB, CAP theorem, NoSQL |
Introducrtion to Distributed DB CAP theorm NoSQL |
|
5 | 06.02.2024 | Dynamo |
Dynamo Dynamo (Extra) |
HW#1 due |
6 | 13.02.2024 | Bigtable | Bigtable | |
7 | 20.02.2024 | Cassandra - Intro | Cassandra - Intro | |
- | 27.02.2024 | No class this week (local elections) | ||
8 | 05.03.2024 | Cassandra - Advanced |
Cassandra - CQL Cassandra - Advanced |
|
---------- | Cassandra - Java Driver |
|||
- | ---------- | |||
10 | 12.03.2024 | Data modeling in NoSQL |
Denormalization Data Modeling in NoSQL - Intro |
|
11 | 19.03.2024 | Data modeling in NoSQL - Advanced (Make-up class) |
Data Modeling in NoSQL - Advanced Data Modeling in NoSQL - Examples |
HW#3 distributed |
---------- | ||||
- | 02.04.2024 | HW#3 due | ||
- | 14.04.2024 | Final Test |