Schedule

Please note that this schedule is subject to change.

Calendar

Date Topic Readings Assignments
1/18 Introduction
1/23 Python McKinney, Chs. 1 & 2
1/25 Python McKinney, Chs. 3 A1
1/30 Relational Databases
2/01 Relational Databases
2/06 Structured Data McKinney, Chs. 4 A2
2/08 Data and Pandas McKinney, Chs. 5 & 6
2/13 Pandas and DuckDB
2/15 Data Wrangling Kandel et al.
2/20 Data Wrangling Jin et al.
2/22 Data Transformation Yan & He A3
2/27 Test 1
3/01 Data Cleaning Rekatsinas et al.
3/06 Data Integration Stonebraker & Ilyas
3/08 Data Fusion Dong et al.
3/13 No Class
3/15 No Class
3/20 Scalable Databases Gessert et al.
3/22 Scalable Databases Pavlo & Aslett A4
3/27 Scalable Dataframes Petersohn et al.
3/29 Scalable Dataframes Jindal et al.
4/03 Time Series Data Pelkonen et al.
4/05 Graph Data Sahu et al.
4/10 Test 2
4/12 Databases and Visualization Moritz et al. A5
4/17 Spatial Data Eldawy et al.
4/19 Data Curation Wilkinson et al.
4/24 Provenance Chapman et al.
4/26 Reproducibility Collberg & Proebsting
5/01 Databases and Machine Learning Kraska et al.
5/03 Review
5/10 Final Exam (8-9:50am)

Lectures

(01/18) Introduction
(01/23) Python
(01/25) Python
(01/30) Databases
(02/01) Relational Databases
(02/06) Structured Data
(02/08) Data and Pandas
(02/13) Pandas and DuckDB
(02/15) Data Wrangling
(02/20) Data Wrangling
(02/22) Data Transformation
(03/01) Data Cleaning
(03/06) Data Integration
(03/08) Data Fusion
(03/20) Scalable Databases
(03/22) Scalable Databases
(03/27) Scalable Dataframes
(03/29) Scalable Dataframes
(04/03) Time Series Data
(04/05) Graph Data
(04/12) Databases and Visualization
(04/17) Spatial Data
(04/19) Data Curation
(04/24) Provenance
(04/26) Reproducibility
(05/01) Databases and Machine Learning
(05/03) Review