6.S079 Software Systems for Data Science

Teaching Assistant, MIT, 2024

Teaching assistant for MIT 6.S079, Software Systems for Data Science, in Spring 2024. The course was taught by Professors Samuel Madden and Michael Cafarella.

The course surveys techniques and systems for ingesting, efficiently processing, analyzing, and visualizing large data sets. Topics include data cleaning, data integration, scalable systems (relational databases, NoSQL, Spark, etc.), analytics (data cubes, scalable statistics and machine learning), and scalable visualization.