Mining Massive Datasets With Mapreduce

Host University

George Mason University

Semester

Fall 2024

Course Number

CS 657 DL1

Credits

3

Discipline

Computer Science

Instructor

Barbara, Daniel (dbarbara@gmu.edu)

Times and Days

4:30 pm - 7:10 pm

R

Course Information

Covers the techniques to mine large datasets, including Distributed File Systems and Map-Reduce, similarity search, and data stream processing. Covers classic problems in data mining, such as clustering, association rule mining, and others from the point of view of scalability. Includes a final project to exercise concepts covered in class. Offered by Computer Science. May not be repeated for credit.

Prerequisites

CS 584 or 584