Covers techniques for mining large datasets, including distributed file systems and MapReduce, similarity search, and data stream processing. Examines classic data mining problems, such as clustering and association rule mining, from the point of view of scalability. Includes a final project to exercise concepts covered in class. Offered by Computer Science. May not be repeated for credit.
Mining Massive Datasets with MapReduce
Host University
George Mason University
Semester
Fall 2024
Course Number
CS 657 DL1
Credits
3
Discipline
Computer Science
Instructor
Barbara, Daniel (dbarbara@gmu.edu)
Times and Days
4:30 pm - 7:10 pm
R (Thursday)
Course Information
Prerequisites
CS 584 or 584