The book is based on Stanford Computer Science course CS246: Mining Massive Datasets (and CS345A: Data Mining).
The book, like the course, is designed at the undergraduate computer science level with no formal prerequisites. To support deeper explorations, most of the chapters are supplemented with further reading references.Read more.
The techniques and tools covered in Mining of Massive Datasets are most similar to the requirements found in Data Scientist job advertisements.