The book is based on Stanford Computer Science course CS246: Mining Massive Datasets (and CS345A: Data Mining).
The book, like the course, is designed at the undergraduate computer science level with no formal prerequisites. To support deeper explorations, most of the chapters are supplemented with further reading references.阅读更多.
Mining of Massive Datasets 中涵盖的技术和工具与 数据科学家 招聘广告中的要求最为相似。