The book is based on Stanford Computer Science course CS246: Mining Massive Datasets (and CS345A: Data Mining).
The book, like the course, is designed at the undergraduate computer science level with no formal prerequisites. To support deeper explorations, most of the chapters are supplemented with further reading references.Lee mas.
Las técnicas y herramientas cubiertas en Mining of Massive Datasets son muy similares a los requisitos que se encuentran en los anuncios de trabajo de Científico de datos.