Spark - Intermediate Data Science Course

Description

In this course, you’ll learn how to use Spark to work with big data and build machine learning models at scale, including how to wrangle and model massive datasets with PySpark, the Python library for interacting with Spark. In the first lesson, you will learn about big data and how Spark fits into the big data ecosystem. In lesson two, you will practice processing and cleaning datasets to get comfortable with Spark’s SQL and DataFrame APIs. In the third lesson, you will debug and optimize your Spark code when running on a cluster. In lesson four, you will use Spark’s Machine Learning Library to train machine learning models at scale.

This resource is offered by an affiliate partner. If you pay for training, we may earn a commission to support this site.

Career relevance by data role

The techniques and tools covered in Spark are most similar to the requirements found in Data Engineer job advertisements.

Similarity scores (out of 100)

Learning sequence

Spark is part of three structured learning paths.


17 Courses

Free Data Engineer
