Switch to English Site

描述

In this course, you’ll learn how to use Spark to work with big data and build machine learning models at scale, including how to wrangle and model massive datasets with PySpark, the Python library for interacting with Spark. In the first lesson, you will learn about big data and how Spark fits into the big data ecosystem. In lesson two, you will be practicing processing and cleaning datasets to get comfortable with Spark’s SQL and dataframe APIs. In the third lesson, you will debug and optimize your Spark code when running on a cluster. In lesson four, you will use Spark’s Machine Learning Library to train machine learning models at scale.阅读更多.

此资源由附属合作伙伴提供。 如果您支付培训费用,我们可能会赚取佣金来支持该网站。

按照数据工作岗位排列职业相关性

Spark 中涵盖的技术和工具与 数据工程师 招聘广告中的要求最为相似。

相似度得分(满分 100)

学习顺序

Spark is a part of 三 structured learning paths.

None
DataKwery

17 Courses

Free Data Engineer

None
DataKwery
None
DataKwery