Fundamentals of Scalable Data Science

Description

Apache Spark is the de-facto standard for large scale data processing. This is the first course of a series of courses towards the IBM Advanced Data Science Specialization. We strongly believe that is is crucial for success to start learning a scalable data science platform since memory and CPU constraints are to most limiting factors when it comes to building advanced machine learning models.

Read more.

Career Relevance by Data Role

The techniques and tools covered in Fundamentals of Scalable Data Science are most similar to the requirements found in Data Scientist job advertisements.


Similarity Scores (Out of 100)

Subscribe for updates and new courses
Or create a DataKwery.com account
Fast Facts

Tools
Apache SparkJupyterPySparkPythonSQLWatson

Techniques
Applied MathematicsBig DataData AnalysisData ProcessingData ScienceData VisualizationDimension ReductionExploratory Data AnalysisFeature SelectionMachine LearningProgrammingStatistical Analysis

Similar Opportunities
Distributed Computing with Spark SQL

Coursera - University of California, Davis

Big Data Analytics Using Spark

edX - University of California, San Diego

Big Data Analysis with Apache Spark

edX - University of California, Berkeley

Introduction to Apache Spark

edX - University of California, Berkeley