Data Cleaning

Data cleansing or data cleaning is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database and refers to identifying incomplete, incorrect, inaccurate or irrelevant parts of the data and then replacing, modifying, or deleting the dirty or coarse data. Data cleansing may be performed interactively with data wrangling tools, or as batch processing through scripting.

Job Relevance

Percent of recent job postings including Data Cleaning

Related Training

34 learning opportunities

Follow
Logo for Kaggle

Online Learning

Data Cleaning

Master efficient workflows for cleaning real-world, messy data.

By Rachael Tatman

Follow
Logo for DataCamp

Online Learning

Data Modeling in Power BI

Proper data modeling is the foundation of data analysis and creating reports in Power BI. This course lets you explore a toolbox of data cleaning, sh…

By Maarten Van den Broeck and Sara Billen

Follow
Logo for Coursera

Online Learning

Johns Hopkins University

Getting and Cleaning Data

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining …

Follow
Logo for Codecademy

Online Learning

Learn Text Processing

Text is everywhere, and knowing how to clean it will transform your data science skillset. Many in the industry estimate that 80% of data science is …

Follow
Logo for LinkedIn

Online Learning

NLP with Python for Machine Learning Essential Training

Explore natural language processing (NLP) concepts, review advanced data cleaning and vectorization techniques, and learn how to build machine learni…

Follow
Logo for Coursera

Online Learning

Google

Process Data from Dirty to Clean

This is the fourth course in the Google Data Analytics Certificate. These courses will equip you with the skills needed to apply to introductory-leve…

Follow
Logo for Online Textbooks

Online Textbooks

A Beginner's Guide to Clean Data

This book will help you to become a better data scientist by showing you the things that can go wrong when working with data - particularly low-quali…

By Benjamin Greve

Follow
Logo for FutureLearn

Online Learning

Analysing Data in Excel

Learn how to become a pro at using Microsoft Excel’s key formulasExcel is one of the most widely used business applications in the world. But while …

Follow
Logo for DataCamp

Online Learning

Analyzing IoT Data in Python

Learn how to import, clean and manipulate IoT data in Python to make it ready for machine learning.

By Matthias Voppichler