In statistics, classification is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known. Examples are assigning a given email to the "spam" or "non-spam" class, and assigning a diagnosis to a given patient based on observed characteristics of the patient (sex, blood pressure, presence or absence of certain symptoms, etc.).
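As a rough illustration of the definition above, the following minimal sketch assigns a new observation to a category using a training set whose category membership is known. It uses a k-nearest-neighbors vote, which is only one of many classification techniques and is chosen here for brevity. The function name `knn_classify`, the two features (link count and exclamation-mark count), and all data values are made up for the example.

```python
from collections import Counter
import math


def knn_classify(training_set, new_observation, k=3):
    """Return the majority category among the k training observations
    closest to new_observation (Euclidean distance)."""
    # Order the labeled observations from nearest to farthest.
    by_distance = sorted(
        training_set,
        key=lambda item: math.dist(item[0], new_observation),
    )
    # Let the k nearest observations vote on the category.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical training set: (links, exclamation marks) -> known category.
training_set = [
    ((8, 6), "spam"),
    ((7, 9), "spam"),
    ((9, 7), "spam"),
    ((1, 0), "non-spam"),
    ((0, 1), "non-spam"),
    ((2, 1), "non-spam"),
]

# Classify two new, unlabeled emails.
print(knn_classify(training_set, (6, 8)))  # spam
print(knn_classify(training_set, (1, 1)))  # non-spam
```

In practice the features would be derived from real observations, and a library implementation would usually be preferred over a hand-written one.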
Classification is most likely to appear in Data Scientist job descriptions, where we found it mentioned 12.1 percent of the time.