Data Cleaning

Data cleaning (also called data cleansing) is the process of detecting and correcting erroneous data, typically by updating or transforming faulty values into correct ones. Erroneous data can also be viewed as data of poor quality, where data quality measures how useful or usable a dataset is. Data quality is typically an aggregate over all the data in a dataset, and several different aspects contribute to whether that quality is good or poor.

Note that data quality is specific to the intended use: a dataset that is good enough for one purpose may be inadequate for another.
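As a rough illustration of the detect-and-correct process described above, the following Python sketch cleans a handful of records and then computes one possible aggregate quality measure. Everything in it is an assumption made for the example: the sample records, the field names, the helper functions (clean_name, clean_age, clean_record, completeness), the plausible age range of 0 to 130, and the choice of completeness (the share of values that are present) as the quality aspect being measured. A real cleaning task would use rules chosen for its own data and purpose.

```python
from typing import Optional

# Hypothetical raw records; field names and values are illustrative only.
RAW_RECORDS = [
    {"name": " Alice ", "age": "34"},
    {"name": "bob", "age": "-5"},      # implausible age
    {"name": "", "age": "41"},         # missing name
    {"name": "Carol", "age": "n/a"},   # unparseable age
]


def clean_name(value: str) -> Optional[str]:
    """Trim whitespace and normalize capitalization; empty names become None."""
    stripped = value.strip()
    return stripped.title() if stripped else None


def clean_age(value: str) -> Optional[int]:
    """Parse an age; unparseable or implausible values are treated as missing."""
    try:
        age = int(value)
    except ValueError:
        return None
    # Assumed plausibility range for this example only.
    return age if 0 <= age <= 130 else None


def clean_record(record: dict) -> dict:
    """Apply the field-level cleaning rules to a single record."""
    return {"name": clean_name(record["name"]), "age": clean_age(record["age"])}


def completeness(records: list) -> float:
    """Fraction of field values that are present (not None) across all records."""
    values = [value for record in records for value in record.values()]
    return sum(value is not None for value in values) / len(values)


cleaned = [clean_record(record) for record in RAW_RECORDS]
print(cleaned)
print(completeness(cleaned))
```

Here faulty values that can be repaired (stray whitespace, inconsistent capitalization) are transformed, while values that cannot be repaired are marked as missing rather than guessed. Whether the resulting completeness score is acceptable depends on what the data will be used for.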

Relevant Articles and Tutorials