Data Validation

Data Validation is the process of checking that data is correct or has been properly corrected. This process is often done after data cleaning to check that the errors have been properly removed and that the data is of a sufficiently high quality.

Validation is typically done using constraints of some form. In relational databases one can naturally use normal relational constraints for most validation, or can use more complex functionality such as triggers for more sophisticated validation. For triplestores and RDF one can use any of the constraint languages available, such as SHACL or SPIN, or use OWL to check for inconsitencies.

Relevant Articles and Tutorials