Data classification is the process of organizing data into categories based on attributes such as file type, content, or metadata. Each category is then assigned a class label that describes the attributes shared by the corresponding data sets. The goal is to give meaningful class attributes to formerly less structured information.
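As a rough illustration of assigning class labels from file type, content, and metadata, the following Python sketch applies a few rules to a record. The `Record` type, the rule set, and the label names are all hypothetical and chosen only for the example; a real scheme would define its own.

```python
import re
from dataclasses import dataclass, field
from pathlib import PurePath


@dataclass
class Record:
    """A hypothetical unit of data to classify."""
    filename: str
    content: str = ""
    metadata: dict = field(default_factory=dict)


# Illustrative content rule: a simple (not RFC-complete) e-mail pattern.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def classify(record: Record) -> set[str]:
    """Return the set of class labels that apply to the record."""
    labels = set()

    # Attribute 1: file type, taken from the file extension.
    suffix = PurePath(record.filename).suffix.lower()
    if suffix in {".csv", ".xlsx"}:
        labels.add("tabular")
    elif suffix in {".txt", ".md"}:
        labels.add("text")

    # Attribute 2: content, checked against the pattern above.
    if EMAIL_PATTERN.search(record.content):
        labels.add("contains-email")

    # Attribute 3: metadata, here an assumed "owner" key.
    owner = record.metadata.get("owner")
    if owner:
        labels.add(f"owner:{owner}")

    return labels
```

A record can carry several labels at once, which is why the function returns a set rather than a single category.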
Data classification can also be viewed as a set of labels that define the type of data, particularly with regard to confidentiality and integrity. It is typically a manual process, although tools exist that can help gather information about the data. It is often recommended that data sensitivity levels be taken into account.
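One possible way to model sensitivity levels is as an ordered scale, so that data carrying several labels takes the strictest level among them. The sketch below assumes three hypothetical levels and a hypothetical label-to-level mapping; neither comes from a particular standard.

```python
from enum import IntEnum


class Sensitivity(IntEnum):
    """Hypothetical sensitivity levels, ordered from least to most strict."""
    PUBLIC = 0
    INTERNAL = 1
    CONFIDENTIAL = 2


# Assumed mapping from class labels to sensitivity levels.
LABEL_SENSITIVITY = {
    "marketing": Sensitivity.PUBLIC,
    "internal-memo": Sensitivity.INTERNAL,
    "personal-data": Sensitivity.CONFIDENTIAL,
}


def sensitivity_of(labels, default=Sensitivity.INTERNAL):
    """Return the strictest level among the known labels.

    Unknown labels are ignored; if no label is known, fall back to `default`.
    """
    levels = [LABEL_SENSITIVITY[label] for label in labels
              if label in LABEL_SENSITIVITY]
    return max(levels) if levels else default
```

Falling back to a non-public default for unlabeled data is a design choice in this sketch: it errs on the side of restricting access until someone has reviewed the data.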