In data preprocessing, what is the term for the process of converting text data into numerical values?

Data Encoding
Data Tokenization
Data Parsing
Data Transformation

The correct answer is: A. Data Encoding

Data encoding is the process of converting data from one form to another, typically from human-readable to machine-readable. In the context of data preprocessing, data encoding is used to convert text data into numerical values. This is done by assigning a unique number to each unique word or character in the text. This allows the data to be stored and processed more easily by computers.

Data tokenization is the process of breaking down text into smaller units, such as words or phrases. This is done by identifying the boundaries between words and phrases. Data tokenization is often used in natural language processing tasks, such as text classification and sentiment analysis.

Data parsing is the process of converting data from one format to another. This is done by identifying the structure of the data and then converting it into a format that can be used by a computer. Data parsing is often used in data integration tasks, such as loading data into a database.

Data transformation is the process of changing the format or structure of data. This is done by applying a set of rules to the data. Data transformation is often used in data cleaning tasks, such as removing duplicate data or correcting errors.