11. In data preprocessing, what is the term for the process of converting text data into numerical values?

Data Encoding
Data Tokenization
Data Parsing
Data Transformation
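
As an illustration of the operation this question describes, the sketch below maps text categories to integer codes. The helper name `label_encode` and the sample data are hypothetical, and this is only one possible scheme (one-hot encoding is another).

```python
def label_encode(values):
    """Map each distinct string to an integer code, in order of first appearance."""
    mapping = {}  # hypothetical lookup table: text value -> integer code
    codes = []
    for value in values:
        if value not in mapping:
            mapping[value] = len(mapping)  # next unused integer
        codes.append(mapping[value])
    return codes, mapping


colors = ["red", "green", "red", "blue"]  # made-up sample data
codes, mapping = label_encode(colors)
print(codes)    # [0, 1, 0, 2]
print(mapping)  # {'red': 0, 'green': 1, 'blue': 2}
```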

12. Data that summarize all observations in a category are called ________ data.

frequency
summarized
raw
none of the mentioned
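
To make the idea of summarizing observations per category concrete, here is a minimal sketch that counts how often each category occurs. The sample observations are made up for illustration.

```python
from collections import Counter

# Made-up raw observations, one label per observation.
observations = ["cat", "dog", "cat", "bird", "dog", "cat"]

# Counter tallies how many observations fall into each category.
counts = Counter(observations)

print(counts["cat"])          # 3
print(counts.most_common(1))  # [('cat', 3)]
```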

13. Which of the following is NOT a common data preprocessing technique?

Data Cleaning
Data Visualization
Data Encoding
Data Imputation

14. What is the term for the process of selecting a representative subset of a larger dataset for analysis?

Data Sampling
Data Integration
Data Transformation
Data Cleaning
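
Selecting a subset of a larger dataset can be sketched with simple random sampling without replacement. The function name `sample_rows`, the seed, and the toy population are assumptions for illustration. Whether a sample is truly representative depends on the data and the sampling design, which this sketch does not address.

```python
import random


def sample_rows(rows, k, seed=0):
    """Return k distinct rows drawn uniformly at random (hypothetical helper)."""
    rng = random.Random(seed)   # fixed seed so the draw is reproducible
    return rng.sample(rows, k)  # sampling without replacement


population = list(range(100))  # toy dataset of 100 row ids
subset = sample_rows(population, 10)
print(len(subset))  # 10
```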

15. What is the term for the process of organizing and storing data in a structured format that enables efficient retrieval?

Data Integration
Data Transformation
Data Aggregation
Data Warehousing

16. Which of the following transforms can be applied to data values?

log2
cos
log10
all of the mentioned
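
For reference, each of the listed functions can be applied elementwise to numeric values. The sketch below uses Python's standard `math` module with made-up sample values. Note that the logarithms are only defined for positive inputs.

```python
import math

values = [1.0, 2.0, 8.0, 100.0]  # made-up positive sample values

log2_values = [math.log2(v) for v in values]    # base-2 logarithm
log10_values = [math.log10(v) for v in values]  # base-10 logarithm
cos_values = [math.cos(v) for v in values]      # cosine, input in radians

print(log2_values[2])   # 3.0, since 2**3 == 8
print(log10_values[3])  # 2.0, since 10**2 == 100
```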

17. Which of the following blocks of information is the odd one out?

Subsetting
Raw data
Ready for analysis
None of the mentioned

18. Which of the following can be used to view all the tables in memory?

tables
alltable
table
none of the mentioned

19. In data preprocessing, what is the term for the identification and removal of duplicate or redundant data?

Data Deduplication
Data Aggregation
Data Normalization
Data Imputation
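
Identifying and removing duplicate records can be sketched as follows. The helper name `drop_duplicates` and the sample rows are hypothetical. The sketch treats two records as duplicates only when they are exactly equal, and keeps the first occurrence of each.

```python
def drop_duplicates(records):
    """Return records with exact repeats removed, preserving first-seen order."""
    seen = set()
    unique = []
    for record in records:
        if record not in seen:  # records must be hashable, e.g. tuples
            seen.add(record)
            unique.append(record)
    return unique


rows = [("ann", 30), ("bob", 25), ("ann", 30), ("cy", 41), ("bob", 25)]
unique_rows = drop_duplicates(rows)
print(unique_rows)  # [('ann', 30), ('bob', 25), ('cy', 41)]
```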

20. Regular expressions can be thought of as a combination of literals and metacharacters.

TRUE
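
The mix of literals and metacharacters can be seen in a small pattern. In this sketch, which uses Python's `re` module, `data` and `csv` are literals, while `^`, `\d`, `+`, `\.` (an escaped dot) and `$` are metacharacter constructs. The file names are made up for illustration.

```python
import re

# Literals: "data", "csv". Metacharacters: ^ (start), \d+ (one or more digits),
# \. (a literal dot, escaped), $ (end).
pattern = re.compile(r"^data\d+\.csv$")

names = ["data1.csv", "data42.csv", "dataX.csv", "mydata7.csv", "data7.csvx"]
matches = [name for name in names if pattern.search(name)]
print(matches)  # ['data1.csv', 'data42.csv']
```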