The correct answer is D. Pandas.
Pandas is a Python library that provides fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language.
Pandas is well suited for data preprocessing tasks, such as feature scaling and transformation. It provides a variety of functions and tools for these tasks, making them easy to perform. For example, the DataFrame.scale()
function can be used to scale features, and the DataFrame.transform()
function can be used to transform features.
Pandas is also well suited for working with large datasets. It provides a number of features that make it efficient for working with large data, such as the ability to read and write data from a variety of sources, and the ability to cache data in memory.
Overall, Pandas is a powerful and flexible Python library that is well suited for data preprocessing tasks. It provides a variety of functions and tools for these tasks, making them easy to perform. Additionally, it is well suited for working with large datasets.