Pandas is a Python library that makes it easy to sort and analyze data that is in row/column format. You can think of it like Excel for Python. A storyteller might use Pandas to filter data for a data visualization, such as the interactive visualization in The G Word project.
While Python has built-in data structures like lists and arrays to work with data, Pandas makes common tasks easier. For example, imagine a table where every row has a name. In Pandas, it is very easy to filter the table by names starting with “A” or count how many times each name appears in the table. In addition to data manipulation, Pandas also allows for straightforward data analysis. It is also possible to use Pandas to visualize some kinds of data.
In short, Pandas is an open-source software library written for the Python programming language for data manipulation and analysis. It relies on Numpy and Matplotlib, two of the main libraries in the Python ecosystem. Pandas is comparable to Scikit-Learn, but some consider Pandas more accessible.