Session: Advanced data wrangling with pandas

Target-audience:
Advanced

Pandas is nowadays the library of choice for manipulating and analysing structured data, providing high-performance, easy-to-use data structures and data analysis tools.

In this hands-on tutorial, using an air quality time series dataset, I will guide you through some of its powerful methods to answer questions from the data. How is the air quality evolving? What are the typical daily profiles? But above all, this tutorial will guide you through generally applicable and advanced data manipulation methods, including time series manipulation (resampling and rolling operations), groupby operations, reshaping with stack/unstack/pivot, …

Basic familiarity with pandas data structures (Series/DataFrame) is assumed.