- Day 3, 11:15‑12:00
- English talk
- Python Libraries
Practical Data Transformation and Analysis with Pandas
In most large companies, data transformation and analysis are done with SQL. However, most SQL environments don't possess a rich ecosystem like Python and using SQL for complex data aggregation complicates the SQL code and make it not maintainable. By exploiting the power of Pandas and Python, data transformation and analysis can become simple and wonderful jobs.
This talk mainly focuses on performing data transformation and analysis with Pandas. It will start by introducing basic pandas components and how to work with text data. After that, this talk will focuses on how to use split-apply-combine strategy to transform and aggregate data. The final part of this talk will be demo and Q&A.
Working with various big data solutions in the company and fighting with the MS environment in the company