O'reilly Programming Podcast - O'reilly Media Podcast
Katharine Jarmul on using Python for data analysis
- Autor: Vários
- Narrador: Vários
- Editora: Podcast
- Duração: 0:26:17
- Mais informações
Informações:
Sinopse
The O’Reilly Programming Podcast: Wrangling data with Python’s libraries and packages.In this episode of the O’Reilly Programming Podcast, I talk with Katharine Jarmul, a Python developer and data analyst whose company, Kjamistan, provides consulting and training on topics surrounding machine learning, natural language processing, and data testing. Jarmul is the co-author (along with Jacqueline Kazil) of the O’Reilly book Data Wrangling with Python, and she has presented the live online training course Practical Data Cleaning with Python.Discussion points: How data wrangling enables you to take real-world data and “clean it, organize it, validate it, and put it in some format you can actually work with,” says Jarmul. Why Python has become a preferred language for use in data science: Jarmul cites the accessibility of the language and the emergence of packages such as NumPy, pandas, SciPy, and scikit-learn. Jarmul calls pandas “Excel on steroids” and say