- Montréal 2020
A typical workflow of a Data Scientist involves some level of exploratory data analysis. If you’re using Python when working with your data, you are probably quite familiar with packages like pandas, matplotlib, and others. Switching from pandas to Spark - how do you explore your data? How do you visualize it? In this talk, I’ll take a dataset and will explore it with Spark using IntelliJ IDEA and Apache Zeppelin.
Voir les 156 présentations
Maria is a Developer Advocate at JetBrains where she focuses on data science, data engineering, and machine learning. Before joining the advocacy team, she has has been a part of such projects as IntelliJ IDEA, TeamCity, Upsource. Maria understands the challenges faced by newcomers to data science and wants to share her knowledge to help others overcome them. She is one of the organizers and co-founders of PyData Montreal, and she has been a speaker at a number of industry events.