February 26-28, 2020
Montreal, Canada

A closer look: Exploratory Data Analysis with Spark

A typical workflow of a Data Scientist involves some level of exploratory data analysis. If you’re using Python when working with your data, you are probably quite familiar with packages like pandas, matplotlib, and others. Switching from pandas to Spark - how do you explore your data? How do you visualize it? In this talk, I’ll take a dataset and will explore it with Spark using IntelliJ IDEA and Apache Zeppelin.

View all 156 sessions

Maria Khalusova


Maria is a Developer Advocate at JetBrains where she focuses on data science, data engineering, and machine learning. Before joining the advocacy team, she has has been a part of such projects as IntelliJ IDEA, TeamCity, Upsource. Maria understands the challenges faced by newcomers to data science and wants to share her knowledge to help others overcome them. She is one of the organizers and co-founders of PyData Montreal, and she has been a speaker at a number of industry events.

Read More

Montreal 2020 sponsored by