Lessons learned running data pipeline on AWS

Before doing any data science, machine learning, or AI, you need to get your data right. As the volume of data grows, keeping a data pipeline reliable, available, and scalable becomes a challenge.
In this talk, we will share lessons learned from running a data pipeline on AWS infrastructure using technologies such as Apache Spark, gRPC, and Protocol Buffers.

Majid Fatemian

Red Ventures

Majid Fatemian is a Principal Software Engineer of data platforms at Red Ventures. He is passionate about the scalability and reliability of distributed systems.

You can find him on Twitter at @majidfn.

Montréal 2019