As a data engineer you will be working on our massive (PBs) data pipelines, making sure the data is clean, whole, and accessible. Your team's goal is to create amazing ground breaking tools to make the data scientists more productive and agile.
If you love working on complicated network pipelines, you understand the importance of reliable data and have felt the pain of big data inconsistencies,and you're the type who thinks of great solutions and want to bring them to life, come work with us.
5+ Years coding (preferably Python)
Strong SQL abilities
2+ Years experience with big data tools: Hadoop, Spark, Kafka, Presto, EMR etc.
Experience building and optimizing big data data pipelines; including - message queuing,stream processing, and highly scalable data sets
Experience performing root cause analysis on internal and external data and processes.
Strong organizational skills with the ability to juggle multiple tasks within constraints timelines