Senior Data Engineer

  • The Walt Disney Company
  • Burbank, CA, USA
  • Aug 15, 2019
Full-time AWS Data Engineering Data Warehousing Hadoop Java Kafka Linux Scala Spark Unix

Job Description

Movies Anywhere is the next generation of in-home entertainment, providing an unparalleled digital entertainment experience. Leveraging cutting edge technology, unique partnerships, and a talented team, Movies Anywhere is an exclusive, cross-platform, cloud-based movie service that enables consumers to seamlessly discover, grow, access, and enjoy their personal digital movie collection across a variety of studios, retailers, and platforms all in one convenient app and/or website.

Movies Anywhere seeks a Staff Data Engineer to join a team of seasoned, dedicated technologists solving a range of interesting problems in innovative ways in an exciting and dynamic industry. We are looking for a self-starting engineer who wants to shape the next generation of video consumption applications. We’re a casual shop that values passion, community involvement and code that stands out. If you are interested, we’d love to hear from you.

The Data Engineer will work in a small team of multi-disciplined technologists developing insights to drive the Business, Marketing and Finance decisions for our next-generation video delivery and consumption platform. We expect you to be up to date on the happening in the data community, passionate about what you do, and connected to the open source community. You will participate in overall system design, developing multi-tiered data solutions emphasizing reuse and good design patterns.

Responsibilities :

  • Design, build and implement Hadoop/Spark batch jobs
  • Build and optimize performance of Spark, Kafka, ELK, and whatever else makes sense for real-time pipelines
  • Design and architect high quality data-lake, data-warehouse, and data-marts data models
  • Enable and implement Data Science workflows and advanced machine learning algorithms
  • Build and optimize performance of ElasticSearch cluster and relevance
  • Build and maintain data pipelines orchestration
  • Develop and cultivate expertise in current and new technologies and tools
  • Collaborate with other software engineers and cross-functional teams
  • Share new ideas with a larger community of highly experienced technologists
  • Ability to prioritize tasks, requirements, and complexity
  • Mentor junior data engineers on best practices

Basic Qualifications :

  • Real passion for coding (If you have a Github profile, that’s awesome! We would love to check it out!)
  • Understanding of distributed systems and distributed computation
  • Working knowledge in at least 2 of: Scala, Java, Python, or Go-Lang
  • Working knowledge of data Apache Spark ecosystem technologies like Spark, Kafka, Hive, Presto, Oozie, Pig, Hue, Zeppelin
  • Demonstrated working knowledge of data modeling, data-warehouse, data-mart and data-lake
  • Unit, Integration, and Load testing
  • Developing REST APIs
  • Git
  • Maven, SBT, and/or Gradle
  • Unix/Linux
  • Docker containers building and deployment
  • Working experience of AWS
  • Excellent communication and collaboration skills

Preferred Qualifications:

  • Knowledge of Machine Learning Frameworks (MLib,Tensorflow, etc)
  • GraphQL Knowledge
  • Kubernetes knowledge
  • Apache Spark GraphX
  • R
  • GitLab CI/CD
  • Akka Streams

Required Education :

  • BS in Computer Science or related field with 7+ years of experience


Job ID