Senior Data Scientist / Software Engineer

  • Tesla
  • Palo Alto, CA, USA
  • Sep 16, 2020
Full-time · Data Modeling, Data Science, Data Visualization, DevOps, Machine Learning, MATLAB, Python, R, Software Engineering, Statistics

Job Description

Data is deeply embedded in the product and engineering culture at Tesla. We rely on data – lots of it – to improve Autopilot, to optimize hardware designs, to proactively detect faults, and to optimize load on the electrical grid. We collect data from each of our cars, Superchargers, and energy products and use it to make these products better and our customers safer.

We're the Fleet Analytics team, a small but fast-growing central team that helps many teams leverage the data we collect. We help engineers through direct support by doing data analysis for them and through applications and tools so they can self-serve those analyses in the future. To do so, we leverage the internal big data platform that is built on top of Kafka, Spark, Presto and data science tools such as Jupyter notebooks, Pandas, Bokeh, Superset and Airflow.

We're looking for an experienced engineer to join us. This foundational member will provide leadership in the definition and implementation of processes and tools that enable Tesla's data science.

You will lead the end-to-end machine learning pipeline for our Telematics Insurance products, where customers pay a premium based on their driving behavior. You will design, build and operate the data pipelines that support these products, from building features and training the model to adapting the model to different markets and making sure that the results are available on time. By supporting these new insurance products, you will directly further our mission by making it more affordable to own a Tesla, as well as giving our customers an incentive to drive more safely.

Responsibilities

  • Design and build your machine learning pipeline end-to-end
  • Be creative in identifying new driving behavior metrics that can help predict insurance losses
  • Work across engineering teams to understand how to find the data you need
  • Build efficient and reproducible data pipelines consuming petabytes of time series data using cutting-edge open source technologies
  • Build a statistical model to predict insurance losses, along with everything you need to iterate on it (feature selection, hyperparameter tuning, validation, etc.)
  • Evaluate, justify and communicate model performance
  • Schedule and operate your data pipeline
  • Present your results to Tesla's leadership, including CFO and CEO
  • Write clean and tested code that can be maintained and extended by other software engineers
  • Keep up to date on relevant technologies and frameworks, and propose new ones that the team could leverage
  • Identify trends, invent new ways of looking at data, and get creative in order to drive improvements in both existing and future products
  • Give talks, contribute to open source projects, and advance data science on a global scale

Requirements

  • Strong proficiency in Python
  • Strong foundation in statistics
  • Strong foundation in software engineering
  • Experience building multiple statistical models that provided company value
  • Experience with data science tools such as Pandas, NumPy, R, MATLAB, Octave
  • Strong verbal and written communication skills
  • Strong problem-solving skills to come up with good solutions to problems you are the first to tackle
  • Smart but humble, with a bias for action

Nice to have

  • Experience in the insurance industry
  • Experience with getting statistical models approved by regulators
  • Experience building data pipelines
  • Experience building web applications
  • Experience building data visualizations
  • Experience with continuous integration and continuous delivery
  • Experience in DevOps, e.g. Linux, Ansible, Docker, Kubernetes
  • Understanding of distributed computing, e.g. how HDFS, Spark and Presto work
  • Proficient in Scala

Job ID

67603