Principal Data Engineer

  • Amgen
  • Tampa, FL, USA
  • Dec 11, 2020
Full-time Data Architecture Data Engineering Data Governance Data Integration Data Modeling ETL Programming Software Engineering Spark SQL

Job Description

What you will do

Let’s do this. Let’s change the world. In this vital role you will work closely with product owner/business analyst and delivery managers to understand the complexity of regional requirements. This role will be part of the newly established technical/engineering team, lead design and develop of data flow pipelines to extract, transform, and load data from various data sources in various data format to enterprise data lake and data warehouse system in three regions in AWS.


  • Be able to do hands on work, lead data engineer team, mentor junior data engineers
  • Collaborate with Business SME’s, and Data Scientists to architect data products and services
  • Provide architectural and data model oversight for processes which perform data extraction, transformation, work flow management and data quality check
  • Drive the exploration and adoption of new tools, technique, propose improvements to the data pipelines and system architecture
  • Provide governance on data integration cross different organizations in enterprise data lake
  • Act as a product manager for the data platform backlog
  • Participate in sprint planning meetings and provide estimations on technical implementation
  • Travel – Approximately 10% of work time

What we expect of you

We are all different, yet we all use our unique contributions to serve patients. The hard working professional we seek is a Data Engineer with experience building scalable, automated solutions in a fast-paced environment, with these qualifications.

Basic Qualifications

Doctorate degree and 2 years of Information Systems experience


Master’s degree and 6 years of Information Systems experience


Bachelor’s degree and 8 years of Information Systems experience


Associate degree and 10 years of Information Systems experience


High school diploma / GED and 12 years of Information Systems experience

Preferred Qualifications

  • BS or MS in Computer Science or Engineering related fields
  • Experience architecting and building ETL pipelines; Hands-on experience with SQL, preferred Oracle, PostgreSQL, and Hive SQL
  • Demonstrable experience with data modeling for both OLAP and OLTP databases, performance tuning for relational database, NoSQL datastore, and columnar database;
  • Experience working with Apache Spark, Apache Airflow
  • 8+ years of experience with one or more general purpose programming languages, Java, Python, Scala, C, C++, C#, or JavaScript
  • Experience with Software engineering best-practices, including but not limited to version control (Git, Subversion, etc.), CI/CD (Jenkins, Maven etc.), automated unit testing, Dev Ops
  • Experience with AWS cloud services: EC2, S3, EMR, RDS, Redshift/Spectrum, Lambda, Glue, Athena, API gateway
  • Full stack development using cloud services (AWS preferred) and cloud-native tools and design patterns (Containers, Serverless, Docker, etc.)
  • Hands on development experience with Databricks
  • Hands on development experience with Informatica MDM product
  • Experience working with and leading agile development methodologies such as Sprint and Scrum
  • Biotech / Pharma experience

What you can expect of us

As we work to develop treatments that take care of others, so we work to care for our teammates’ professional and personal growth and well-being.

Vast opportunities to learn and move up and across our global organization

Diverse and inclusive community of belonging, where teammates are empowered to bring ideas to the table and act

Generous Total Rewards Plan comprising health, finance and wealth, work/life balance, and career benefits

Job ID