icrunchdata Network
Camden, NJ, USA
Position: Spark Developer Location: Camden, NJ Duration: 6 months Job Description Passion in Data Engineering, Programming Technologies like Scala or Java. Strong Object-oriented programming and design skills, preferably in Java or Scala programming with Spark Framework Excellent analytical and problem-solving skills, oral and written communication skills. Experience building scalable, reliable, distributed Unix-based systems with Big Data processing technologies (MapReduce, Hadoop, HBase, Cassandra, other NO SQL solutions). Experience designing and implementing data ingestion and transformation for big data platforms. (Spark, Kafka, S3 Parquet, HDFS, Athena etc.). Proven track record designing highly parallelized data ingestion and transformation jobs in Spark including Spark Streaming. Production experience working with EMR clusters. Strong relational Database (AWS RDS, Postgres, MySQL) skills. Lead team of developers take care of all data ingestion request from various source system. Experience working with Elastic Search. GitHub Code repository skill is mandatory. Good SQL programming skills. Python programming skill is a plus. Knowledge on AWS services and CI/CD is a plus. - provided by Dice
Position: Spark Developer Location: Camden, NJ Duration: 6 months Job Description Passion in Data Engineering, Programming Technologies like Scala or Java. Strong Object-oriented programming and design skills, preferably in Java or Scala programming with Spark Framework Excellent analytical and problem-solving skills, oral and written communication skills. Experience building scalable, reliable, distributed Unix-based systems with Big Data processing technologies (MapReduce, Hadoop, HBase, Cassandra, other NO SQL solutions). Experience designing and implementing data ingestion and transformation for big data platforms. (Spark, Kafka, S3 Parquet, HDFS, Athena etc.). Proven track record designing highly parallelized data ingestion and transformation jobs in Spark including Spark Streaming. Production experience working with EMR clusters. Strong relational Database (AWS RDS, Postgres, MySQL) skills. Lead team of developers take care of all data ingestion request from various source system. Experience working with Elastic Search. GitHub Code repository skill is mandatory. Good SQL programming skills. Python programming skill is a plus. Knowledge on AWS services and CI/CD is a plus. - provided by Dice
icrunchdata Network
Camden, NJ, USA
American Water is purpose-driven , people-powered , customer-obsessed and the trusted source of everything water . Our Technology & Innovation teams work with cutting-edge technologies to create adaptive solutions that have an impact on the environment and people s lives. If you re interested in working as a member of a fast-paced team that loves to innovate and do amazing things, with the purpose of making the world a better place by inventing the future of the water industry, then enroll in our vision today Apply now or visit American Water Careers to review all of the amazing opportunities we have to offer. Primary Role Helping to develop efficient unstructured data extraction and self-learning NLP applications and/or deep learning Work closely with Software Engineers and internal teams to discover, invent, and build at the largest scale. Ideas may come from internal projects as well as from collaborations with peer researchers. From creating experiments and prototyping implementations to designing new learning algorithms and deploying them Participate in cutting edge research in NLP and/or deep learning applications Develop and deploy solutions for real world, large scale problems Key Accountabilities Exploring and developing new Machine Learning models and techniques Introduces creative approaches to research topics and generates new approaches, perspectives and solutions to research topics Planning and designing research projects: specifying the problem and defining the project scope Realizing solutions through prototypes Exploring new data sources and discovering techniques for best leveraging data Collecting and performing data analysis to validate and further new theories and discoveries Working closely with product engineers to design, develop and incorporate AI solutions into new products Thinking strategically about research directions Knowledge/Skills Hands-on experience in popular NLP libraries (e.g. NLTK, Stanford CoreNLP, spaCy, Gensim) Experience with machine learning/deep learning frameworks/packages (e.g. Keras, Tensorflow, PyTorch) Solid background in statistical learning techniques for NLP (e.g. Nave Bayes, HMMs, CRFs, LDA, LSI, MRFs) Knowledge/experience with state-of-the-art methods in NLP (e.g. word/paragraph embedding, LSTM, attention) Experience/Education 2 years of full-time industry experience in NLP and/or deep learning, and machine learning, from concept to production 2 years experience in Python or any object-oriented programming languages. Experience in big data (e.g. Hive, Spark) and cloud computing platforms (e.g. AWS) is a plus Experience in one or more of the following areas in NLP: entity/relation extraction, question answering (QA) system, information extraction, summarization, semantics, document classification, ontology, chatbot, and/or in deep learning: Hands-on experiences in deep-learning models (e.g. CNN, RNN, LSTM, GANS, transformers) Experience working in Agile development process and deep understanding of various phases of the data-science project development life cycle Travel Requirements As necessary, up to 30%. Competencies Champions safety Collaborates Cultivates innovation Customer obsessed Drives Results Nimble learning Join American Water We Keep Life Flowing American Water is firmly committed to Equal Employment Opportunity (EEO) and prohibits employment discrimination for employees and applicants based on his or her age, race, color, pregnancy, gender, gender identity, sexual orientation, national origin, religion, marital status, citizenship, or because he or she is an individual with a disability, protected veteran or other status protected by federal, state, and local laws. - provided by Dice
American Water is purpose-driven , people-powered , customer-obsessed and the trusted source of everything water . Our Technology & Innovation teams work with cutting-edge technologies to create adaptive solutions that have an impact on the environment and people s lives. If you re interested in working as a member of a fast-paced team that loves to innovate and do amazing things, with the purpose of making the world a better place by inventing the future of the water industry, then enroll in our vision today Apply now or visit American Water Careers to review all of the amazing opportunities we have to offer. Primary Role Helping to develop efficient unstructured data extraction and self-learning NLP applications and/or deep learning Work closely with Software Engineers and internal teams to discover, invent, and build at the largest scale. Ideas may come from internal projects as well as from collaborations with peer researchers. From creating experiments and prototyping implementations to designing new learning algorithms and deploying them Participate in cutting edge research in NLP and/or deep learning applications Develop and deploy solutions for real world, large scale problems Key Accountabilities Exploring and developing new Machine Learning models and techniques Introduces creative approaches to research topics and generates new approaches, perspectives and solutions to research topics Planning and designing research projects: specifying the problem and defining the project scope Realizing solutions through prototypes Exploring new data sources and discovering techniques for best leveraging data Collecting and performing data analysis to validate and further new theories and discoveries Working closely with product engineers to design, develop and incorporate AI solutions into new products Thinking strategically about research directions Knowledge/Skills Hands-on experience in popular NLP libraries (e.g. NLTK, Stanford CoreNLP, spaCy, Gensim) Experience with machine learning/deep learning frameworks/packages (e.g. Keras, Tensorflow, PyTorch) Solid background in statistical learning techniques for NLP (e.g. Nave Bayes, HMMs, CRFs, LDA, LSI, MRFs) Knowledge/experience with state-of-the-art methods in NLP (e.g. word/paragraph embedding, LSTM, attention) Experience/Education 2 years of full-time industry experience in NLP and/or deep learning, and machine learning, from concept to production 2 years experience in Python or any object-oriented programming languages. Experience in big data (e.g. Hive, Spark) and cloud computing platforms (e.g. AWS) is a plus Experience in one or more of the following areas in NLP: entity/relation extraction, question answering (QA) system, information extraction, summarization, semantics, document classification, ontology, chatbot, and/or in deep learning: Hands-on experiences in deep-learning models (e.g. CNN, RNN, LSTM, GANS, transformers) Experience working in Agile development process and deep understanding of various phases of the data-science project development life cycle Travel Requirements As necessary, up to 30%. Competencies Champions safety Collaborates Cultivates innovation Customer obsessed Drives Results Nimble learning Join American Water We Keep Life Flowing American Water is firmly committed to Equal Employment Opportunity (EEO) and prohibits employment discrimination for employees and applicants based on his or her age, race, color, pregnancy, gender, gender identity, sexual orientation, national origin, religion, marital status, citizenship, or because he or she is an individual with a disability, protected veteran or other status protected by federal, state, and local laws. - provided by Dice