ISE, SIML - Data Engineer

Cupertino, CA 95014
  • Job Code
    200283520
Summary

Summary

Posted: Aug 31, 2021

Role Number:200283520

Do you think Computer Vision, NLP and Machine Learning can change the world? Do you think it can transform the way millions of peo...Summary

Summary

Posted: Aug 31, 2021

Role Number:200283520

Do you think Computer Vision, NLP and Machine Learning can change the world? Do you think it can transform the way millions of people capture, discover and share the most special moments of their lives? We truly believe it can! The System Intelligence and Machine Learning (SIML) group is responsible for crafting machine learning solutions to extract high level structure information from images, videos and text shipping on all Apple platforms (macOS, iOS, tvOS, watchOS). Examples include face recognition, scene classification, OCR, handwriting recognition as well as the support for internal tools. We, the SIML Data Team, are responsible for crafting and building high quality datasets at scale. At the heart of machine learning, data defines how Apple features and products operate and what is the final user experience that will impact millions of our customers. This is an exciting time to join us: grow fast, and have an impact on multiple key features on your first day at Apple!

Key Qualifications

  • Proficient in Python or another modern programming language
  • Solid understanding of algorithms, data structures and coding standards
  • Excellent written and verbal communication skills
  • Self-starter, works well with ambiguity and identify the right people and tools to get the job done
  • Curious and eager to learn new technologies and knowledge
  • Passion for natural language processing and internationalization
  • Able to work in fast-paced, high uncertainty environments with loosely defined project needs
  • Aware of the challenges with biased datasets. Able to lead the curation of fair and inclusive datasets
  • Comfortable building and implementing inference pipelines that process large quantities of data (Hadoop, distributed GPU computation), using existing models and/or creating new ones

Description

Our team works in close interaction with R&D, infrastructure and client teams, as well as with other groups and other functions across Apple (legal, privacy) and externally. This position focuses on designing and implementing smart data pipelines based on advanced computer vision technology, NLP and humans in the loop. You will be responsible for the design and development of the data pipelines, automation, visualization and tools that constitute the end-to-end process for building models, from raw data to trained model to evaluation to deployment. You'll partner with data infrastructure and data producing teams to ensure that we have high quality, representative data. You'll work with ML engineers to refine the modeling process to enable faster iteration and better modeling decisions and deploy models more rapidly to customers. You'll collaborate with data scientists and analysts to build insights from customer analytics and feedback into the process to complete the cycle of continuous improvement. Your work will impact hundreds of millions of Apple's customers and help people communicate more easily in the languages and modalities of their choice.

Education & Experience

Bachelors, Masters, or Ph.D. in Computer Science, Mathematics, Physics, or a related field (or equivalent practical experience). Industry experience working with distributed data technologies for building efficient & large-scale data pipelines

Additional Requirements

  • * Strong knowledge of either NLP or Computer Vision is a plus
  • * Familiarity with AWS and Google Cloud Platforms is a plus but not required
  • * Strong knowledge of Scala, Java or other JVM languages is a plus
  • * Solr, Kafka, Hadoop, Spark, Kubernetes and Docker experience is a plus
  • * Experience with Jenkins, Chef, Terraform, pulumi is a plus
  • * Experience with distributed computation and large databases is a plus


Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

ISE, SIML - Data Engineer

Apple, Inc.
Cupertino, CA 95014

Join us to start saving your Favorite Jobs!

Sign In Create Account