Allocation and Capacity planning engineer

Cupertino, CA 95014
  • Job Code
    200214284
Summary

Summary

Posted: Feb 10, 2021

Weekly Hours: 40

Role Number:200214284

Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products very quickly....Summary

Summary

Posted: Feb 10, 2021

Weekly Hours: 40

Role Number:200214284

Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products very quickly. Bring passion and dedication to your job, and there's no telling what we can accomplish together. We're looking for a hardworking and passionate person to join this amazing team, and if you feel this is you, we'd love to hear from you! Do you love crafting elegant solutions to highly complex challenges? Can you intrinsically see the importance of every detail?

As cloud administration engineers we are responsible for all aspects of capacity allocation and management for Apple's largest internal batch cloud system. This infrastructure runs millions of jobs per day from thousands of users, providing critical compute for the development of nearly every product at Apple. Your responsibilities will include defining and implementing policies for how capacity is allocated to teams and users, reviewing and vetting capacity requests and changes, creating tools and services to register capacity requirements, leading capacity planning efforts by providing data on infrastructure usage, and developing control systems to automate the configuration of the job scheduler. We are a highly dynamic team, the responsibilities can evolve day-to-day to meet the every-changing needs and demands of the business.

Key Qualifications

  • Experience administrating compute capacity of a cloud infrastructure or supercomputing center (alternatively, experience developing tools to facilitate that task).
  • Experience with batch cloud or HPC infrastructures and their workloads.
  • Understanding of batch job scheduling technologies: QoS, fairshare, prioritization, and preemption.
  • Familiarity with modern software development processes and best practices.
  • Experience developing and managing a large-scale production system.
  • Proficiency with performing data analysis and creating tools in at least one of the following programming languages: Golang, Python.
  • Fluency in Linux and shell

Description

Review and implement allocation requests for a cloud batch infrastructure. Define capacity allocation policies and QoS semantics. Design, implement, and debug tools and services to support the capacity management process: customer requirements registry, capacity allocation definition, and scheduler configuration automation. We support infrastructure planning efforts with tools to provide analysis on workload patterns and usage data.

Education & Experience

Bachelor's Degree in Computer Science or related field, or equivalent related experience. Advanced Degree preferred.

Additional Requirements

  • EXPERIENCE IN THE BELOW IS A PLUS:
  • - Workload analysis and modeling
  • - Supporting internal customers of a cloud infrastructure or HPC systems.
  • - HPC scheduling systems: Slurm, LSF, Moab, Torque, Flux
  • - Fluency in SQL, Golang, and Python


Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Allocation and Capacity planning engineer

Apple, Inc.
Cupertino, CA 95014

Join us to start saving your Favorite Jobs!

Sign In Create Account