Senior Site Reliability Engineer - Retail Store Applications

Cupertino, CA 95014
  • Job Code
    200271256
Summary

Summary

Posted: Aug 11, 2021

Weekly Hours: 40

Role Number:200271256

We at Retail store apps infrastructure and operations team are responsible for the maintenance and high availab...Summary

Summary

Posted: Aug 11, 2021

Weekly Hours: 40

Role Number:200271256

We at Retail store apps infrastructure and operations team are responsible for the maintenance and high availability of business critical infrastructure and services. We are seeking a talented, self-driven individual to help co-ordinate the timely restoration of these services and drive incident management process across various cross-functional groups.
Are you a lifelong learner? Do you have a passion for building tools and automation? Come and join us in a fun-loving, highly motivated team and be a part of Apple's WW retail journey.

Key Qualifications

  • 5+ years of recent systems engineering, software engineering, site reliability, or dev-ops experience in a medium to large scale production Linux or other UNIX environment.
  • 5+ years of scripting and development skills in bash and one of Python, Ruby, Perl or JavaScript.
  • 5+ years experience with Linux, Apache, DNS, monitoring, load-balancing, and caching.
  • Proficiency in using Splunk, Kibana, distributing tracing systems like ZipKin, Jaeger etc
  • Should be proficient in Java - Oracle based app architecture.
  • Proficient in object-oriented programming and distributed systems
  • Experience in Kafka, Cassandra, Couchbase, Elastic search,Solr and other no sql technologies
  • Working Knowledge on Docker, Kubernetes and other container technologies is highly preferred
  • Bonus Skills
  • Knowledge of Configuration Management tools (e.g. Ansible, Puppet, Chef, SALT, Terraform, CloudFormation)
  • Experience on AWS

Description

At Retail store apps team, we build and manage large scale web and iOS applications that are used by Apple retail store employees world wide. We strive to provide operational excellence by ensuring the highest levels of performance and availability across retail store apps. As we expand our presence from Apple Data center to other cloud providers, we are looking for a self-driven and highly motivated Site reliability engineer to restore the critical services and provide highest quality of customer support Should have the ability to work in a fast-paced, mission critical environment
Build robust monitoring and alerting systems for our suite of apps
Review application design, identify SLO and SLA for micro services and build tools to measure these metrics and improve them
Design reporting system to measure the health and effectiveness of app features being rolled out in production
Manage pilot roll out of application to apple retail stores across the world and build systems to collect feedback on app roll out
Represent the Ops team on key engineering releases and features - ensure operational readiness and communicate deployment and mitigation planning to all stakeholders
On-call support, monitoring, and triaging as part of a shared rotation. Diagnose and mitigate critical failures in high pressure situations. Perform deep dives and root cause analysis as needed
Develop strong cross-functional relationships with business partners, Engineering, Quality engineering and Retail Store field team
Troubleshoot, research, analyze, and diagnose complicated technical issues by diving in to backend systems and logging

Education & Experience

Basic Qualifications - BS in Computer Science or equivalent;
Preferred Qualifications - MS in Computer Science or equivalent;

Additional Requirements

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Senior Site Reliability Engineer - Retail Store Applications

Apple, Inc.
Cupertino, CA 95014

Join us to start saving your Favorite Jobs!

Sign In Create Account