This role is responsible for designing, building, running, and monitoring public cloud infrastructure that supports a variety of mission-critical services. It is a highly technical, hands-on role that requires expertise in supporting systems at enterprise scale. The candidate will deliver innovative solutions in these key areas:
• Engineering - continuously optimize secure, scalable and performant security tools and services
• Reliability - drive fault detection and correction, performance and uptime at global scale
• Monitoring - instrument systems to gain visibility and understanding of how they are performing at any time
• Automation and orchestration - enable:
  • Accelerated infrastructure, application and software configuration deployment
  • Automated response to alerts or indicators of performance issues
  • Infrastructure as code
Key Qualifications
5+ years of experience in software engineering and systems engineering, including managing operations
Experience supporting infrastructure and services in public and private cloud environments (AWS, GCP, AliCloud, OpenStack, etc.)
Proficiency with programming languages and tools such as Python, Java, Ruby, Perl, Go, and Makefiles for building automation or integrating with APIs
Experience with common data formats such as JSON and YAML, and with compression utilities
Expertise with monitoring or log aggregation tools (Prometheus, Grafana, Splunk, ELK, etc.)
Solid understanding of and experience with centralized configuration management, coordination and provisioning technologies such as Terraform, Ansible, Chef, etc.
Excellent communication skills; must be capable of working with cross-functional technical and business teams and varying levels of management
Understanding of Agile methodologies such as Scrum, and the ability to work in a fast-paced environment
Must be capable of writing detailed solution specifications, diagrams, best practices/standards documentation, operating procedures, test plans/test reports, etc.
Solid understanding of Linux/Unix system internals, including kernel tuning
Experience with failure testing and chaos engineering
Working knowledge of network protocols and network-based services, including routing and network load balancing
Experience with virtualization technologies
Solid understanding of storage systems, including network filesystems
Experience building and supporting containerized applications on platforms such as GKE, EKS, and ECS
Experience with Kubernetes
Description
• Engineer and support cloud platform IaaS and PaaS services
• Partner with application teams to provision scalable workloads reliably across distributed compute resources
• Provide engineering and operational support for distributed systems and network-based information security tools, including configuration management and provisioning
• Implement and maintain security controls
• Work closely with development teams to understand application performance and behavior patterns in order to monitor and tune proactively and to correct problems before they escalate
• Identify opportunities to improve the reliability, performance and security of security tooling
• Develop tools and automation to eliminate manual and repetitive efforts
Additional Requirements
Apple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Role Number: 200383775