Cloud SRE

Agilité logo

Agilité

View Salaries, Reviews, and more  

Job Summary


Job Type
-

Seniority
Mid

Years of Experience
Information not provided

Tech Stacks
Python Prometheus Grafana Azure ELK CI Google Cloud GitLab CI play Jenkins Docker AWS GitLab

Job Description

Position: Cloud Site Reliability Engineer (SRE)

Summary:

As a Cloud Site Reliability Engineer (SRE), you will play a key role in ensuring the reliability, scalability, and performance of our cloud-based systems and services. Working closely with cross-functional teams, you will proactively monitor, optimize, and troubleshoot cloud infrastructure, applications, and services to minimize downtime and deliver a seamless user experience. Your expertise in cloud technologies and dedication to automation and best practices will contribute to the stability and growth of our cloud operations.


Key Responsibilities:

  1. Reliability and Availability:
  • Implement best practices for high availability and disaster recovery across cloud environments.
  • Monitor system performance, availability, and incident response to ensure minimal downtime.
  • Create and maintain robust monitoring and alerting systems.
  1. Automation and Infrastructure as Code (IaC):
  • Develop and maintain automation scripts and Infrastructure as Code (IaC) templates for provisioning and managing cloud resources.
  • Automate routine tasks to increase operational efficiency and reduce manual interventions.
  1. Scalability and Performance Optimization:
  • Collaborate with development teams to design and implement scalable and performant cloud architectures.
  • Conduct performance analysis and tuning to optimize system response times and resource utilization.
  1. Incident Response and Troubleshooting:
  • Participate in incident response activities, including root cause analysis, resolution, and post-incident reviews.
  • Troubleshoot complex issues across the cloud stack and coordinate with relevant teams for resolution.
  1. Security and Compliance:
  • Implement security best practices and compliance measures in cloud environments.
  • Collaborate with security teams to ensure data protection and compliance with industry standards.
  1. Capacity Planning:
  • Monitor resource utilization and forecast capacity requirements to support business growth.
  • Implement scaling strategies to accommodate changing workloads.
  1. Documentation and Knowledge Sharing:
  • Maintain comprehensive documentation of cloud configurations, processes, and procedures.
  • Share knowledge and best practices with team members and contribute to a culture of continuous learning.


Qualifications/Requirements:

Basic Qualifications:

  • Bachelor's Degree in Computer Science, Information Technology, or a related field.
  • 4+ years of experience in cloud operations, SRE, or a related role.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
  • AWS Certification is Mandatory – AWS Certified Solution Architect.


Desired Characteristics:

  • Certification in cloud platforms (e.g., AWS Certified Solution Architect, Google Cloud Professional DevOps Engineer, Azure DevOps Engineer Expert).
  • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Knowledge of infrastructure monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Strong scripting and programming skills (e.g., Python, Bash, Go).
  • Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI/CD).
  • Excellent problem-solving and communication skills.
  • Ability to work collaboratively in a cross-functional and fast-paced environment.


Conclusion:

As a Cloud Site Reliability Engineer, you will be at the forefront of ensuring our cloud infrastructure's reliability, scalability, and security. Your technical expertise, automation skills, and commitment to best practices will contribute to the success of our cloud operations, enabling us to deliver high-performance and highly available services to our customers. Join us in this critical role and be part of a dynamic team dedicated to excellence and innovation.



Interview Questions of Cloud SRE at Agilité

Currently, there aren't any interview questions for this role at Agilité shared by other job seekers.
View more interview questions of similar roles from other companies →
banner icon
Prepare For Your Interview in 1 Week?
Equip yourself with possible questions that interviewers might ask you, based on your work experience and job description.
Get Started!

Salary Insights of Cloud SRE at Agilité

Currently, there aren't any salaries for this role at Agilité shared by other job seekers.

View more salaries from Agilité →

Achieve your dream job with our top-notch tools!

Resume Checker Illustration

Resume Checker

Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!

Check Now
Interview Preparation Illustration

AI InterviewPrep

Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!

Check Now
Resume Builder Illustration

Resume Builder

Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!

Check Now