SRE Platform Engineer

mPokket logo

mPokket

View Salaries, Reviews, and more  

Job Description

The Dev-SRE Engineer is a hybrid technologist who blends software development and reliability engineering to ensure production systems are scalable, resilient, secure, and observable. This role goes beyond infrastructure operations; it involves writing code, building platforms, automating workflows, and ensuring end-to-end reliability from source to production. The engineer will develop backend and frontend components for internal tools, reliability dashboards, and automation portals while owning SRE KPIs such as SLOs, MTTR, and Error Budgets.

Responsibilities

  • Define, measure, and enforce SLOs, SLIs, and SLAs for all critical services.
  • Manage error budgets and reduce change failure rate through reliability reviews.
  • Improve MTTD and MTTR using proactive detection, tracing, and auto-remediation.
  • Contribute code fixes and architectural improvements to reduce fragility.
  • Build internal developer platforms, self-service portals, and automation tools.
  • Develop and maintain backend services (Python, Go, Java, Node.js ).
  • Design frontend dashboards (React, Angular, or Vue) for enhanced observability and control.
  • Implement resilience patterns (circuit breakers, retries, back-pressure, idempotency).
  • Automate load, soak, and stress testing in CI/CD environments.
  • Monitor latency (p95/p99) and throughput metrics; optimise code and infra performance.
  • Collaborate with dev teams to fix code-level performance bottlenecks and inefficiencies.
  • Ensure complete telemetry coverage (metrics, logs, traces).
  • Develop and maintain Golden Signal dashboards (L-T-E-S).
  • Reduce alert noise to < 20% with actionable and intelligent alerting.
  • Lead incident response, triage, and RCA with blameless postmortems.
  • Ensure detection and resolution SLAs are met and automate repetitive triage tasks.
  • Track RCA completion, recurrence, and use AI-driven enrichment where applicable.
  • Collaborate with AppSec and Engineering to remediate VA/PT findings (SAST, DAST).
  • Write and review secure code fixes in backend and frontend services.
  • Automate secrets management, IAM validation, and dependency scanning.
  • Develop event-driven automation and self-healing systems.
  • Implement AI-based anomaly detection and auto-remediation workflows.
  • Create reusable runbooks and automate reliability validation.
  • Drive compliance with SOC2 PCI DSS, ISO27001 and FinTech security requirements.
  • Partner with Dev, QA, and Security for reliability and scalability improvements.
  • Mentor developers on observability and secure coding practices.

  • Requirements

    • Bachelor's or Master's degree in Computer Science, Information Technology, or an Engineering discipline from a reputed institution.
    • 4-7 years of professional experience in Software Development, Site Reliability Engineering, or Platform Engineering.
    • Proven track record working in high-scale, high-availability production systems in Bangalore or similar Tier-1 tech environments.
    • Strong coding experience in backend and frontend frameworks such as Spring Boot, FastAPI, Node.js, React, Angular, or Vue.
    • Experience building and maintaining developer platforms, CI/CD pipelines, and observability dashboards.
    • Familiarity with Agile methodologies, Git workflows, and Kubernetes (EKS/GKE) environments.
    • Strong problem-solving, debugging, and analytical skills with the ability to handle 24x7 production operations.
    • Exposure to FinTech or BFSI environments with strict compliance (SOC2 ISO27001 PMLA, PCI DSS).
    • Excellent communication and documentation skills for cross-functional collaboration.
    • Candidates currently based in or willing to relocate to Bangalore are preferred.

    Technical Skills

    • Tools and Frameworks: Programming (Backend), Java, Go, Python, Node.js, Spring Boot, FastAPI, Express.js, Frontend, React, Angular.
    • Cloud and Containers: AWS, Kubernetes (EKS), Docker.
    • IaC and CI/CD: Terraform, CloudFormation, CDK, Jenkins, GitLab CI, Observability, Datadog, Prometheus, Grafana, Coralogix, ELK, Security, Snyk, OWASP ZAP, SonarQube, Checkov.
    • Messaging and DB: Kafka (MSK), RabbitMQ, Redis, RDS, PostgreSQL, DynamoDB.
    • Automation and Scripting: Python, Go, Bash.

    This job was posted by Rishika Datta from mPokket.

    Interview Questions of SRE Platform Engineer at mPokket

    Currently, there aren't any interview questions for this role at mPokket shared by other job seekers.
    View more interview questions of similar roles from other companies โ†’
    banner icon
    Prepare For Your Interview in 1 Week?
    Equip yourself with possible questions that interviewers might ask you, based on your work experience and job description.
    Get Started!

    Salary Insights of SRE Platform Engineer at mPokket

    Currently, there aren't any salaries for this role at mPokket shared by other job seekers.

    View more salaries from mPokket โ†’

    Achieve your dream job with our top-notch tools!

    Resume Checker Illustration

    Resume Checker

    Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!

    Check Now
    Interview Preparation Illustration

    AI InterviewPrep

    Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!

    Check Now
    Resume Builder Illustration

    Resume Builder

    Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!

    Check Now