Job Responsibilities:
1. Deeply understand the business, responsible for high availability governance of financial services, and continuously improve business SLA;
2. Through continuous comprehensive data operations (including availability indicators, historical accidents, resource utilization rates, etc.), identify system weaknesses and implement improvements to the project;
3. Continuously improve the monitoring system, enhance monitoring efficiency, and shorten the duration of fault location;
4. Accumulate best practices in operation and maintenance, provide guidance for business architecture design and component selection, and output operation and maintenance technical documents.
Job requirements:
1. More than 5 years of operation and maintenance or development experience in the Internet industry;
2. Proficient in Shell programming and proficient in 1-2 programming languages including Golang, Java, and Python;
3. Have good knowledge in networking, storage, security, and computer architecture;
4. Proficient in the working principles, deployment, and use of common middleware such as Nginx, LVS, Redis, Kafka, MySQL, etc;
5. Familiar with Jenkins, Gitlab, etc., with practical experience in developing and integrating CI/CD processes;
6. Familiarity with Docker/k8s container platform and related underlying technologies and principles is preferred;
7. Capable of responding and handling faults 24/7, with strong resilience, good service awareness, and team spirit;
8. Work meticulously, be good at thinking, and have strong abilities in data analysis and problem-solving.
Bonus points:
1. Candidates with experience in remote project assistance across regions are preferred;
2. Have relevant technical work experience in securities, futures companies, and blockchain;
3. Experience in developing complete automated operation and maintenance tools is preferred.
Job highlights:
1. High availability governance of the company's key fintech business lines;
2. Continuously promote high availability governance and improve business SLA through accident operation, quality operation, and risk operation;
3. Construction and refinement of automated operation and maintenance systems, continuously improving human efficiency.
Currently, there aren't any salaries for this role at Doo Group shared by other job seekers.
View more salaries from Doo Group โAchieve your dream job with our top-notch tools!
Resume Checker
Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!
AI InterviewPrep
Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!
Resume Builder
Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!