Site Reliability Engineer (SRE/DevOps)

Icon salary Salary
Negotiable
Icon Location Location
Ho Chi Minh

Job Overview And Responsibility

OBJECTIVES We are searching for a talented and motivated Site Reliability Engineer (SRE) to join our growing TechOps department. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our critical infrastructure. You will collaborate with developers and operations teams to automate IT processes, troubleshoot incidents, and implement best practices for infrastructure management. WHAT YOU WILL DO Collaborate with development teams to enhance our CI/CD pipeline for increased productivity. Build and optimize CI/CD for operation and deployment as automatically as possible. Implement and maintain Infrastructure-as-Code (IaC) tools like Terraform for automated infrastructure provisioning and configuration. Develop, automate, and implement monitoring and alerting systems to proactively identify and troubleshoot infrastructure issues. Cloud platform operations and optimize cloud spending to ensure efficient resource utilization. Implement and maintain robust security controls across the cloud infrastructure. Leverage automation tools and techniques to streamline infrastructure management tasks. Support the entire application lifecycle, from concept to design, test, release, documentation. Stay up-to-date with the latest trends and technologies in site reliability engineering. Document infrastructure processes and procedures.

Required Skills and Experience

QUALIFICATIONS Minimum 2 years of experience in a similar SRE/DevOps role. Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience) Experience with Linux/Unix operating systems. Experience with scripting languages (Python, Bash, etc.). Strong understanding of CI/CD workflows with experience in Gitlab and ArgoCD. Proficiency in building and optimizing Gitlab pipelines. Experience with container and container orchestration technology such as EKS, ECS, or GKE, and the ability to write helm charts. Familiarity with automation tools and scripting languages (Python, Shell scripting, Ansible, Terraform). Working knowledge of observability and monitoring tools (Prometheus, Grafana, HoneyComb, DataDog, PagerDuty). Excellent communication and collaboration skills. Ability to work independently and as part of a team. A passion for learning and staying up-to-date with the latest technologies.

Why Candidate should apply this position

Flexible working time Learning and leisure budget Annual Company Bonus Team-building budget Lunch and parking allowance Enjoyment of music activities Employee Assistance Program 13th month salary

Preferred skills and experiences

Deploying, operating, maintaining, and upgrading GitLab and ArgoCD infrastructure. Experience with cloud platforms (AWS, GCP) is a plus. Having cloud certifications is a plus (AWS or GCP Associate Level or higher).

Jenny Ho

Headhunter | Recruiter
Verified
employee 473 candidates
cup 112 interviews
health 27 offers

Apply for this job

Successfully!

Thank you, you have sent the information successfully.

← View more Jenny Ho's jobs
upload Click or drag file to this area to upload PDF only (3MB), You can update only 1 CV

Jenny Ho

Headhunter | Recruiter
Verified
Icon employee 473 candidates
Icon cup 112 interviews
Icon health 27 offers

Completed jobs (27)