DevOps

Site Reliability Operator

Full-time employment, full remote or on-site at Central Jakarta (Menteng), your choice. You will be responsible for maintaining services once they are alive by measuring and monitoring their performance and availability metrics and overall system health.

Tanggung Jawab

  • Automate key deployment, monitoring, and verification tools
  • Maintain services once they are alive by measuring and monitoring their performance and availability metrics and overall system health
  • Work collaboratively with the engineering team to support new features, releases, and services
  • Develop automation tools to streamline all aspects of automatic deployment

Persyaratan

  • Bachelor's degree (or above) in Computer Science, Statistics, Mathematics, or a similar quantitative field
  • 5+ years of in-depth experience using provisioning tools (Ansible, Chef, Puppet, etc)
  • 2+ years of exposure to orchestration systems (DC/OS, Docker Swarm, Kubernetes, etc)
  • Expertise in Python or any other scripting environments

Hasil Kerja

  • Consistent and predictable SLA for all services
  • Root cause analysis for every operation incident
Lamar