DevOps

Site Reliability Operator

Full-time employment, full remote or on-site at Central Jakarta (Menteng), your choice. You will be responsible for maintaining services once they are alive by measuring and monitoring their performance and availability metrics and overall system health.

Responsibilities

  • Automate key deployment, monitoring, and verification tools
  • Maintain services once they are alive by measuring and monitoring their performance and availability metrics and overall system health
  • Work collaboratively with the engineering team to support new features, releases, and services
  • Develop automation tools to streamline all aspects of automatic deployment

Requirements

  • Bachelor's degree (or above) in Computer Science, Statistics, Mathematics, or a similar quantitative field
  • 5+ years of in-depth experience using provisioning tools (Ansible, Chef, Puppet, etc)
  • 2+ years of exposure to orchestration systems (DC/OS, Docker Swarm, Kubernetes, etc)
  • Expertise in Python or any other scripting environments

Deliverables

  • Consistent and predictable SLA for all services
  • Root cause analysis for every operation incident
Apply