Site Reliability Engineer with LiveRamp

Singapore

$96K - $192K a year

The SRE team is responsible for owning and supporting deployments of global products and for providing first-line operational support. We are looking for a Site Reliability Engineer who is excited about establishing and advocating for best practices for product deployments and SRE. You will be able to leverage your software engineering expertise to understand the needs of teams and guide them in improving their systems.

Responsibilities

  • Support and/or own the deployment of global products including setting up production and internal environments
  • Provide 24/7 first-line Engineering support (via follow-the-sun teams in all regions) for any issues related to global product deployment, availability, and internal operations.
  • Drive effective resolutions of core product issues with Engineering teams
  • Set up and maintain Infrastructure & Product Reliability monitoring and alerting
  • Maintain and enhance CI/CD Tooling and Terraform scripts in support of the mission, in close collaboration with the DevOps team
  • Maintain and enhance Engineering Operational Documentation for supported products.
  • Provide expertise to build and maintain product operational documentation and to set up product SRE practices
  • Support Security and Compliance governance in production environments
  • Work in close collaboration with SRE team members and Engineering organizations based in California, Paris, Nantong, London, Australia and others.

Qualifications

  • BSc degree in Computer Science, Engineering or relevant field.
  • 3+ years' experience in an SRE/DevOps or equivalent role.
  • Experience in Infrastructure as code (IaC) using Terraform.
  • Experience in building continuous integration declarative pipelines in Jenkins or CircleCI.
  • Experience with platforms like Kubernetes, containers, and public clouds (GCP or AWS).
  • Experience with deployment and monitoring of highly scalable products.
  • Experience programming in Python or Go.
  • Experience with SRE best practices; working knowledge of observability principles is a big plus.
  • Ability to diagnose technical problems, debug code, and automate routine tasks
  • Experience with securing systems in a public cloud environment.
  • Understands how to engage other engineers as stakeholders.
  • Enjoys working as part of a distributed team that is smart, ethical, friendly, hard-working, and productive.