Site Reliability Engineer with Nautilus

Zurich, Switzerland

CHFΒ 105K - 135K a year

Us: Do you have an eye for detail and enjoy solving complex problems in a team environment? We are currently looking to add a Site Reliability Engineer to our Dev Ops team. The right individual will help drive observability efforts for our connected fitness application - JRNY. This will primarily involve being responsible for application infrastructure monitoring and alerting to ensure that build, deployment and production infrastructure is highly available and reliable. You will also work as part of the incident response team helping investigate and resolve issues. The right individual will have an eye towards automation and process improvement.

πŸ‘πŸ½ Startup spirit prevails: agile, fast, uncomplicated, family-like. And surprisingly large amounts of beer for a fitness company. πŸ‘πŸ½ No funding problems – full focus on product & innovation. πŸ‘πŸ½ Office hosts a fully equipped gym 1min from Zurich main station. 🀌🏽 Startup spirit prevails: tasks are challenging and expectations are high.

You: Tech-driven fitness enthusiast or techie with a heart for sports. Looking for a job that is more than just work, for peers that are more than just co-workers. Problem-solver. Your mind works quick like Usain Bolt and you refuse to give in like a marathon runner. Agile. You challenge the status quo, always striving for better and best. Great skills as a DevOps engineer and the intent to keep learning every day. You don't take yourself too seriously and you get sarcasm. Up for every challenge. You speak up and don't have a problem with transparent and direct communication. No fear of responsibility, but desire to take ownership. Self-confidence and can-do-attitude.

Must:

β€’ Deep understanding of cloud architecture and services (AWS and/or Microsoft Azure) β€’ Understanding of CI/CD principles and practices especially Docker-based pipelines (e.g. CodeFresh or Jenkins). β€’ Strong understanding and familiarity with application alerting and monitoring: New Relic, Grafana, Datadog, etc β€’ Understanding of infrastructure as code and experience with Terraform and/or Cloudformation β€’ Understanding of orchestration technologies - Kubernetes β€’ Ability to collaborate with other engineers on code reviews, internal infrastructure improvements and process enhancements

Nice to have:

β€’ Scripting skills a plus (Java, Node.js, Python, Go) β€’ Experience with security scanning and compliance tools a plus

Responsibilities:

β€’ Manage and improve availability monitoring for JRNY microservices, build/deployment pipelines and a range of other AWS services. β€’ Work with tools such as New Relic, DataDog, Cloudwatch, Loggly and Firebase. β€’ Analyze a range of system performance metrics to drive proactive remediation of production impacting incidents and events. β€’ Help improve current SRE practices through root cause analysis and incident post mortems. β€’ Work with platform teams to help design and implement solutions that improve system reliability and performance.