Site Reliability Engineer (SRE) with SimSpace

Boston, United States

$100K - 160K a year

Do you want to help shape the future of training cyber security teams against malicious cyber criminals and foreign actors? Would you like your work to have a meaningful impact in an area as important as cyber security? SimSpace is looking for Site Reliability Engineers to ensure our products and services for cyber security testing, training, assessments, and tool development are always available for our users.

We are looking for Site Reliability Engineers (SREs) to deliver our products and services with high availability and scalability out of our data centers and on-prem solutions for our growing set of worldwide customers. As we expand our customer deployments, we are currently seeking experienced SREs to deliver insights from operating reliable, large scale systems and who bring fresh ideas, demonstrate a unique and informed viewpoint, and enjoy collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction. Our work integrates with many teams across the company and requires highly technical, innovative, flexible thinkers with excellent communication and passion for delivering the best experience to help people get jobs.

SimSpace is seriously disrupting the status quo for Cyber Security Ranges that help companies understand their risk and improve through practice and experimentation. We are a fast moving company looking for the right kind of talent and determination to join our team. We have a fun, but effective company culture and we want you to fit in. We’re a start-up that already ships products to large enterprise customers and governments. There are many technical challenges and we need your talent to succeed. We’re still small and growing, so it’s a great time to join us.

You will:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large distributed software applications and content
  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development, DevOps and content teams team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Participate in the 24x7 support coverage as needed

You are a good fit if you have:

  • Bachelor’s degree in computer science or other highly technical, scientific discipline
  • Experience writing code in Java, Go, Shell, Perl, Python, or a similar language
  • Experience with distributed storage technologies like NFS, S3 as well as dynamic resource management frameworks (Kubernetes, TKG)
  • Experience with VMWare and AWS
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • Excellent debugging and troubleshooting skills
  • At least three years of experience in production datacenters
  • Experience working on enterprise products within Cloud, SaaS, and on-premises for enterprise companies
  • A working knowledge of Cyber Security or Information Security
  • Ability to clearly define success and ability to drive to get things done
  • Ability to describe complex topics in a clear concise manner
  • Ability to be a highly motivated self-starter that is accepting of other opinions and operates effectively in a team
  • U.S. citizenship as required by our existing U.S. Government contracts


  • Competitive salary and benefits (medical, dental, 401k)
  • Equity options in the company
  • Flexible hours provided you overlap the main part of the day to interact with others
  • Paid time-off

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.