Job Description

Summary

As a Senior Site Reliability Engineer at Immutable, you will have the autonomy to influence our SRE practices, team, and processes, and to create a truly customer-driven SRE culture. As part of the SRE team, you will have a unique opportunity to shape the infrastructure, observability, and tooling patterns used at Immutable and to define what best-practice SRE for blockchain technologies will be. The role is broadly scoped, with cross-team impact across all the work we do.

You'll Be Empowered To 🎮:

  1. Develop and release infrastructure as code
  2. Create and maintain multiple Kubernetes clusters supporting a variety of internal and external use cases
  3. Manage our AWS Cloud environment in collaboration with our Security team
  4. Define SLOs, SLIs, monitoring, alerting, and incident response practices
  5. Set the bar for observability excellence within the organisation
  6. Measure system health, scalability, and performance metrics, and identify areas for improvement
  7. Ensure our incident management processes, automation, and remediation are world-class
  8. Act as a deep technical expert on the services we own (e.g., cleaning up tech debt, maintenance, new architecture patterns)
  9. Mentor other team members (where possible), drawing on your experience
  10. Lead by example to proactively foster an inclusive, diverse, and positive engineering culture across the business
  11. Champion community-building efforts and inclusion initiatives
  12. Work in close partnership with the management team to ensure a healthy engineering org

We'd Love You To Bring 🤝:

  1. 5+ years of experience in a similar role
  2. A deep understanding of SRE best practices and processes
  3. Experience in infrastructure-as-code or backend development
  4. Experience with one or more coding languages (Go, Python)
  5. Problem-solving approach (listening and reasoning)
  6. System design and strategic thinking
  7. Experience with AWS (or infrastructure at scale)
  8. Experience with observability systems (SaaS and self-hosted)
  9. Extensive incident management experience
  10. Clear and accurate communication
  11. Team spirit over silos
  12. Excellent collaboration skills, so you can work closely with product engineers and product owners to understand their context and co-design solutions that balance feature velocity with site reliability

Skills
  • AWS
  • Communication Skills
  • Development
  • Software Engineering
  • Strategic Thinking
  • Team Collaboration