Job Description

Summary

We are seeking a Site Reliability Engineer (SRE) to design, optimize, and support highly available systems across our global trading infrastructure. As part of GSR’s SRE team, you will manage a multi-regional cloud environment while integrating and automating our physical server inventory using Infrastructure as Code (IaC).

You will work across all layers of infrastructure, including:

  1. Networking & Exchange Connectivity
  2. Linux Systems & Kubernetes Administration
  3. Microservice Orchestration & Observability
  4. Disaster Recovery & Security Optimization

Your mission is to improve latency, scalability, and reliability, ensuring GSR remains a best-in-class market maker. We value engineers who drive automation, reduce friction, and enhance developer velocity through better tooling, CI/CD, and infrastructure design.

Who We’re Looking For

Core Skills

  1. Containers & Orchestration: Strong expertise in container security and Kubernetes (multi-cluster/global deployment is a plus).
  2. Distributed Systems & Messaging: Knowledge of clusters, storage, Kafka, and Aeron; experience with multicast or HPC.
  3. Automation & IaC: Proficiency in Python, Golang, or Rust with experience in IaC tools and immutable infrastructure.
  4. Continuous Delivery & Config Management: Familiarity with FluxCD, ArgoCD, and custom CD deployments. Strong grasp of CI/CD pipelines.
  5. Linux & Networking: Solid understanding of Linux internals, cgroups, routing, switching, firewalls, and DNS/service discovery.
  6. Databases: Experience with MySQL, MongoDB, and database administration (Flyway or Liquibase a plus).

Bonus Experience

  1. Data center operations
  2. Crypto, fintech, bare-metal provisioning, or trading experience

Skills
  • Database Management
  • Development
  • Networking
  • Python
  • Rust
  • Software Engineering
  • SQL
© 2025 cryptojobs.com. All rights reserved.