Job Description

Summary

As a core member of our infrastructure team, you will build and maintain major features, through inception, design, implementation and launch, working closely with product and engineering disciplines across the company. You will spend the majority of your time on cross-functional self-contained feature teams focused on delivering value to the customer, while other projects will be more internally focused on integrations, scalability, and performance.

Responsibilities 

  1. Own the site reliability process and systems from design and implementation to deployment and maintenance
  2. Educate the platform software engineering team on reliability best practices and collaborate to evolve the software engineering process to accommodate reliability principles
  3. Provide service outage escalation response alongside software engineers
  4. Manage multiple Kubernetes clusters across multiple environments and regions
  5. Manage and build core services and infrastructure across the entire engineering organization
  6. Help build an adaptable, high-velocity team
  7. Participate in on-call rotations to assist in resolving production incidents

Things that we believe are critical

  1. Expertise in site reliability engineering in a multi-datacenter production cloud environment with demanding up-time, real-time performance, and security requirements
  2. Experience adopting and employing open-source, home-grown, and commercial technology products as appropriate in support of the Infra Engineering mission
  3. Strong familiarity with AWS and Kubernetes
  4. Background in Software Engineering
  5. Experience with leading teams and projects
  6. Comfort working with senior management to allocate and prioritize engineering energy in support of the Infra Engineering mission in a real-world resource-constrained environment

Extra Credit

  1. Experience with cloud infrastructure and networking in a production context
  2. Experience building and/or using low-latency cross-region databases or high-volume trading applications
  3. Experience with HashiCorp tools (Vault, and Terraform)
  4. Experience with Kafka, Redis, and Postgres
  5. Experience with cloud providers beyond AWS (Azure, GCP, etc.)
  6. Expertise in cloud network security

 

Skills
  • AWS
  • Leadership
  • Software Engineering
  • Team Collaboration
© 2024 cryptojobs.com. All right reserved.