Job Description
Summary
The role
As a core member of our infrastructure team, you will build and maintain major features from inception through design, implementation, and launch, working closely with product and engineering disciplines across the company. You will spend the majority of your time on cross-functional, self-contained feature teams focused on delivering value to the customer, while other projects will be more internally focused on integrations, scalability, and performance.
Responsibilities
- Own the site reliability process and systems from design and implementation to deployment and maintenance
- Educate the platform software engineering team on reliability best practices and collaborate to evolve the software engineering process to accommodate reliability principles
- Provide service outage escalation response alongside software engineers
- Manage multiple Kubernetes clusters across multiple environments and regions
- Manage and build core services and infrastructure across the entire engineering organization
- Help build an adaptable, high-velocity team
- Participate in on-call rotations to assist in resolving production incidents
Things that we believe are critical
- Expertise in Security and DevSecOps
- Strong compliance knowledge (SOC 2)
- Expertise in site reliability engineering in a multi-datacenter production cloud environment with demanding uptime, real-time performance, and security requirements
- Experience adopting and employing open-source, home-grown, and commercial technology products as appropriate in support of the Infra Engineering mission
- Strong familiarity with AWS and Kubernetes
- Experience with leading teams and projects
- Comfort working with senior management to allocate and prioritize engineering energy in support of the Infra Engineering mission in a real-world resource-constrained environment
Extra Credit
- Experience with cloud infrastructure and networking in a production context
- Experience building and/or using low-latency cross-region databases or high-volume trading applications
- Experience with HashiCorp tools (Vault and Terraform)
- Experience with Kafka, Redis, and Postgres
- Experience with cloud providers beyond AWS (Azure, GCP, etc.)
- Expertise in cloud network security
Remote (US or EU preferred)
This is a flexible on-site or hybrid position at our headquarters office in Singapore. We can provide financial and logistical support for work visa procurement and relocation to Singapore, if applicable.
Skills
- AWS
- Leadership
- Networking
- Team Collaboration