Job Description
Summary
What You Will Do:
- Contribute to projects on new and existing cloud infrastructure and services (AWS) using IaC tooling and devops philosophy.
- Adopt new architectures, and (re)design the compute/network infrastructure to solve scale and cloud governance problems.
- Build tools, integrations and services to help boost the developer productivity and automate workflows.
- Implement robust observability practices by designing and maintaining monitoring, logging, tracing and alerting solutions.
- Apply SRE principles to drive reliability engineering initiatives, including defining and monitoring service level objectives (SLOs) and error budgets.
- Participate in production operations, on-call and cloud governance practices.
What You Should Have:
- 3+ years of experience with cloud infrastructure, platform and services (AWS).
- Solid programming skills in a high level language (Python ,Go) and shell scripting.
- Good understanding of devops principles, IaC, and unix fundamentals.
- Experience with containerised distributed compute platforms like Kubernetes.
- Experience with modern monitoring stacks and alerting as code.
- Experience with version control, release processes, GitOps, CI and CD.
Skills
- AWS
- Cloud Computing
- Development
- Python