Job Description
Summary
What will you be working on?
- The individual will be responsible for monitoring our exchange service status, refining deployment pipeline, monitoring, troubleshooting and identifying the root cause of issues.
- Focus on ensuring the stability of the service and helping dev-team to deploy features to production.
- Responsible for handling and solving issues when service goes wrong.
- Responsible for designing a better deployment pipeline.
- Build tools to monitor systems and identify issues.
Who will you be working with?
- Work with development and test engineers.
- What challenges will you face?
- Knowledge of blockchain/crypto service or expertise in the online blockchain/crypto industry is a BIG plus.
- Enjoys breaking things and solving problems - not just able to find out the 'what', but also the 'why'.
- Excellent troubleshooting, listening and problem-solving skills with the ability to set project expectations and meet deadlines.
- Ability to work in a fast-paced, multi-task environment.
What tech stacks/skills will you be using?
- Must to have! - Terraform
- Nice to have! - Telegram bot
- Nice to have! - Security related experience
- Nice to have! - Develop tools to simplify DevOps work
- Familiar with GCP, AWS or other cloud services.
- Have experience in CI|CD Workflow.
- Familiar with Docker, kubernetes and have experience in using Kubernetes to manage production-grade cluster.
- Familiar with mysql, mq and redis.
- Familiar with monitoring system prometheus, be able to customize grafana dashboard.
- Familiar with logging system ELK or EFK.
- Coding ability: python or golang.
- 24/7 on-call when there are urgent issues to handle and must be handled in a timely + responsible manner.
Skills
- AWS
- Development
- Python
- Software Engineering