Job Description
Summary
As an On-Chain Protocols Data Engineer, you’ll work with our data team to expand our understanding of the on-chain protocol ecosystem at large. You will be responsible for services that continuously monitor the world’s most popular blockchains for new smart contract activity. When this activity happens, our pipelines index, extract, standardize, and organize the on-chain data across dozens of categories, including DEXes, bridges, staking and lending protocols, and many more. Together, these pipelines form the basis of the world’s most comprehensive blockchain intelligence layer.
In this role, you’ll:
- Work as part of our On-Chain Protocols team to develop and maintain data pipelines responsible for indexing, categorizing, standardizing, and organizing all smart contract events that occur on-chain.
- Collaborate to define the roadmap for new chain and new protocol onboarding efforts into these systems.
- Design the schema and structure of datasets for use by our customers.
- Lead and contribute to efforts to improve the scalability, reliability, and efficiency of these data indexing systems (new protocols, new chains, etc.).
- Explore new opportunities for integrating emerging technologies (artificial intelligence, machine learning, large language models) into on-chain protocol indexing.
- Work with our Data Engineering team to optimize the core database layer (schemas, indexes, etc.) to improve dataset usability.
We’re looking for candidates who have:
- Strong development experience in Python
- Excellent SQL skills
- Knowledge of OLTP and OLAP database technologies
- A deep understanding of on-chain protocols/smart contracts. For example, you can answer:
  - What is an RPC call?
  - What is a topic0?
  - What is a log/trace?
- A passion for the web3/crypto & DeFi ecosystem
Nice to have:
- Experience deploying workloads in Kubernetes
- Experience developing ETL pipelines
- Experience with Databricks
- Experience with PostgreSQL
- Experience reading & writing Solidity
- Experience integrating large language models into software systems
- Experience with TheGraph (or writing subgraphs in general)
Technologies we use:
- Python
- SQL (PostgreSQL, Databricks)
- Kubernetes
- Databricks
- Etherscan/blockchain explorers
- Redis
Skills
- Database Management
- Development
- Python
- Software Engineering
- SQL