Job Description
Summary
The Data Solutions/Transpose team is an exciting new member of the Chainalysis family, working to deliver a cutting edge blockchain intelligence platform used by the world’s foremost intelligence, law enforcement, tax, and regulatory agencies, alongside many market leaders in the private sector. Our technology is used to automate and simplify solutions to complex investigations, provide real-time threat actor monitoring, give insight into the topology of threat landscapes, and much more.
As a Senior Data Scientist, you’ll lead the design and curation of all of the data assets that are provided to our customers within our product to execute on these mission areas. You will be responsible for interacting directly with our customers, understanding unmet mission needs, and translating this into the development and improvement of data assets that leverage the data produced by dozens of other teams across Chainalysis, as well as datasets that we purchase from external providers.
You’ll help us to raise the bar on data quality, by working closely with Data Engineers to implement reliability, trustworthy pipelines and automated data quality checks. In aggregate, the work that you lead will provide our customers with the ability to derive powerful and unique insights that are used to drive global investigatory efforts, trans-national threat actor monitoring, national security enhancements, and much more.
In this role, you’ll:
- Work directly with our customers, their account teams, and our product management function within Chainalysis, to deeply understand customer focuses and mission areas. With this in mind, you’ll then work with other Data Scientists to identify opportunities to further assist our customers by creating or improving existing data assets.
- Work with our Data Engineers to implement pipelines that create these data assets in a scalable, trustworthy, and accurate way.
- Create and maintain methodologies to ensure that data assets we create are maintainable, observable, and do not develop regressions over time.
- Work with other teams across Chainalysis to ensure that we are tracking the right ‘building block’ datasets to build these data assets on top of.
- Ensure that our landscape of data assets is standardized, self-explanatory, well-documented, easily usable, and the linkage between customer mission area and relevant data assets is clear.
- Conduct data analysis and customer/stakeholder conversations to understand the success of customers with our data, and our product.
- Experiment with new methodologies to derive powerful conclusions from the hundreds of terabytes worth of datasets we already have accessible in our platforms.
- Manage the lifecycle of data assets (customer-facing launches, changes, deprecations, etc.)
- Have a significant impact on the landscape of national security, international law enforcement, and promoting safe adoption of cryptocurrency in the mainstream.
We’re looking for candidates who have:
- Strong development experience in Python
- Excellent SQL Skills
- Knowledge of SQL database technologies
Nice to have experience:
- Experience designing and launching customer-facing data assets
- Experience working with blockchain/crypto data
- Experience with Databricks
- Experience with PySpark/Spark SQL/DLT
- Experience with Apache Flink
Technologies we use:
- Python
- SQL (PostgreSQL, Databricks SQL Warehouses)
- Databricks (Spark SQL, DLT, PySpark)
- Kafka
- Google Cloud Platform
Skills
- Cryptocurrency
- Database Management
- Development
- Python
- SQL
- Team Collaboration