
Lead Data Engineer

Mysten Labs

United States

Remote

USD 120,000 - 160,000

Full time

Yesterday


Job summary

Mysten Labs is seeking a Staff Data Engineer to design and execute advanced data systems for blockchain analytics. The role involves building scalable data ingestion frameworks and requires expertise in SQL and Python along with a strong background in data governance and orchestration.

Qualifications

  • 5+ years experience in data engineering.
  • Experience with SQL and Python required.
  • Strong opinions on data governance and orchestration.

Responsibilities

  • Design and implement scalable ingestion pipelines.
  • Optimize data warehousing with clear schemas.
  • Enable data discoverability and governance.

Skills

SQL
Python
Data Governance
Data Orchestrators

Job description

Mysten Labs believes that decentralized and open protocols are the bedrock of the internet of value. This is why at Mysten Labs, we are creating foundational infrastructure to accelerate the adoption of decentralized protocols based on blockchain technologies.

The Data team is looking to hire a Staff Data Engineer to design and execute on the next iteration of our data systems. This is an exciting opportunity for someone who wants to design web3 analytics systems that match web2 systems in scale, reliability, and robustness. Interested candidates will have the opportunity to help define the new standards for blockchain analytics on top of the world’s premier object-centric blockchain.

This role combines high-capacity, hands-on individual contributor work with data architecture planning. As a Staff Data Engineer, you will partner with the heads of Data and Engineering to process high-volume data reliably and robustly. You will get to touch all parts of the business at Mysten Labs and become the expert on data structures on the Sui blockchain.

Responsibilities:

  • Design and implement scalable ingestion pipelines with a reusable and modular framework

    • Build robust, reusable frameworks to ingest data from internal sources (e.g., Prod DBs, cloud buckets) and external APIs or files (e.g., CSVs, webhooks).

    • Ensure idempotency, backfill support, and error handling in pipeline design.

  • Optimize and own data warehousing, with clear table definitions, schemas, and cost efficiency

    • Architect a centralized data lake/warehouse with clear schemas and partitioning strategies.

    • Support both batch and streaming workloads, and optimize for cost and performance.

  • Enable data discoverability, usability, and governance

    • Implement or integrate data cataloging and lineage tools.

    • Define naming conventions, documentation standards, and ownership metadata to make data self-serve and intuitive for data scientists, analysts, and product / GTM teams.

    • Set up a mechanism for scalable access controls (with RBAC or ABAC), PII tagging, and data obfuscation.

    • Establish data quality checks, validation pipelines, and alerting for broken or stale data.

  • Develop a strong understanding of how to use on-chain and off-chain data together

Required Qualifications:

  • 5+ years experience in data engineering

  • Strong SQL and Python

  • Strong and informed opinions on data orchestrators, catalogs, governance, and testing frameworks

  • Experience combining in-house and external data

Preferred Qualifications:

  • Experience with, or interest in learning, Rust

  • Prior blockchain and cryptocurrency experience

  • Experience designing and implementing streaming data solutions

  • Experience with, or interest in, security analytics and data for security teams

Employment is contingent upon the successful completion of a background check, which may include verification of employment history, education credentials, criminal history, and other relevant information.

Regarding the recent rash of technology job scams: Be aware that emails from genuine Mysten Labs group recruiters will always come from the @mystenlabs.com domain or related subdomains (e.g., mystenlabs.com/careers). Remember: you can always verify positions on our job boards at www.mystenlabs.com/careers.

Our team is remote-first and we are hiring across the world. Here at Mysten Labs, you’ll be joining a world-class team with tremendous growth potential as we bring the next billion users to web3. We raised a $300M Series B round from top Silicon Valley venture funds such as Jump Crypto, Andreessen Horowitz (a16z), Binance Labs, Redpoint, Lightspeed, Coinbase Ventures, Electric Capital, Standard Crypto, NFX, Slow Ventures, Scribble Ventures, Samsung Next, and Lux Capital, along with other investment firms and strategic partners. Come join us and build the future of web3!
