Senior Data Engineer (Generative AI, Agentic AI)

Stealth AI Startup

San Francisco (CA)

On-site

USD 200,000 - 300,000

Full time

2 days ago

Job summary

A rapidly growing AI startup in the San Francisco Bay Area seeks a Senior Data Engineer with a strong interest in Generative AI and cybersecurity. The role involves building scalable data infrastructure for the company's AI teams, ensuring data quality, and supporting its cybersecurity products. The company offers competitive compensation and a collaborative culture.

Benefits

  • Equity options
  • Generous wellness stipends
  • Access to latest AI/ML tools

Qualifications

  • Experience in developing ETL/ELT data pipelines.
  • Strong knowledge of data governance and storage solutions.
  • Proficiency in real-time data streaming.

Responsibilities

  • Design, build, and maintain ETL/ELT pipelines.
  • Set up and manage scalable data storage solutions.
  • Ensure data quality and security through governance practices.

Skills

  • ETL/ELT processes
  • Data governance
  • Scalable data storage
  • Real-time data streaming
  • Collaboration

Education

Bachelor's degree in Computer Science, Data Science or related field

Tools

  • Snowflake
  • Databricks
  • Apache Kafka
  • Azure Data Lake

Job description

Senior Data Engineer (Generative AI, Agentic AI)

This range is provided by Stealth AI Startup. Your actual pay will be based on your skills and experience. Talk with your recruiter to learn more.

Base pay range

$200,000.00/yr - $300,000.00/yr

Additional compensation types

Annual Bonus and Stock options

Are you passionate about Generative AI and eager to make a significant impact in the cybersecurity space? Join us at our cutting-edge AI startup in the San Francisco Bay Area, where we are assembling a world-class team to tackle some of the most pressing challenges in cybersecurity.

Why Join Us?

  • $25M Seed Funding: We are well-funded, with $25 million raised in our seed round, providing the resources to innovate and scale rapidly.
  • Proven Early Success with Fortune 500 Customers: We have begun partnering with Fortune 500 companies, an early sign of growing trust in our solutions and of the potential of our AI-powered cybersecurity offerings.
  • Experienced Leadership: Our founding team consists of second- and third-time entrepreneurs, each with over 25 years of experience in the cybersecurity industry. They are the owners of successful cybersecurity companies, with previous ventures achieving valuations of over $3 billion. Their proven expertise and vision drive our ambitious goals, positioning us to lead in the AI-powered cybersecurity space.
  • World-Class Leadership Team: Our Heads of AI, Engineering, and Product bring extensive experience from some of the world’s most influential companies, ensuring top-tier mentorship, direction, and vision.
  • Cutting-Edge AI Solutions: Our team leverages the most advanced AI technologies, including Large Language Models (LLMs) and Generative AI.
  • Generous Compensation: We offer highly competitive salaries, equity options, and a supportive work environment. Your contributions will be valued and rewarded as we grow together.
  • Cybersecurity Knowledge Preferred but Not Required: While experience in cybersecurity is a plus, we are primarily seeking top-tier talent in AI/ML and data science who are passionate about solving complex problems.

About the Role

We are hiring a Senior Data Engineer to develop, scale, and maintain the essential data infrastructure supporting our AI-driven cybersecurity platform. The Senior Data Engineer ensures reliable and integrated data flows, robust storage systems, and comprehensive pipeline solutions, empowering our Applied Scientists and ML Engineers with timely, high-quality data.

What You’ll Do

  • Design, build, and maintain scalable ETL/ELT pipelines, facilitating efficient data ingestion, transformation, and delivery of diverse datasets critical for AI-driven cybersecurity analysis and modeling.
  • Set up and manage scalable data storage solutions, including data lakes, data warehouses, and databases (Snowflake, Redshift, Databricks, Azure Data Lake), optimized for large-scale analytics and AI workloads.
  • Implement rigorous data transformation, cleaning, normalization, and embedding processes (text, code, image embeddings using Unsloth, Databricks, and Google GenAI) to ensure datasets are prepared effectively for analysis and AI model training.
  • Optimize data storage and processing workflows for performance, scalability, and efficiency, employing indexing, partitioning, and distributed computing strategies.
  • Integrate diverse internal and external data sources, including APIs, databases, event-driven streams (Kafka), and third-party cybersecurity data services, ensuring consistent, high-quality data delivery.
  • Collaborate closely with Applied Scientists, ML Engineers, and cybersecurity stakeholders to meet the complex and evolving data requirements of model development, analytics, and experimentation.
  • Ensure data quality, integrity, and security via robust data governance practices, leveraging orchestration and monitoring tools such as Apache Airflow for anomaly detection and pipeline management.
  • Develop and maintain real-time data streaming pipelines using Apache Kafka, Spark Streaming, and Flink to enable responsive and dynamic cybersecurity threat detection.
  • Manage relational and NoSQL databases to maintain performance, scalability, and secure data retrieval essential for AI-driven decision-making processes.
  • Create and support infrastructure for vector databases that store embeddings (Milvus, Pinecone), which are crucial for retrieval, search operations, and ML workflows.

Our Culture and Team

  • Collaborative Environment: You’ll join a dynamic, fast-paced startup where innovation thrives and every team member's voice is valued.
  • World-Class Leadership: Our Heads of AI, Engineering, and Product bring extensive experience from some of the world’s best and most influential companies, ensuring top-tier mentorship and strategic direction.
  • Growth Opportunities: We support your professional development through mentorship, access to industry conferences, and opportunities to work on cutting-edge AI projects that make a global impact.
  • Diversity and Inclusion: We are committed to building a diverse and inclusive team that brings a variety of perspectives to solving today’s cybersecurity challenges.

Work Location

Our office is located in Silicon Valley Center in North San Jose, CA, providing a collaborative environment where innovation thrives.

Perks and Benefits

  • Equity options, ensuring you have a stake in the company’s success.
  • Generous wellness stipends and professional development budgets.
  • Access to the latest tools and technologies for AI/ML development.

Ready to join us on this groundbreaking journey? Apply today to become part of our mission to revolutionize cybersecurity with AI!

#AI #MachineLearning #DataScience #Cybersecurity #Startup #Hiring #SanFranciscoBayArea #Innovation #TechJobs

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Industries: Technology, Information and Internet
