Generative AI Data Engineer - Remote

The Hartford

Hartford (CT)

Remote

USD 125,000 - 189,000

Full time

30+ days ago

Job summary

Join a forward-thinking insurance company as a Staff Data Engineer, where you will play a pivotal role in shaping the future of Generative AI. This innovative position involves designing and building robust infrastructure to support cutting-edge AI applications, collaborating with diverse teams, and developing data pipelines that drive innovation at scale. With a hybrid work schedule, you will have the flexibility to work both remotely and in the office, contributing to exciting projects that enhance the company's technological capabilities. If you're passionate about data engineering and eager to make a significant impact, this opportunity is perfect for you.

Benefits

Short-term bonuses
Long-term incentives
On-the-spot recognition

Qualifications

  • 4+ years of experience with AWS cloud and 8+ years in data-intensive solutions.
  • Strong programming skills in Python and Java with expertise in CI/CD pipelines.

Responsibilities

  • Design and build fault-tolerant infrastructure for Generative AI applications.
  • Collaborate with cross-functional teams to develop scalable solutions.

Skills

AWS Cloud
Python
Java
Problem-solving
Communication
Agile methodologies
Data engineering
Natural Language Processing

Education

Bachelor's degree in Computer Science
Bachelor's degree in Computer Engineering

Tools

Terraform
CloudFormation
Jenkins
GitHub Actions
AWS OpenSearch
SageMaker
Docker

Job description

Staff Data Engineer - GE07CE

We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.

At The Hartford, we are seeking a Gen AI Data Engineer to build fault-tolerant infrastructure that supports Generative AI applications, and to design, develop, and deploy data pipelines that solve complex problems and drive innovation at scale.

We are founding a dedicated Generative AI platform engineering team to build our internal developer platform, and we are looking for an experienced Staff Data Platform Engineer - Generative AI to help us build the foundation of our Generative AI capability. You will work on a wide range of initiatives: building ETL pipelines, training a retrieval re-ranker, working with the DevSecOps team to build the CI/CD pipeline, designing Generative AI infrastructure that conforms to our strict security standards and guardrails, and working with the data science team to improve the accuracy of LLMs.

The Generative AI team comprises multiple cross-functional groups that work in unison to ensure a sound move from our research activities to scalable solutions. You will collaborate closely with our cloud, security, infrastructure, enterprise architecture, and data science teams to conceive and execute essential functionalities.

This role has a hybrid work schedule. Candidates who live near one of our office locations (Hartford, CT; Charlotte, NC; Chicago, IL; Columbus, OH) are expected to work in an office three days a week (Tuesday through Thursday).

Candidates must be eligible to work in the US without sponsorship now or in the future.

Responsibilities:

  • Design and build fault-tolerant infrastructure to support the Generative AI reference architecture (RAG, summarization, agents, etc.).
  • Ensure code is delivered without vulnerabilities by enforcing engineering practices, code scanning, etc.
  • Build and maintain IaC (Terraform/CloudFormation) and CI/CD (Jenkins) scripts, CodePipeline, uDeploy, and GitHub Actions.
  • Partner with our shared service teams, such as Architecture, Cloud, and Security, to design and implement platform solutions.
  • Collaborate with the data science team to develop a self-service internal developer platform for Generative AI.
  • Design and build the data ingestion pipeline for fine-tuning LLMs.
  • Create templates (architecture as code) that implement the reference architecture's application topology.
  • Build a feedback system using human-in-the-loop (HITL) review for supervised fine-tuning.

Qualifications:

  • Bachelor's degree in Computer Science, Computer Engineering, or a technical field.
  • 4+ years of experience with AWS cloud.
  • At least 8 years of experience designing and building data-intensive solutions using distributed computing.
  • 8+ years building and shipping software and/or platform infrastructure solutions for enterprises.
  • Experience with CI/CD pipelines, Automated Testing, Automated Deployments, Agile methodologies, Unit Testing and Integration Testing tools.
  • Experience building scalable serverless applications (real-time/batch) on the AWS stack (Lambda + Step Functions).
  • Knowledge of distributed NoSQL database systems.
  • Experience with data engineering, ETL technology, and conversation UX is a plus.
  • Experience with HPCs, vector embeddings, and hybrid/semantic search technologies.
  • Experience with AWS OpenSearch, Step Functions, Lambda, SageMaker, API Gateway, and ECS/Docker is a plus.
  • Proficiency in customization techniques across various stages of the RAG pipeline, including model fine-tuning, retrieval re-ranking, and hierarchical navigable small world (HNSW) graph indexing, is a plus.
  • Strong proficiency in embeddings, ANN/KNN, vector stores, database optimization, and performance tuning.
  • Extensive programming experience with Python and Java.
  • Experience with LLM orchestration frameworks such as LangChain and LlamaIndex.
  • Foundational understanding of Natural Language Processing and Deep Learning.
  • Excellent problem-solving skills and the ability to work in a collaborative team environment.
  • Excellent communication skills.
  • Candidate must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford’s total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is: $125,760 - $188,640

Equal Opportunity Employer/Females/Minorities/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age


