Enable job alerts via email!

Data Engineer I

Howard Hughes Medical Institute (HHMI)

Washington (District of Columbia)

On-site

USD 86,000 - 141,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative research center is seeking a Data Engineer to enhance scientific discovery through robust data infrastructure. This role involves designing and optimizing scalable data pipelines for large datasets, directly supporting cutting-edge computational research initiatives, including AI applications. The ideal candidate will collaborate with multidisciplinary teams, ensuring data quality and accessibility. With a strong focus on innovation, this position offers the opportunity to work alongside world-class researchers and contribute to impactful science. Join a dynamic community committed to advancing our understanding of life sciences and enjoy a supportive team environment that promotes collaboration and knowledge sharing.

Benefits

Comprehensive health benefits
On-site childcare
Free gyms
On-campus housing
Social and dining spaces
Shuttle bus service

Qualifications

  • Proficiency in Python and R for data analysis and visualization.
  • Experience in data mining and statistical methods for insights extraction.

Responsibilities

  • Design and optimize scalable data pipelines for large datasets.
  • Collaborate with teams to ensure data quality and reproducibility.

Skills

Python
R
Data Analysis
Data Visualization
Linux Command Line
Data Mining
Statistical Methods
Communication Skills

Education

Bachelor's degree in Computer Science
Bachelor's degree in Data Science
Bachelor's degree in Statistics
Bachelor's degree in Applied Mathematics

Tools

AWS
Google Cloud
Numpy
Pandas
HDF5
Matplotlib
Jupyter Notebooks

Job description

Primary Work Address: 19700 Helix Drive, Ashburn, VA, 20147

Current HHMI Employees, click here to apply via your Workday account.

The Howard Hughes Medical Institute's Janelia Research Campus is a pioneering research center in Ashburn, Virginia, where scientists pursue fundamental questions in the life sciences. Our integrated teams of biologists, computational scientists, and tool-builders innovate research practices and technologies to solve biology's deepest mysteries. HHMI launched Janelia in 2006, establishing an intellectually enriching environment for scientists to do creative, collaborative, hands-on work. We share our methods, results, and tools with the scientific community.

Summary:

AI@HHMI: HHMI is investing $500 million over the next 10 years to support AI-driven projects and to embed AI systems throughout every stage of the scientific process in labs across HHMI. The AI initiative will be centered at HHMI's Janelia Research Campus. Janelia has been at the forefront of AI-driven research in biology for more than 15 years. Its forward-thinking structure, centralized funding, and collaborative culture make it ideally suited to take this bold leap forward. To learn more about the initiative, visit here.

About the role:

We're seeking a skilled Data Engineer to drive scientific innovation through robust data infrastructure. In this role, you'll design, develop, and optimize scalable data pipelines and tools for the ingestion, transformation, and integration of large, heterogeneous datasets. Your work will directly support computational research initiatives, including machine learning and AI applications. Collaborating closely with multidisciplinary teams of computational and experimental scientists, you'll help define and implement best practices in data engineering, ensuring data quality, accessibility, and reproducibility. You'll also be responsible for maintaining detailed documentation and automating workflows to streamline the path from raw data to scientific insight.

What we provide:

  • A competitive compensation package, with comprehensive health and welfare benefits.

  • A supportive team environment that promotes collaboration and knowledge sharing.

  • The opportunity to engage with world-class researchers, software engineers and AI/ML experts, contribute to impactful science, and be part of a dynamic community committed to advancing humanity's understanding of fundamental scientific questions.

  • Amenities that enhance work-life balance such as on-site childcare, free gyms, available on-campus housing, social and dining spaces, and convenient shuttle bus service to Janelia from the Washington D.C. metro area.

What you'll do:

  • Design and customize data pipelines, leveraging appropriate tools, methods, and storage formats to process large structured and unstructured datasets for analysis and AI model training.

  • Source, consolidate, and curate data to support a range of computational research needs, ensuring reproducibility through careful documentation of code, data, and workflows.

  • Apply statistical and programming tools (e.g., Python, R) to analyze datasets, extract insights, and communicate findings through clear visualizations.

  • Establish and maintain standards for data formats, storage, and processing workflows, while continuously learning new tools and collaborating closely with interdisciplinary teams.

What you bring:

  • A Bachelor's degree in Computer Science, Data Science, Statistics, Applied Mathematics or related fields and 0 to 2 years of relevant experience. An equivalent combination of education and relevant experience will be considered.

  • Proficiency in the use of the Linux command line, programming languages and frameworks and formats for data management (e.g., Python, R, Numpy, Pandas, HDF5).

  • Familiarity with high-performance computing environments and cloud storage (e.g., AWS, GoogleCloud).

  • Proficiency in the application of data mining and data analysis methods and techniques.

  • Proficiency in utilizing data visualization libraries and software (e.g., Matplotlib, R, Jupyter notebooks).

  • Detail-oriented, creative, and organized team player with strong communication skills and a collaborative mindset.

  • Able to effectively manage time, prioritize tasks, and clearly convey complex data concepts to technical and non-technical audiences.

Physical Requirements:

Remaining in a normal seated or standing position for extended periods of time; reaching and grasping by extending hand(s) or arm(s); dexterity to manipulate objects with fingers, for example using a keyboard; communication skills using the spoken word; ability to see and hear within normal parameters; ability to move about workspace. The position requires mobility, including the ability to move materials weighing up to several pounds (such as a laptop computer or tablet).

Persons with disabilities may be able to perform the essential duties of this position with reasonable accommodation. Requests for reasonable accommodation will be evaluated on an individual basis.

Please Note:

This job description sets forth the job's principal duties, responsibilities, and requirements; it should not be construed as an exhaustive statement, however. Unless they begin with the word "may," the Essential Duties and Responsibilities described above are "essential functions" of the job, as defined by the Americans with Disabilities Act.

#LI-BG1

Compensation and Benefits

Our employees are compensated from a total rewards perspective in many ways for their contributions to our mission, including competitive pay, exceptional health benefits, retirement plans, time off, and a range of recognition and wellness programs. Visit our Benefits at HHMI site to learn more.

Compensation Range

$86,181.60 (minimum) - $107,727.00 (midpoint) - $140,045.10 (maximum)

Pay Type:

Annual

HHMI's salary structure is developed based on relevant job market data. HHMI considers a candidate's education, previous experiences, knowledge, skills and abilities, as well as internal consistency when making job offers. Typically, a new hire for this position in this location is compensated between the minimum and the midpoint of the salary range.

HHMI is an Equal Opportunity Employer

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Data Engineer I

Mercury Insurance

Remote

USD 85,000 - 158,000

10 days ago

Data Engineer I

Fearless

Baltimore

Remote

USD 79,000 - 124,000

30+ days ago

Data Engineer I

Charter Communications

Town and Country

On-site

USD 60,000 - 95,000

2 days ago
Be an early applicant

Data Engineer - I (AWS, Python, Databricks)

Travelers

Hartford

On-site

USD 106,000 - 176,000

6 days ago
Be an early applicant

Data Engineer - I (AWS, Python, Databricks)

The Travelers Indemnity Company

Hartford

On-site

USD 106,000 - 176,000

6 days ago
Be an early applicant

Senior Software Engineer I - Full Stack (Remote Eligible)

Smartsheet

Washington

Remote

USD 140,000 - 200,000

5 days ago
Be an early applicant

Senior Software Engineer I, Front End - Chart View (Remote Eligible)

Smartsheet

Washington

Remote

USD 140,000 - 185,000

5 days ago
Be an early applicant

Data Engineer I

Howard Hughes Medical Institute (HHMI)

Ashburn

On-site

USD 86,000 - 141,000

13 days ago

Field Engineer I, II, or III (Wind)

IEA Constructors LLC, a MasTec Company

Coal City

Remote

USD 65,000 - 90,000

7 days ago
Be an early applicant