Data Engineer II – Clinical

Scorpion Therapeutics

United States

Remote

USD 105,000 - 125,000

Full time

Yesterday

Job summary

Scorpion Therapeutics is hiring a Data Engineer II for a remote position focused on clinical data warehousing solutions and integration. The role involves developing methodologies and collaborating with cross-functional teams to enhance data-driven decisions in pharmaceutical and clinical settings. Candidates should have an advanced degree and several years of relevant experience, with strong programming and analytical skills.

Benefits

401k
Medical, dental, vision coverage
Paid sick leave
Generous paid time off program

Qualifications

  • 3+ years of experience in programming, data analytics, and R&D related to clinical diagnostics.
  • Proficient in SQL, with knowledge of NoSQL query languages.
  • Experience with ETL processes and data pipelines.

Responsibilities

  • Develop new methods for data management and governance.
  • Collaborate with teams for data integration projects.
  • Perform data analysis using statistical techniques.

Skills

Programming languages
Data analytics
Cloud computing
Data governance
Problem-solving

Education

PhD or Master's in Data Science, Computer Science, or a related field

Tools

AWS
SQL
NoSQL
Git

Job description

Data Engineer II - Clinical – Remote

Compensation: $105,000 - $125,000 per year. You are eligible for a Short-Term Incentive Plan with the target at 7.5% of your annual earnings; terms and conditions apply.

Main Responsibilities

The main responsibility of this role is to collaborate with stakeholders across the organization to design methodologies and tools that leverage multimodal data to guide clinical, pharmaceutical, commercial, and business decisions. This position involves creating new clinical data warehousing solutions, data transformations, and data integration assets, as well as supporting changes, enhancements, and maintenance of existing assets for clinical data initiatives. The role includes building infrastructure to support query functionality for our databases, performing complex queries, and conducting advanced data interpretation. Responsibilities may include reporting, online analytical processing, analytics, data mining, business performance management, benchmarking, text mining, and predictive analytics. This role collaborates with various groups across the organization, including Commercial, Clinical, and Lab personnel.

Essential Functions

  • Develop and implement new methods, protocols, and algorithms for data queries, management, and governance
  • Collaborate with statisticians and machine learning specialists to support Advanced Analytics, Application Delivery, Clinical, Commercial, R&D, and Lab teams with data access and tools for research and analysis
  • Serve as a subject matter expert to support data interrogation, database consistency, and mapping for stakeholders' needs, including business partnerships and data integration
  • Work closely with cross-functional teams, including healthcare professionals, data architects, and IT specialists, to develop robust data pipelines, implement data quality controls, and generate insights to support clinical decision-making
  • Apply data lake expertise using cloud-based solutions, particularly on AWS, and manage ETL processes to structure clinical data for analysis
  • Contribute to the growth of the Data Warehouse architecture by creating and implementing custom clinical data models and ETL processes
  • Coordinate data integration projects, such as EHR, with system architects, DBAs, and vendors
  • Deploy data warehouse content using approved AWS systems and processes
  • Perform comprehensive analysis of clinical data using statistical techniques and data mining methodologies
  • Troubleshoot and analyze ETL process failures, data anomalies, and other data warehouse issues, recommending improvements as needed
  • Create and maintain accurate metadata models for custom warehouse data structures
  • Provide technical expertise to support end users and offer business logic for data pipeline transformations
  • Respond to and track issues using Jira Service Desk, gathering additional information from customers and resolving or escalating as needed
  • Manage projects, create timelines, identify risks and milestones, and provide status reporting to stakeholders
  • Support data warehouse developers, analysts, and users to validate data and ensure data warehouse validity
  • Utilize approved development tools to identify data quality and relationships for efficient reporting solutions
  • Other duties as assigned

Qualifications

  • Advanced degree (PhD or Master’s) in Data Science, Computer Science, or related field, or equivalent combination of education and work experience
  • 3+ years of experience in computer programming, data analytics, and research and development in fields related to clinical diagnostics
  • Proficiency in programming languages (e.g., Perl, Python, R, C/C++, Java), expertise in SQL, and familiarity with NoSQL query languages
  • Knowledge of cloud computing platforms (e.g., AWS, Azure, Google Cloud) and experience in designing and developing cloud-based data pipelines and ETL processes
  • Strong understanding of data warehousing architecture, dimensional modeling concepts, and version control systems (e.g., Git)
  • Understanding of data governance principles, data privacy and compliance regulations specific to genomics data (e.g., HIPAA, GDPR)
  • Strong analytical and problem-solving abilities, excellent oral and written communication skills, project management skills with attention to detail, and ability to collaborate effectively with cross-functional teams

Preferred Qualifications

  • Knowledge of genomics data formats and standards (e.g., VCF, BAM, FASTQ), clinical data ontologies (e.g., SNOMED, ICD-10, HPO), and medical data models (OMOP)
  • Familiarity with molecular biology, genomics, clinical fields, and/or bioinformatics
  • Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes) for deployment and scalability
  • Knowledge of genomics research workflows and data analysis pipelines
  • Proficiency in big data technologies (e.g., Hadoop, Spark) for processing and analyzing large-scale genomics data
  • Experience in a genetics laboratory or basic genetics setting
  • Experience with EHR integrations, either HL7- or FHIR-based (strongly preferred)
  • Proven track record of implementing scalable and efficient solutions for genomics data storage and processing in the cloud
  • Project management skills with a keen attention to detail and ability to handle multiple tasks simultaneously

About Us

Ambry Genetics Corporation is a CAP-accredited and CLIA-licensed molecular genetics laboratory based in Aliso Viejo, California. We are a genetics-based healthcare company that is dedicated to open scientific exchange so we can work together to understand and treat all human disease faster.

At Ambry, everyone is welcome. A career at Ambry Genetics is a chance to be part of a dynamic company that aims to improve health by understanding the relationships between genetics and human disease. We earned our reputation as industry leaders by responsibly introducing cutting-edge genetic testing solutions and continually sharing what we learn with the global scientific community.

At Ambry you will be learning, challenging yourself, and having fun while collaborating with teammates through the open exchange of ideas. Our outstanding benefits program includes 401k, medical, dental, vision, FSA, paid sick leave and generous paid time off (PTO) program. Ambry Genetics is an Equal Opportunity Employer (EOE) and we maintain a drug-free work environment.

The Company believes in second chance employment. Qualified applicants with arrest or conviction history will be considered regardless of their arrest or conviction history, consistent with local laws such as Los Angeles County Fair Chance Ordinance and the California Fair Chance Act. You do not need to disclose your criminal history or participate in a background check until a conditional job offer is made to you. After making a conditional offer and running a background check, if the Company is concerned about a conviction that is directly related to the job, you will be given the chance to explain the circumstances surrounding the conviction, provide mitigating evidence, or challenge the accuracy of the background report. For the purpose of the above job description, “Essential Functions” are “Material Job Duties”.

Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

All qualified applicants will receive consideration for employment without regard to race (and traits historically associated with race, including, but not limited to hair texture and protective hairstyles such as braids, locks, and twists), color, creed, religion, sex, sexual orientation, gender identity, gender expression (including transgender status), national origin, ancestry, age, marital status or protected veteran status and will not be discriminated against on the basis of disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances. If you have a disability or special need that requires accommodation, please contact us at careers@ambrygen.com

Ambry does not accept unsolicited resumes from individual recruiters, third party recruiting agencies, outside recruiters or firms without an executed contract in place. We are not responsible for any fees related to resumes that are unsolicited or are received by Ambry. Such resumes will be deemed the sole property of Ambry and will be processed accordingly.

PRIVACY NOTICES

To review Ambry’s privacy notice, click here: https://www.ambrygen.com/legal/privacy-policy

To review the California privacy notice, click here: California Privacy Notice | Ambry Genetics

To review the UKG privacy notice, click here: California Privacy Notice | UKG
