Data Scientist

Barmont

Germany

Remote

USD 85,000 - 95,000

Full-time

Posted 2 days ago

Summary

A public records organization is seeking a remote Data Scientist to support data exploration and quality assurance for its database. This role requires a minimum of 5 years of experience in data science and proficiency in SQL. Responsibilities include developing data quality standards, monitoring processing errors, and ensuring compliance with data privacy regulations. The position offers a salary range of $85,000 to $95,000, along with a comprehensive benefits package.

Benefits

100% employee coverage for medical, dental, and vision plans
Retirement savings plan
Flexible schedules
Generous vacation leave
Professional development opportunities

Qualifications

  • Minimum 5 years of experience as a Data Scientist or Data Engineer.
  • High proficiency in SQL for data extraction.
  • Experience in data quality assurance and identity resolution.

Responsibilities

  • Develop and maintain data quality standards and processes.
  • Design quality assurance test plans for data integrity.
  • Collaborate with data engineers and analysts on data quality.

Skills

SQL proficiency
Analytical skills
Problem-solving skills
Data engineering

Job Description

Oklahoma Data Exchange (OK Data) is seeking an experienced Data Scientist to support data exploration, quality assurance, and identity resolution for our growing public records database, currently focused on supporting work around the justice system. Founded in 2025, OK Data works with governmental and nonprofit organizations to build human and technical capacity to share, link, and understand critical data across the systems that serve Oklahomans.
The ideal candidate will have a proven track record in data engineering and/or data science, with significant experience designing and implementing database quality assurance tests and identity resolution algorithms. The position is remote and open to residents of Oklahoma.

Data Exploration

  • Collaborate with the Lead Data Engineer on the creation of dashboards and reports using code (Python or Ruby) and/or a BI platform
  • Explore, document, and diagram various data sources, keeping in mind the following:
    • Mappings and standardization
    • Data over time (temporal data)
    • Scraping workflows
    • Data scope
    • Real-life data scenarios
    • Edge cases and other sources of complexity
  • Find patterns and suggest algorithms for designing database structure and gleaning insights

Database Quality Assurance

  • Develop and maintain data quality standards, metrics, and processes
  • Design and execute quality assurance test plans that ensure the integrity and timeliness of data collected from public and privileged data sources
  • Maintain working knowledge of OK Data's data collection processes and storage infrastructure
  • Monitor data pipelines and ETL processes to detect and resolve data anomalies
  • Collaborate with the Lead Data Engineer to quickly resolve errors in data processing and storage
  • Collaborate with data engineers, analysts, and external stakeholders to understand data requirements and quality expectations
  • Document data quality issues and work with relevant teams to implement corrective actions
  • Create and maintain data quality dashboards and reports
  • Support data governance initiatives and contribute to data stewardship efforts

Identity Resolution Support

  • Test and benchmark identity resolution pipelines using deterministic and probabilistic matching algorithms
  • Develop and maintain data ingestion and transformation processes to support identity stitching
  • Collaborate with internal and external partners to define identity resolution rules and data quality standards
  • Optimize performance of identity resolution workflows for scalability and accuracy
  • Monitor and troubleshoot data matching issues and continuously improve match rates
  • Ensure compliance with data privacy regulations (e.g., GDPR, CCPA) in identity resolution processes
  • Document technical designs, data flows, and resolution logic

Benefits and Compensation

  • The starting salary range for this position is $85,000 to $95,000, commensurate with experience
  • Benefits package that includes 100% employee coverage for medical, dental, and vision plans
  • Retirement savings plan with a safe harbor match at 5% of salary
  • Fully remote work, stipend for work-from-home expenses, and flexible schedules
  • Generous vacation leave and paid holidays
  • Professional development opportunities

Qualifications

  • Experience as a Data Scientist, Data Engineer, or relevant role, with a minimum of 5 years of experience
  • High proficiency using SQL to create and extract custom data sets from multiple tables
  • Demonstrated analytical and problem-solving skills with keen attention to detail
  • Familiarity with Oklahoma public data sources is a big plus