Enable job alerts via email!

NLP Data Acquisition Engineer Software Engineer

NLP PEOPLE

Burlington

On-site

CAD 70,000 - 90,000

Full time

28 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in healthcare research is seeking an Engineer to build a clinical language understanding system. The role involves creating annotated corpora, managing data acquisition, and ensuring data quality. Ideal candidates will have strong programming skills, particularly in Python, and experience in clinical data management.

Qualifications

  • 3-5 years professional development experience.
  • Excellent programming skills in Python.
  • Experience with large corpora and data analysis.

Responsibilities

  • Select and distribute data for annotation to internal staff.
  • Create and organize large annotated clinical text corpora.
  • Monitor data annotation consistency.

Skills

Python
SQL
Bash / Unix scripting
Communication

Education

BSCS
MSCS

Tools

Java
C++

Job description

Engineer in the Healthcare Research organization working on building a state of the art clinical language understanding system by creating appropriate corpora. Major duty is to select and distribute data for annotation to internal staff, collect and analyze the annotated data, and integrate this into the research pipeline.

Main responsibility is creating and organizing large annotated clinical text corpora for training models for clinical information extraction / understanding. The candidate will be responsible for the end-to-end research data acquisition process which includes :

  • Identify data requirements for models and research.
  • Select the appropriate data from an unannotated large corpus.
  • Conduct data analysis to identify and correct potential problems.
  • Distribute the data to annotation staff.
  • Monitor data annotation consistency during and after annotation phase.
  • Convert the data to appropriate format for further research.
  • Creation and maintenance of the data acquisition and validation tools.

Company : Nuance

Qualifications : Required Skills :

3-5 years professional development experience.

Excellent programming skills and at least 2 years of experience programming in Python is required.

SQL or other Database knowledge is required.

Bash / Unix scripting skills are required.

Must be self-motivated and detail-oriented, with exceptional communication and inter-personal skills.

BSCS required. MSCS strongly preferred.

Preferred Skills :

Experience with collecting, creating and maintaining large corpora is a definite PLUS.

Experience with Java or C++ is a definite PLUS.

Experience working with speech recognition, language modeling / processing, machine translation, information extraction etc is a definite PLUS.

Experience in medical text analytics and / or clinical information retrieval is a definite PLUS.

Experience working with UMLS components (MeSH, Specialist etc) and / or SNOMED is a definite PLUS.

Level of experience (years) :

Mid Career (2+ years of experience)

How to apply :

Please mention NLP People as a source when applying.

Apply here

J-18808-Ljbffr

Create a job alert for this search

Software Engineer • Burlington, ON, Canada

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Security Software Engineer

Canonical

Mississauga

Remote

CAD 80,000 - 120,000

6 days ago
Be an early applicant

Ubuntu Core Software Engineer

Canonical

Mississauga

Remote

CAD 70,000 - 100,000

2 days ago
Be an early applicant

Security Software Engineer

Canonical

Hamilton

Remote

CAD 80,000 - 120,000

6 days ago
Be an early applicant

HPC Software Engineer

Canonical

Mississauga

Remote

CAD 80,000 - 120,000

15 days ago

Software Engineer

DataAnnotation

Newfoundland and Labrador

Remote

CAD 80,000 - 90,000

5 days ago
Be an early applicant

Open Source Networking Software Engineer - ToR Switch / SmartNIC / DPU

Canonical

Vancouver

Remote

CAD 70,000 - 120,000

2 days ago
Be an early applicant

System Software Engineer - GCC/LLVM compiler, tooling, and ecosystem

Canonical

Victoria

Remote

CAD 80,000 - 120,000

2 days ago
Be an early applicant

Senior Software Engineer

Infios

Moncton

Remote

CAD 80,000 - 120,000

2 days ago
Be an early applicant

Software Developer Engineer in Test (SDET) – Core Protection Technology

McAfee

Waterloo

Remote

CAD 70,000 - 90,000

2 days ago
Be an early applicant