Software Engineer, Data Infrastructure & Acquisition - Oslo, Norway

Speechify

Oslo

On-site

NOK 70,000 - 90,000

Full time

26 days ago

Job summary

A growing AI company is seeking an experienced Data Engineer to strengthen its AI team's data operations. You will find and integrate audio data at scale, collaborate with scientists, and help shape the dataset roadmap. Ideal candidates have a strong background in software development, cloud infrastructure, and scripting, and want to build impactful products that support people with learning differences.

Benefits

Competitive salaries
Friendly atmosphere
Remote work flexibility
Impactful work on a transformative product

Qualifications

  • 5+ years of industry experience in software development.
  • Proficiency in Linux environments.
  • Ability to handle multiple tasks and adapt to changing priorities.

Responsibilities

  • Find new sources of audio data for the ingestion pipeline.
  • Operate and extend the cloud infrastructure on GCP.
  • Collaborate with the AI Team to craft the dataset roadmap.

Skills

bash/Python scripting
Docker
Large-scale data processing workflows
Cloud Infrastructure
Strong communication skills

Education

BS/MS/PhD in Computer Science or related field

Tools

Google Cloud Platform (GCP)

Job description

Overview

The mission of Speechify is to make sure that reading is never a barrier to learning.

Speechify’s products include its iOS app, Android app, Mac app, Chrome Extension, and Web App. Speechify is a 100% distributed company with no office, with a global team including frontend and backend engineers, AI research scientists, and others from major companies and universities.

We are hiring for the Data side of our AI team. This role is responsible for all aspects of data collection to support model training operations, enabling high-quality datasets at petabyte scale and low cost through tight integration of infrastructure, engineering, and research work.

What You’ll Do

  • Be scrappy in finding new sources of audio data and bringing them into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform
  • Collaborate closely with Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models
  • Collaborate with the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products

An Ideal Candidate Should Have

  • BS/MS/PhD in Computer Science or a related field
  • 5+ years of industry experience in software development
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers and large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities
  • Strong communication skills, both written and verbal

What We Offer

  • A fast-growing environment where you can help shape the company and product
  • An entrepreneurial-minded team that supports risk, intuition, and hustle
  • A hands-off management approach so you can focus and do your best work
  • An opportunity to make a big impact in a transformative industry
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture
  • Opportunity to work on a life-changing product that millions of people use
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio

Speechify is committed to a diverse and inclusive workplace. Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
