¡Activa las notificaciones laborales por email!

Software Engineer, Data Infrastructure & Acquisition - San Sebastián, Spain

Speechify

Donostia/San Sebastián

A distancia

EUR 60.000 - 80.000

Jornada completa

Hoy
Sé de los primeros/as/es en solicitar esta vacante

Descripción de la vacante

A leading text-to-speech technology company in Donostia/San Sebastián is seeking a Software Engineer for their AI team. The ideal candidate will manage data collection processes and enhance cloud infrastructure for efficient model training. Candidates should have substantial software development experience and expertise in cloud technologies. This role offers competitive salaries and an opportunity to work on impactful projects that improve accessibility for people with learning differences.

Servicios

Competitive salaries
Hands-off management approach
Friendly and laid-back atmosphere
Opportunity to work on life-changing products

Formación

  • 5+ years of industry experience in software development.
  • Experience with web crawlers and large-scale data processing workflows is a plus.
  • Ability to handle multiple tasks and adapt to changing priorities.

Responsabilidades

  • Find new sources of audio data for ingestion pipeline.
  • Operate cloud infrastructure for ingestion pipeline running on GCP.
  • Collaborate with scientists to improve data quality and scale.

Conocimientos

Proficiency with bash/Python scripting in Linux environments
Proficiency in Docker
Strong communication skills

Educación

BS/MS/PhD in Computer Science or a related field

Herramientas

Google Cloud Platform (GCP)
Descripción del empleo

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

The responsibilities of our Platform team include building and maintaining all backend services, including, but not limited to, payments, analytics, subscriptions, new products, text to speech, and external APIs.

This is a key role and ideal for someone who thinks strategically, enjoys fast-paced environments, is passionate about making product decisions, and has experience building great user experiences that delight users.

We are a flat organization that allows anyone to become a leader by showing excellent technical skills and delivering results consistently and fast. Work ethic, solid communication skills, and obsession with winning are paramount.

Our interview process involves several technical interviews and we aim to complete them within 1 week.

Overview

We’re looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You’ll Do

  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline.
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.

An Ideal Candidate Should Have

  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments.
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP).
  • Experience with web crawlers, large-scale data processing workflows is a plus.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

What we offer

  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.

Think you’re a good fit for this job?

Tell us more about yourself and why you're interested in the role when you apply. And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit?

Refer them!

Speechify is committed to a diverse and inclusive workplace.

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Speechify’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Voluntary Self-Identification of Disability

Form CC-305

Page 1 of 1

OMB Control Number 1250-0005

Expires 04/30/2026

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.