Job Search and Career Advice Platform
10,000+ Jobs in Granada, Spain

Machine Learning Engineer

Application House

United States
Remote
USD 100,000 - 145,000
Yesterday
Be an early applicant

Staff Enterprise Application Engineer - Palantir

GE Aerospace

Indianapolis (IN)
Remote
USD 110,000 - 185,000
Yesterday
Be an early applicant

Senior Finance Systems Analyst

CBRE

Raleigh (NC)
Remote
USD 75,000 - 100,000
Yesterday
Be an early applicant

Sr. Solution Architect, Process Automation - Experience in Life Science industry required (remote)

Cognizant

Madison (WI)
Remote
USD 100,000 - 181,000
Yesterday
Be an early applicant

Data Center Design Execution Senior Project Manager

CBRE

Lincoln (NE)
Remote
USD 180,000 - 200,000
Yesterday
Be an early applicant

Sr. Associate Actuary, Retirement Plan Services

Lincoln Financial

Concord (NH)
Remote
USD 93,000 - 170,000
Yesterday
Be an early applicant

Chronic Care Specialty Sales Representative – NC - SC - VA

Merck

Lincoln (NE)
Remote
USD 77,000 - 164,000
Yesterday
Be an early applicant

Quality Analyst with Healthcare - Claims & Membership

Cognizant

Juneau (AK)
Remote
USD 53,000 - 93,000
Yesterday
Be an early applicant

Senior Producer

Global Experience Specialists, Inc.

California (MO)
Remote
USD 60,000 - 80,000
Yesterday
Be an early applicant

Affiliate & Influencer Marketing

Working Nomads

United States
Remote
USD 80,000 - 110,000
Yesterday
Be an early applicant

Principal Solution Architect – FirstRx Ecosystem - Remote

Prime Therapeutics

Lincoln (NE)
Remote
USD 124,000 - 211,000
Yesterday
Be an early applicant

Billing and Medical Coding Specialist

Spring Health

United States
Remote
USD 10,000 - 60,000
Yesterday
Be an early applicant

Customer Service Specialist – Work From Home – Remote

Cornerstone Financial Solutions

United States
Remote
USD 100,000 - 300,000
Yesterday
Be an early applicant

Entry Level Remote Sales Representative - Part-Time or Full-Time

White Rose Group

Cincinnati (OH)
Remote
USD 10,000 - 60,000
Yesterday
Be an early applicant

Project Manager I - Cell & Gene Therapy (Sponsor-Dedicated/ Remote)

Syneos Health/ inVentiv Health Commercial LLC

North Carolina
Remote
USD 70,000 - 90,000
Yesterday
Be an early applicant

Remote LCSW / LPC / LMFT - Work with students (Spanish preferred)

Daybreak Health

Houston (TX)
Remote
USD 80,000 - 100,000
Yesterday
Be an early applicant

Sr. Security Engineer, CyberArk

MDMS Recruiting

United States
Remote
USD 100,000 - 130,000
Yesterday
Be an early applicant

Youth Leadership Fellow (Greater Boston)

Nichols College

Dudley (MA)
Remote
USD 10,000 - 60,000
Yesterday
Be an early applicant

Industrial Field Service Tech, Mechanical

Ingersoll Rand

Davenport (IA)
Remote
USD 50,000 - 70,000
Yesterday
Be an early applicant

SQF Lead Auditor- (Subcontractor)

TUV SUD America

Durham (NC)
Remote
USD 60,000 - 80,000
Yesterday
Be an early applicant

Project Manager I - Cell & Gene Therapy (Sponsor-Dedicated/ Remote)

Syneos Health/ inVentiv Health Commercial LLC

Louisiana (MO)
Remote
USD 80,000 - 110,000
Yesterday
Be an early applicant

Become a Luxury Brand Evaluator in Roseville, CA - Apply Now

CXG

Citrus Heights (CA)
Remote
USD 60,000 - 80,000
Yesterday
Be an early applicant

Partner in Tax Law Division

Carrie Rikon & Associates, LLC.

United States
Remote
USD 235,000 - 250,000
Yesterday
Be an early applicant

Project Development Engineer

Ameresco

Massachusetts
Remote
USD 80,000 - 100,000
Yesterday
Be an early applicant

Software Engineer, AI Products

Ada

United States
Remote
USD 80,000 - 110,000
Yesterday
Be an early applicant

Machine Learning Engineer
Application House
United States
Remote
USD 100,000 - 145,000
Part time
Yesterday
Be an early applicant

Job summary

A leading technology firm is seeking a Machine Learning Engineer (Inference & Systems) to design and optimize AI inference systems. This remote role offers flexible hours and the opportunity to work with modern ML frameworks. Ideal candidates will have extensive experience in deep learning inference, and the role includes responsibilities like deploying ML models and optimizing GPU performance. Join a diverse team and drive innovation in AI infrastructure.

Benefits

Flexible working hours
Professional development budget
Career growth opportunities
Remote work

Qualifications

  • 3+ years in deep learning inference or distributed systems.
  • Experience in deploying ML/LLM models to production.
  • Hands-on experience with an inference engine.

Responsibilities

  • Deploy and maintain LLMs and ML models using serving engines.
  • Design fault-tolerant distributed inference engines.
  • Build monitoring and observability for inference services.

Skills

Deep learning inference
Fault-tolerant distributed systems
High-performance computing
GPU programming
REST APIs
Python
CUDA
Docker
Kubernetes

Tools

vLLM
Hugging Face TGI
TensorRT-LLM
FasterTransformer
Triton
Job description
Overview

Job Title: Machine Learning Engineer (Inference & Systems) (also known as Inference Engineer)

Location: Work from Home (fully flexible Remote)

Job Timing: Part-time with fully flexible hours; can optionally be converted to full-time

About the Role

We’re building a next-generation cloud platform to serve multimodal AI, LLMs, vision, audio and other machine learning models at scale. As a Machine Learning Engineer (Inference & Systems), you’ll design and optimize runtime systems, OpenAI-compatible APIs, and distributed GPU pipelines for fast, cost-efficient inference and fine-tuning.

You’ll work with frameworks like vLLM, TensorRT-LLM, and TGI to design, optimize, and deploy distributed inference engines that serve text, vision, and multimodal models with low latency and high throughput. This includes deploying models such as LLaMA 3, Mistral, diffusion, ASR, TTS, and embeddings, while focusing on GPU/accelerator optimizations, software–hardware co-design, and fault-tolerant large-scale systems that power real-world applications and developer tools.

You’ll work at the intersection of machine learning, cloud infrastructure, and systems engineering, focusing on high-throughput, low-latency inference and cost-efficient deployment. This role offers a unique opportunity to shape the future of AI inference infrastructure, from cutting-edge model serving systems to production-grade deployment pipelines.

If you’re passionate about pushing the boundaries of AI inference, we’d love to hear from you!

Key Responsibilities
  • Deploy and maintain LLMs (e.g., LLaMA 3, Mistral) and ML models using serving engines such as vLLM, Hugging Face TGI, TensorRT-LLM, or FasterTransformer.
  • Design and develop large-scale distributed inference engines for text, image, LLM, and multimodal models that are fault-tolerant, high-concurrency, high-performance, and cost-efficient.
  • Implement and optimize distributed inference and parallelism strategies: Mixture of Experts (MoE), tensor parallelism, pipeline parallelism, and related techniques for high-performance serving.
  • Integrate vLLM, TGI, SGLang, FasterTransformer, and explore emerging inference frameworks.
  • Build and scale an OpenAI-compatible API layer to expose models for customer use.
  • Experiment with model quantization, caching, and parallelism to reduce inference costs.
  • Optimize GPU usage, memory, and batching to achieve low-latency, high-throughput inference.
  • Optimize GPU performance using CUDA graph optimizations, TensorRT-LLM, Triton kernels, PyTorch compilation (torch.compile), quantization, and speculative decoding to maximize efficiency.
  • Work with cloud GPU providers (RunPod, Vast.ai, AWS, GCP, Azure) to manage costs and availability.
  • Develop runtime inference services and APIs for LLMs, multimodal models, and fine-tuning pipelines.
  • Build monitoring and observability for inference services, integrating inference metrics (latency, throughput, GPU utilization) into dashboards and tooling (Grafana, Prometheus, Loki, OpenTelemetry).
  • Collaborate with backend and DevOps engineers to ensure secure, reliable APIs with rate-limiting and billing hooks.
  • Document deployment processes and provide guidance to other engineers using the platform.
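
To make the batching responsibility above more concrete, here is a minimal sketch of how pending requests might be grouped into batches under a size limit and a token budget. The names `Request`, `form_batches`, `max_batch_size`, and `max_batch_tokens` are hypothetical and chosen for illustration; production engines such as vLLM use more sophisticated continuous batching, which this sketch does not attempt to reproduce.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Request:
    """Hypothetical pending inference request."""
    request_id: str
    num_tokens: int


def form_batches(pending: List[Request],
                 max_batch_size: int,
                 max_batch_tokens: int) -> List[List[Request]]:
    """Greedily pack requests, in arrival order, into batches.

    A batch is closed when adding the next request would exceed either
    the request-count limit or the token budget. A single request larger
    than the token budget still gets a batch of its own.
    """
    batches: List[List[Request]] = []
    current: List[Request] = []
    current_tokens = 0
    for req in pending:
        batch_full = len(current) >= max_batch_size
        over_budget = bool(current) and current_tokens + req.num_tokens > max_batch_tokens
        if batch_full or over_budget:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(req)
        current_tokens += req.num_tokens
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    queue = [Request("a", 300), Request("b", 500),
             Request("c", 400), Request("d", 100)]
    for batch in form_batches(queue, max_batch_size=8, max_batch_tokens=1000):
        print([r.request_id for r in batch])
```

Larger batches generally raise throughput at the expense of per-request latency, so both limits would normally be tuned against measured latency targets.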
Requirements
  • Experience: 3+ years in deep learning inference, fault-tolerant distributed systems, or high-performance computing.
  • Proven experience in deploying ML/LLM models to production.
  • Inference: Hands-on experience with at least one inference engine: vLLM, TGI, SGLang, TensorRT-LLM, FasterTransformer, or Triton.
  • Runtime Services: Prior work implementing large-scale inference or serving pipelines.
  • Solid understanding of GPU memory management, batching, and distributed inference. Strong knowledge of GPU programming (CUDA, Triton, TensorRT), compilers, model quantization, and GPU cluster scheduling.
  • Experienced in the GPU/ML stack, including PyTorch, Hugging Face Transformers, and GPU-accelerated inference.
  • Deep understanding of Transformer architectures, LLM/VLM/Diffusion model optimization, and KV cache systems like Mooncake, PagedAttention, or custom in-house variants that support long-context serving and inference optimization techniques.
  • Comfortable working with cloud GPU platforms (AWS/GCP/Azure) or GPU marketplaces (RunPod, Vast.ai, TensorDock) to profile bottlenecks and optimize GPU utilization.
  • Ability to benchmark and tune multi-GPU clusters for throughput and memory efficiency.
  • Experience building REST APIs or gRPC services (FastAPI, Flask, or similar).
  • Programming: Proficient in Python, Go, Rust, C++, CUDA for high-performance systems.
  • Systems knowledge: Distributed systems experience (storage, search, compute, or inference); strong understanding of multi-threading, memory management, networking, storage, and performance tuning.
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Strong problem-solving and debugging skills across ML + infra stack.
  • Familiarity with distributed storage (Ceph, HDFS, 3FS).
  • Knowledge of datacenter networking (RDMA, RoCE).
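
The GPU memory management and KV cache requirements above can be illustrated with a rough sizing calculation. For a standard Transformer decoder, each cached token stores one key and one value vector per layer and per KV head. The function name `kv_cache_bytes` and the model dimensions below are assumptions picked for illustration (roughly the shape of an 8B-parameter model with grouped-query attention); they are not figures from this posting.

```python
def kv_cache_bytes(num_layers: int,
                   num_kv_heads: int,
                   head_dim: int,
                   num_tokens: int,
                   bytes_per_value: int = 2) -> int:
    """Estimate KV cache size for one sequence.

    The factor of 2 accounts for storing both keys and values.
    bytes_per_value=2 corresponds to 16-bit precision (FP16/BF16).
    """
    return 2 * num_layers * num_kv_heads * head_dim * num_tokens * bytes_per_value


if __name__ == "__main__":
    # Illustrative dimensions only (assumed, not taken from the posting).
    per_token = kv_cache_bytes(num_layers=32, num_kv_heads=8,
                               head_dim=128, num_tokens=1)
    per_sequence = kv_cache_bytes(num_layers=32, num_kv_heads=8,
                                  head_dim=128, num_tokens=8192)
    print(f"per token:    {per_token / 1024:.0f} KiB")
    print(f"per sequence: {per_sequence / 1024**3:.0f} GiB at 8192 tokens")
```

Estimates like this show why long-context serving is often memory-bound: the cache grows linearly with both sequence length and the number of concurrent sequences, which is the pressure that paged KV cache designs aim to relieve.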
Nice to Have
  • Experience with Stripe or other billing systems for metered API usage.
  • Knowledge of Redis or Envoy for request rate limiting.
  • Familiarity with observability tools (Grafana, Prometheus, Loki).
  • Exposure to MLOps pipelines (CI/CD with Azure DevOps or GitHub Actions).
  • Experience with model fine-tuning pipelines and GPU scheduling.
  • Understanding of rate limiting, quota enforcement, and billing hooks in ML APIs.
  • Prior work at an AI infra company (Together.ai, Modal, Anyscale, Replicate, etc.)
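
For readers unfamiliar with the rate limiting and quota enforcement mentioned in the list above, the following is a minimal token-bucket sketch. The class `TokenBucket` and its parameters are hypothetical; a real deployment would more likely rely on shared state in something like Redis or on a proxy such as Envoy, as the posting suggests. Time is passed in explicitly so the behavior is deterministic.

```python
class TokenBucket:
    """Hypothetical per-client token bucket.

    The bucket holds up to `capacity` tokens and refills at
    `refill_per_second`. Each request spends `cost` tokens; a request
    is rejected when too few tokens remain.
    """

    def __init__(self, capacity: float, refill_per_second: float,
                 now: float = 0.0) -> None:
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self.tokens = capacity
        self.last = now

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill based on elapsed time, capped at capacity.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity,
                          self.tokens + elapsed * self.refill_per_second)
        self.last = max(self.last, now)
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


if __name__ == "__main__":
    bucket = TokenBucket(capacity=2, refill_per_second=1.0)
    for t in (0.0, 0.0, 0.0, 1.0, 1.0):
        print(t, bucket.allow(now=t))
```

Setting `cost` in proportion to the tokens a request generates is one way the same mechanism could double as a metering hook for billing, though that design choice is an assumption here.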
Why Join Us?
  • Work from Anywhere — 100% remote, with the freedom to work from anywhere in the world.
  • Fully Flexible Shifts — complete control over your working hours; results matter more than clocking in.
  • Career Growth & Fast-Track Promotions — we guarantee the quickest promotion opportunities and clear pathways for advancement.
  • Professional Development — training budget, mentorship, and exposure to cutting-edge Salesforce, AI/ML, and cloud technologies.
  • Global Collaboration — work with an international, diverse, and inclusive team.
  • Innovative Environment — freedom to experiment with new tools, frameworks, and ideas.
  • Accelerated Salary Growth + Performance Incentives — ambitious and hard-working team members are rewarded with fast upward salary progression alongside strong performance bonuses.

* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company; the actual salary could be significantly higher or lower.

© JobLeads 2007 - 2025 | All rights reserved