AI Research Scientist

AI Tech Suite

Cupertino (CA)

On-site

USD 80,000 - 150,000

Full time

Yesterday

Job summary

An innovative development platform is seeking a talented individual to join their team and contribute to the creation of high-performance AI applications. This role focuses on leveraging cutting-edge technologies and methodologies to enhance language models and optimize performance across devices. You will engage in exciting tasks such as building datasets, developing fine-tuning techniques, and collaborating on SDK development for various platforms. If you're passionate about AI and eager to make a significant impact in the field, this is the perfect opportunity for you!

Qualifications

  • 1+ year of research experience in machine learning.
  • Strong Python and machine learning background required.

Responsibilities

  • Create datasets for advanced language model capabilities.
  • Build tooling for efficient fine-tuning of models.
  • Run experiments to support AI research.

Skills

Python
Machine Learning
C/C++
Multithreading
Performance Optimization
Google Cloud / AWS
Data Structures
Algorithms

Tools

CUDA
OpenCL
Transformers Library

Job description

Nexa AI is a development platform specializing in small multimodal models and accelerated edge inference across devices. It lets developers build high-performance on-device AI apps without handling model compression or edge deployment themselves. Nexa AI supports state-of-the-art models from DeepSeek, Llama, Gemma, and Qwen, as well as Nexa's own Octopus, OmniVLM, and OmniAudio. Its on-device AI expertise lets developers deploy optimized, local AI in hours, not months. Platform features include multimodal model support, model compression, and local on-device inference.

About Nexa AI

Nexa AI: Accelerating Gen-AI tasks on any device. Build high-performance AI apps on-device without the hassle of model compression or edge deployment.

Minimum Qualifications

  • At least one machine learning research project in which you played a major role
  • Significant Python, machine learning, and research experience
  • Familiarity with C or C++

Responsibilities

  1. Create datasets for powerful potential capabilities of language models, such as function calling and reflection
  2. Build tooling and infrastructure to enable efficient fine-tuning experiments on language models
  3. Help develop new methods and fine-tuning techniques that improve language model behavior
  4. Run experiments that feed into key AI research

Additional Skills and Responsibilities

  • Knowledge of OS internals, compilers, and low-power/mobile optimization
  • Experience with low-level C code and frameworks such as CUDA and OpenCL
  • Proficiency in multithreading and performance optimization
  • Excellent CS fundamentals (data structures, algorithms, coding)
  • Develop SDKs for Android, iOS, and Linux
  • Diagnose and fix bugs and performance issues
  • Specialize in the Google Cloud or AWS tech stack
  • Familiarity with LLM technologies, particularly the Transformers library
  • Experience with model compression and edge device deployment is a plus
