AI Research Scientist

AI Tech Suite

Cupertino (CA)

On-site

USD 80,000 - 150,000

Full time

Yesterday

Job summary

An innovative development platform is seeking a talented individual to join their team and contribute to the creation of high-performance AI applications. This role focuses on leveraging cutting-edge technologies and methodologies to enhance language models and optimize performance across devices. You will engage in exciting tasks such as building datasets, developing fine-tuning techniques, and collaborating on SDK development for various platforms. If you're passionate about AI and eager to make a significant impact in the field, this is the perfect opportunity for you!

Qualifications

1+ year of research experience in machine learning.
Strong Python and machine learning background required.

Responsibilities

Create datasets for advanced language model capabilities.
Build tooling for efficient fine-tuning of models.
Run experiments to support AI research.

Skills

Python

Machine Learning

C/C++

Multithreading

Performance Optimization

Google Cloud / AWS

Data Structures

Algorithms

Tools

CUDA

OpenCL

Transformers Library

Nexa AI is a development platform specializing in small multimodal models and accelerated edge inference, optimized for any device. It enables building high-performance AI apps on-device without the hassle of model compression or edge deployment. Nexa AI supports state-of-the-art models from DeepSeek, Llama, Gemma, and Qwen, as well as Nexa's own Octopus, OmniVLM, and OmniAudio. It offers industry-leading on-device AI expertise, enabling developers to deploy optimized, local AI in hours, not months. The platform's features include multimodal model support, model compression, and local on-device inference.

About Nexa AI

Nexa AI: Accelerating Gen-AI tasks on any device. Build high-performance AI apps on-device without the hassle of model compression or edge deployment.

Minimum Qualifications

Have at least 1 research project related to machine learning where you played a major role
Significant Python, machine learning, and research experience
Familiarity with C or C++

Responsibilities

Create datasets that target powerful potential capabilities of language models, such as function calling and reflection
Build tooling and infrastructure to enable efficient fine-tuning experiments on language models
Help develop new methods or novel fine-tuning techniques to improve language model behaviors
Run experiments that feed into key AI research

Additional Skills and Responsibilities

Knowledge of OS internals, compilers, low-power/mobile optimization
Experience with low-level code in C and frameworks like CUDA, OpenCL
Proficiency in multithreading and performance optimization
Excellent CS fundamentals (data structures, algorithms, coding)
Develop SDKs for Android and iOS
Diagnose and fix bugs and performance issues
Specialize in Google Cloud / AWS tech stacks
Familiarity with LLM technologies, particularly the Transformers library
Experience with model compression and edge device deployment is a plus
Contribute to SDK development across Android, iOS, and Linux platforms
