Student Researcher (Doubao (Seed) - Foundation Model - Video Generation) - 2025 Start (PhD)

ByteDance

San Jose (CA)

On-site

Full time

30+ days ago

Job summary

ByteDance is seeking a Student Researcher for its Doubao vision team, focusing on multi-modal foundation models in AI for a 2025 start. This position provides hands-on experience in generative AI within a collaborative and innovative environment, suitable for PhD candidates pursuing research in cutting-edge technologies.

Benefits

Health insurance
Paid holidays
Sick leave

Qualifications

  • PhD candidate with research experience in multimodal understanding and vision.
  • Publications in top-tier venues required.
  • Strong communication skills preferred.

Responsibilities

  • Conduct cutting-edge research in multimodal machine learning.
  • Develop foundation models to enhance ByteDance products.
  • Explore downstream AI products utilizing generated content.

Skills

Strong coding skills in Python
Research experience in multi-modal understanding
Experience in deep learning frameworks

Education

Currently pursuing a PhD in Software Development, Computer Science, or related fields

Job description

San Jose | Intern | R&D | PhD Intern - 2025 Start

Job ID: A111904A

Responsibilities

Team Introduction: Welcome to the Doubao Vision team, where we spearhead multimodal foundation models for visual understanding and visual generation. Our mission is to solve the visual intelligence problem for AI. We conduct cutting-edge research in areas such as vision and language, large vision models, and generative foundation models. The team is a mix of experienced research scientists and engineers who aim to advance the research boundaries of foundation models and apply our technologies to our rich application scenarios, with a feedback loop that improves our foundation technologies.

Join us in shaping the future of AI technologies and revolutionizing our product experience for global users. We are looking for talented individuals to join us for a Student Researcher opportunity in 2025. These opportunities at ByteDance aim to offer students industry exposure and hands-on experience, with flexibility in duration, time commitment, and location. Candidates can apply to a maximum of two positions; applications are reviewed on a rolling basis.

Responsibilities

  • Conduct cutting-edge research and development in foundation models and multimodal machine learning, especially in generative AI (e.g., image and video generation).
  • Research and develop foundation models to enhance ByteDance products.
  • Explore new downstream products utilizing AI technology.

Qualifications

Minimum Qualifications

  • Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or related fields.
  • Research experience in multimodal understanding or vision and language, such as video captioning, VQA, text-to-video retrieval, or audio/music understanding and generation.
  • Publications in top-tier venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, or COLING.
  • Strong coding skills in Python and deep learning frameworks.
  • Must have work authorization during employment.

Preferred Qualifications

  • Ability to collaborate well and work independently.
  • Strong communication skills.

Additional Information

Founded in 2023, the ByteDance Doubao (Seed) team leads in AI foundation models, with research spanning deep learning, reinforcement learning, and more, and operates research labs globally.

Join ByteDance to be part of a creative, innovative environment that values diversity, inclusion, and impact. We offer comprehensive benefits, including health insurance, paid holidays, and sick leave. The hourly rate for this position is $60.

For Los Angeles County candidates, we consider criminal records per applicable laws. We are committed to providing accommodations for candidates with disabilities or religious beliefs during the recruitment process.
