Data Cycling Center (DCC) is a Data Science team that develops AI-driven content (unstructured data) understanding capabilities, identifies business opportunities from the understanding, and builds products and solutions to capture those opportunities. Our mission is to simplify the acquisition and utilization of unstructured/unlabeled data. The team act as the data modeling factory, using and analyzing mass data and finding useful insights for business growth.
About the Role: We are looking for experienced data scientists to join our team and apply advanced analytics and machine learning techniques-including Prompt Engineering (PE), multi-modal large language models (LLMs), computer vision (CV), natural language processing (NLP), and audio signal processing-to optimize intelligent labeling workflows and data products within TikTok's ecosystem. Your work will help improve user experience, enhance content integrity, and support data-driven strategic decision-making. You will collaborate closely with cross-functional teams across product, operations, and algorithms to build scalable, end-to-end Prompt Engineering and LLM workflows for intelligent content moderation and labeling applications.
Key Responsibilities:
Minimum Qualifications: 1) Advanced degree (Master's or Ph. D.) in Statistics, Computer Science, Applied Mathematics, Data Science, or a related quantitative field. 2) Strong theoretical foundation in computer science, machine learning, and statistics, with industry experience in deep learning and at least one of the following: Prompt Engineering, LLMs, CV, NLP, or speech recognition.
3) In-depth experience in unsupervised learning, clustering algorithms, and pattern recognition from unstructured data such as text or video. 4) Strong experience with unsupervised learning, clustering algorithms, and extracting data insights from unstructured video format data, recognizing patterns, and developing models 5) Experience in data project management, and solid foundations of maths and algorithms
6) Expertise in SQL, Hive, Presto, or Spark, and experience with large-scale datasets; along with strong proficiency in Python and Deep Learning frameworks such as TensorFlow or PyTorch 7) Excellent communication and collaboration skills, with the ability to work effectively across global teams and stakeholders. Preferred Qualifications:
Thought, Retrieval-Augmented Generation, Supervised Fine-Tuning) to real-world problems.
* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company, which could be significantly higher or lower.