Enable job alerts via email!

Senior Data Scientist - Information Retrieval & Generative AI

Techfabric Digital Solutions India

United States

Remote

USD 120,000 - 160,000

Full time

Today
Be an early applicant

Job summary

A leading technology firm is seeking a Senior Data Scientist in the United States to drive innovative solutions in information retrieval and generative AI. You will lead ML product development, work with state-of-the-art RAG architectures, and impact millions of users worldwide. Ideal candidates have advanced Python skills, strong ML/DL experience, and a Master's degree in a related field.

Benefits

Cutting-edge Technology
Global Impact
Career Growth
Innovation Freedom

Qualifications

  • 5-7+ years of data science experience with 2+ years in senior roles.
  • Demonstrated experience with production ML systems serving millions of users.

Responsibilities

  • Drive end-to-end ML product development from research to production deployment.
  • Collaborate with engineering and product teams to translate business requirements into scalable data solutions.
  • Lead breakthrough research in information retrieval and generative AI.

Skills

Advanced Python programming with pandas, scikit-learn, TensorFlow/PyTorch
SQL & Database Management
Machine Learning & Deep Learning
Statistics & Probability

Education

Masters degree in Computer Science, Statistics, Mathematics

Tools

AWS
Azure
GCP
Docker
Kubernetes
Job description
Senior Data Scientist - Information Retrieval & Generative AI

Location: Remote (India) | Type: Full-time | Experience: 5-7+ years

Transform How the World Discovers Information. Join our AI revolution in transforming how billions discover and interact with information. As a Senior Data Scientist, you’ll architect next-generation search and retrieval systems that power intelligent experiences across our platform, directly impacting millions of users worldwide.

The Challenge

Design and deploy state-of-the-art RAG architectures processing petabyte-scale datasets. Build hybrid dense/sparse retrieval pipelines that serve millions of daily queries with sub-second latency. Your models will directly impact product strategy and drive measurable business outcomes for our global user base.

What You'll Do
  • Strategic Leadership
    • Drive end-to-end ML product development from research to production deployment.
    • Collaborate with engineering and product teams to translate business requirements into scalable data solutions.
    • Mentor junior data scientists and establish best practices for model development.
    • Lead breakthrough research in information retrieval and generative AI.
  • Technical Execution
    • Design and optimize transformer-based architectures for information retrieval and generation.
    • Implement advanced chunking strategies for semantic search and RAG applications.
    • Build and maintain real-time ML pipelines processing millions of documents.
    • Develop production-ready models with proper monitoring, versioning, and deployment strategies.
  • Innovation & Research
    • Research and prototype cutting-edge AI techniques in search, retrieval, and natural language processing.
    • Design large-scale experiments and A/B tests to validate model performance and business impact.
    • Stay current with latest developments in GenAI and contribute to open-source communities.
What We're Looking For
Essential Skills
  • Advanced Python programming with expertise in pandas, scikit-learn, TensorFlow/PyTorch.
  • SQL & Database Management for complex query optimization and data pipeline design.
  • Machine Learning & Deep Learning with track record of shipping ML products to production.
  • Statistics & Probability including advanced statistical modeling and hypothesis testing.
  • 5-7+ years of data science experience with 2+ years in senior roles.
Specialized Expertise
  • Information Retrieval Systems - Search algorithms, ranking, and relevance optimization.
  • Generative AI & LLMs - Prompt engineering, fine-tuning, and deployment at scale.
  • Content Chunking Strategies - Document processing and semantic segmentation for RAG systems.
  • Vector Databases - Hands-on experience with Pinecone, Weaviate, FAISS, or OpenSearch.
  • Transformer Models - Deep understanding of BERT, GPT, T5 architectures.
Advanced Technical Skills
  • RAG (Retrieval-Augmented Generation) implementation and optimization.
  • Named Entity Recognition (NER) at enterprise scale.
  • Cloud platforms (AWS, Azure, GCP) for ML deployment.
  • MLOps tools and practices (Docker, Kubernetes, model registries).
  • A/B testing and experimental design methodology.
Education & Experience
  • Masters degree in Computer Science, Statistics, Mathematics, or related quantitative field.
  • Demonstrated experience with production ML systems serving millions of users.
  • Strong publication record or open-source contributions (preferred).
Why Join Us
  • Cutting-edge Technology: Work with the latest in AI/ML, from transformer architectures to vector databases.
  • Global Impact: Your work will be used by millions of users across different continents.
  • Career Growth: Clear advancement paths with mentorship and leadership opportunities.
  • Innovation Freedom: 20% time for personal research projects and experimentation.
Our Interview Process
  1. Recruiter Screen (30 min) - Background, motivation, and culture fit.
  2. Technical Screen (60 min) - Live coding in Python/SQL, ML fundamentals.
  3. Technical Deep Dive (90 min) - Advanced ML/AI questions, system design.
Ready to Shape the Future of AI?

If you're passionate about pushing the boundaries of information retrieval and generative AI, we want to hear from you. Join a team where your expertise will drive innovation and create meaningful impact on a global scale. Apply now and help us build the next generation of intelligent search and discovery systems.

Send in your resume to Mr.Praveen at Praveen.kunta@techfabric.com

We are an equal opportunity employer committed to diversity and inclusion. We welcome applications from all qualified candidates regardless of race, gender, age, religion, sexual orientation, or disability status.

Application Requirements
  • Resume/CV highlighting relevant ML/AI experience.
  • Cover letter explaining your interest in information retrieval and generative AI.
  • Links to relevant projects, publications, or GitHub repositories (preferred).
  • Portfolio demonstrating production ML systems or research contributions.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.