Notice Period: Immediate joiners only
Key Responsibilities:
- Fine-tune and optimize Large Language Models (LLMs) for mid- to large-scale live production applications.
- Host and deploy LLMs on custom infrastructure, ensuring high availability and performance.
- Evaluate LLMs following the best practices outlined in the Hugging Face LLM Evaluation Guide.
- Collaborate with cross-functional teams to design, develop, and implement AI-driven solutions tailored to business needs.
- Ensure model scalability, security, and compliance with industry standards.
Required Skills and Experience:
- Experience: 4 years of hands-on experience with Generative AI and LLMs. (Total IT experience is not a priority.)
- Domain Expertise: Prior experience in the fintech or financial services domain is essential.
- LLM Fine-Tuning: Demonstrated expertise in fine-tuning LLMs for live production environments (academic or PoC projects are not relevant).
- Infrastructure Management: Experience with hosting and deploying LLMs on custom infrastructure.
- LLM Evaluation: Proficiency in conducting LLM evaluations using industry-recognized methodologies and frameworks.
Technical Skills:
- Proficiency in Python and relevant AI/ML libraries (e.g., PyTorch, TensorFlow, Hugging Face).
- Strong understanding of LLM architectures and their optimization techniques.
- Familiarity with cloud-based or on-premises infrastructure for AI deployments.