About the Role
Tkxel is seeking a Lead/Senior Machine Learning Engineer with advanced hands‑on expertise across the machine learning spectrum—including classical ML, Time Series Analysis, Deep Learning, Computer Vision, NLP, and Agentic AI systems. In this strategic leadership role, you'll drive end‑to‑end development and deployment of innovative LLM and ML solutions, manage high‑performing teams, and collaborate deeply with research and product leaders to transform state‑of‑the‑art AI into business value.
Key Responsibilities
- Lead, mentor, and inspire a cross‑functional team of ML engineers, data scientists, and MLOps experts.
- Architect and oversee the lifecycle of advanced ML/LLM projects—from data acquisition, pre‑processing, classical modeling, and deep learning to robust production deployment.
- Drive initiatives in Agentic AI, Time Series Forecasting, Computer Vision, and NLP, championing best practices for each domain.
- Collaborate closely across Research, Product, and Infrastructure teams to define objectives, timelines, and measurable success metrics.
- Set technical direction in large‑scale model training, distributed systems, model optimization, and ML infrastructure design.
- Ensure adoption of MLOps best practices including experiment tracking, governance, CI/CD, automated testing, and model monitoring.
- Manage and optimize compute resources (GPU/TPU), budgets, and enforce responsible AI standards with a focus on security and compliance.
- Communicate technical visions, project milestones, risks, and insights to both technical and non‑technical stakeholders with clarity and impact.
Required Skills & Qualifications
- 5+ years hands‑on experience in Machine Learning (including classical ML, time series, deep learning, NLP, and Computer Vision).
- Advanced knowledge of LLMs, Transformers, and modern deep learning architectures.
- Proven leadership managing and scaling teams delivering high‑impact ML/LLM solutions.
- Extensive experience with frameworks: Scikit‑learn, PyTorch, TensorFlow, Hugging Face, DeepSpeed.
- Deep understanding of MLOps tools (MLflow, Kubeflow, Vertex AI) and cloud platforms (AWS, GCP, Azure).
- Demonstrated expertise in distributed training, G