About Markovate:At Markovate, you are part of a team that doesn't just follow trends but drives them. We transform businesses by leveraging innovative AI and digital solutions to turn vision into reality. Our team utilizes breakthrough technologies to develop bespoke strategies that align perfectly with our clients" aspirations. From AI consulting and Gen AI development to pioneering AI agents and agentic AI, we empower our partners to lead their industries with forward-thinking precision and unparalleled expertise.Job Description:As an ML Data Engineer at Markovate, your primary responsibility is to design, build, and optimize ML-ready feature pipelines and KPI engineering frameworks. This role demands expertise in smart segmentation logic, narrative generation orchestration, and forecasting signal preparation to facilitate advanced analytics and executive-level insights. The ideal candidate will possess a strong foundation in data engineering, ML pipeline orchestration, and LLM-driven applications.Key Responsibilities:- Design and implement feature engineering pipelines for key performance indicators (KPIs) like volume, margin, fill rate, and churn scores.- Develop smart segmentation logic utilizing SageMaker/KMeans or PySpark-based clustering methods.- Create forecasting signal preparation pipelines for tasks such as YoY drops and volume spikes.- Integrate narrative generation workflows with Bedrock/OpenAI.- Implement RAG embedding ingestion logic to support the Executive Copilot.- Establish and manage ETL and feature stores spanning staging, feature, and model layers.- Collaborate closely with data scientists, ML engineers, and analysts to provide ML-ready datasets and explainable outputs.- Contribute to the Smart Segmentation Advisor and Narrative Forecast Explainer components.- Pre-process and summarize LLM outputs for Copilot integration.Required Technical Skills:- Proficiency in PySpark, AWS Glue, and Step Functions.- Bachelor's degree in Computer Science, Information Technology, or a related field.- 4-6 years of experience in data engineering, ML feature pipeline development, and cloud-based orchestration.- Hands-on experience with SageMaker Feature Store or other vector storage solutions.- Knowledge of AWS services such as S3, Lambda, and data partitioning strategies.- Familiarity with LLM integration with Bedrock/OpenAI.- Basic understanding of prompt engineering and LangChain RAG pipelines.- Experience with narrative AI systems and ML-based explainability.- Strong grasp of feature engineering for KPIs in business contexts like churn, margin, and volume forecasting.- Exposure to LangChain and lightweight RAG pipelines.Note: The above job description is designed to encompass the main responsibilities and required technical skills for the ML Data Engineer position at Markovate.,
Sign-in & see how your skills match this job
Sign-in & Get noticed by top recruiters and get hired fast