JOB DESCRIPTION SUMMARY
The Machine Learning Engineer role will specialise in maintaining scoring models, production system maintenance and high-availability operations. You\'ll orchestrate and maintain our AzureML/Databricks-based scoring engine, ensure 99.99% uptime for production models, perform emergency fixes, and manage QA/UAT processes. This role partners with data scientists to operationalize models and data engineers to ensure efficient data flows.
KEY DUTIES & RESPONSIBILITIES
- Production Model Maintenance: Monitor, troubleshoot, and rectify issues in deployed credit scoring models (e.g., score drift, feature misalignment, output anomalies).
- Platform Orchestration: Manage AzureML pipelines & Databricks workflows for model retraining, batch scoring, and real-time inference.
- High-Availability Engineering: Ensure 24/7 uptime of scoring APIs serving banking clients; implement failover systems and load balancing.
- Release Management: Oversee QA/UAT processes for model updates including back-testing, shadow deployments, and canary releases.
- Model Governance: Maintain audit trails for model versions, inputs/outputs, and performance metrics. Support Compliance and Audit with creation of logs when requested.
- Incident Response: Lead troubleshooting of scoring engine failures with SLAs for financial institution clients.
- Infrastructure Optimization: Tune AzureML/Databricks clusters for cost-performance efficiency at scale.
- Vendor Management: Ensure vendor support is completing work as per scope and SLAs. Rectifying any vendor delivery issues.
EDUCATION & SKILLS
- Educated with at least bachelor\'s degree or equivalent in related field
- Education specialization or master\'s degree in computer science, Software Engineering
- Proficient in English
- Preferred proficiency in Arabic
- In-depth knowledge of React Native and Next JS and related modules, components and libraries
- Preferred In-depth knowledge of SiteCore or experience integrating with SiteCore
EXPERIENCE & KNOWLEDGE
- Bachelor\'s/Master\'s in Computer Science, Engineering, Data Science, or related field
- 2+ years in Data Science or Software Engineering experience
- 2+ years production ML operations experience, MLOps lifecycle management, including monitoring, retraining, and model versioning.
- Strong problem-solving skills and the ability to resolve issues efficiently.
- CI/CD for ML systems using tools such as Azure DevOps, GitHub Actions, MLflow, and other similar tools
- Ability to adapt ML workflows across different cloud environments (Azure, AWS, GCP) as needed.
ABILITIES & SPECIFIC REQUIREMENTS
- Practical experience in cloud-based ML platforms such as AzureML, Databricks, or equivalent (e.g., SageMaker, Vertex AI), with a preference for Azure.
- Python/PySpark for model debugging and patching. Working knowledge of Scikit-Learn and NumPy
- Deep understanding of credit scoring systems: feature engineering, scorecard interpretation, and output validation
- Credit bureau data structures (tradelines, inquiries, public records)
- Model risk management (MRM) standards
- Azure Solutions Architect or MLOps certifications
- Experience with financial services-grade SLAs (99.9% uptime) and outage management
- Knowledge of containerization (Docker, Kubernetes, AKS, or similar orchestration tools)
- Practical experience with Azure Data Factory (ADF) and Azure Data Lake Storage (ADLS) / Azure Blob Storage
- Knowledge of new and upcoming AI tools
- This is contained within the earlier requirement of python. If the plan is for this resource
- Excellent written and verbal communication skills English
- Strong interpersonal skills with the ability to engage and build relationships
- Crisis management under pressure
- Cross-functional collaboration with data science/risk teams
- Strong organizational skills with the ability to manage multiple tasks and projects simultaneously.
- Ability to develop and document procedures, roles, and guidelines.
- Security and Compliance Especially in financial services, mention data security, PII handling, and compliance with regulations