We have an urgent requirement for a Junior Data Scientist (Python and Azure Databricks) for our client based in Qatar.
Key Responsibilities
- Develop Python-based backend services for document parsing, information extraction, and data enrichment (CVs, JDs, etc.).
- Design and optimize automated data pipelines on Azure Databricks for real-time and batch processing.
- Create and maintain RESTful APIs using FastAPI for internal and external data consumption.
- Implement prompt engineering techniques with OpenAI APIs to extract structured insights from unstructured text.
- Model and query graph data using Neo4j for relationship analysis and graph-based reasoning.
- Design and manage Azure SQL Databases for storing and querying structured datasets.
- Collaborate with cross-functional teams to deliver reliable and scalable data solutions.
- Write clean, testable, and maintainable code with proper documentation.
- Lead the rapid development of Proof-of-Concepts (POCs) to test, validate, and showcase innovative backend and AI-enabled solutions.
Required Skills
- Proficiency in Python for scripting, backend development, and data processing.
- Experience building APIs using FastAPI or Flask.
- Good knowledge of Azure Databricks and Azure SQL database.
- Familiarity with Graph Databases (preferably Neo4j) and Cypher queries.
- Experience with OpenAI APIs / LLM-based applications and prompt engineering.
- Good understanding of ETL workflows, version control (Git), and software development best practices.
- Good understanding of vector databases like Milvus or FAISS.
This job posting appears to be active and not expired.