Overview
ACTIVELY HIRING
Find Your perfect Job
Sign-in & Get noticed by top recruiters and get hired fast
Responsibilities
- Work with data engineering and software engineering concepts to design and implement data pipelines and systems using distributed data technologies.
- Collaborate on building scalable data processing solutions using Spark, AirFlow, Kafka, NiFi, SQL, and cloud-based architectures.
- Develop and maintain data models, ETL/ELT processes, and data warehousing solutions; utilize tools such as BigQuery, Snowflake, Hive, Impala, Looker, Tableau, and Power BI for data visualization and analytics.
- Implement and operate with REST APIs, Jupyter Notebook workflows, and version control (Git, Jenkins, TeamCity) within CI/CD pipelines.
- Leverage GenAI-related tooling and concepts (Vector embeddings, RAG, LLMs, LangChain, LlamaIndex, OpenAI, Claude, Mistral) to design AI-enabled data applications and agent orchestration libraries.
- Engage in agile practices (Scrum/Agile) and adhere to professional software engineering standards, security, and risk management principles.
Qualifications / Skills
- Programming languages and frameworks: Python, Java, Scala, C; PySpark; Spark Core, Spark SQL, Spark Streaming; JDBC.
- Big data and data warehousing: Hadoop ecosystem, HDFS, YARN, Hive, Spark, Spark SQL, Spark Streaming, data modeling, OLAP concepts.
- Databases and storage: RDBMS (Oracle, MSSQL), NoSQL, SQL, data management best practices.
- Cloud and distributed systems: Google Cloud Platform, AWS, Cloud Composer, OpenShift, data tooling (dbt, ETLELT frameworks).
- Data visualization and BI: Tableau, Power BI, Looker.
- Tools and platforms: Git, Jenkins, Artifactory, JIRA, OpenShift; CI/CD pipelines; Data governance and SDLC.
- AI/ML concepts: GenAI applications, prompt engineering, vector embeddings, retrieval-augmented generation (RAG), OpenAI, Claude, Mistral, LangChain, LlamaIndex; experience with production-grade AI systems and agent orchestration.
- Other skills: Data structures and algorithm design, architectural specifications, SDLC tools, security, risk management, analytical thinking, communication (verbal and written).
Notes
Sign-in & Get noticed by top recruiters and get hired fast