Career Area: Technology, Digital and Data
Job Summary
We are seeking a highly motivated and detail-oriented Junior Data Engineer (Data Specialist) to join our dynamic team. The ideal candidate will have a good foundation in data engineering principles, a keen analytical mindset, and the ability to work collaboratively in a fast-paced environment. As a Junior Data Engineer, you will play a critical role in designing, developing, and maintaining data pipelines and infrastructure to support our organization's data needs, joining our Financial Systems Tech Support – CAT IT Division.
The preference for this role is to be based out of Whitefield, PSN Bangalore office.
What you will do
- Data Pipeline Development: Design, develop, and maintain scalable and reliable data pipelines to ingest, process, and store large volumes of data from various sources.
- Data Integration and Management: Collaborate with data architects, product owners, business analysts, and other stakeholders to understand data requirements and deliver data solutions that meet their needs; integrate data from multiple sources, including databases, APIs, and external data providers, into a unified data platform; manage and optimize data storage solutions, ensuring efficient data retrieval and processing.
- Data Quality and: Implement data quality checks and validation processes to ensure the accuracy and consistency of data; adhere to data governance policies and best practices to maintain data security and compliance.
- Performance Optimization: Monitor and optimize data pipeline performance to ensure timely and efficient data processing; identify and resolve performance bottlenecks and system issues to maintain data pipeline reliability.
- Documentation and Collaboration: Document data engineering processes, workflows, and solutions for future reference and knowledge sharing; collaborate with cross-functional teams to support data-driven decision-making and project initiatives.
Qualifications
- Bachelor’s degree in computer science, Information Technology, Data Science, or a related field.
- 3-4 years of experience in data engineering, data analysis, or software development.
Technical Skills
- Strong experience on advanced SQL databases (Oracle, Teradata, Snowflake).
- Experience in data integration and hands‑on experience in any of the ETL tools such as Datastage, Informatica, SnapLogic etc.
- Able to transform technical requirements into data collection queries.
- Should be capable of working with business and other IT teams and convert the requirements into queries.
- Good understanding of ETL architecture and design.
- Good knowledge in Databases, SQL, or PL/SQL.
- Good experience with CI/CD tools such as GitHub Actions (AzDO pipelines).
- Good to have knowledge on Unix commands.
- Good to have experience on AWS Glue.
- Good to have knowledge on Qlik replication tool.
- This position requires the candidate to work a 5‑day‑a‑week schedule in the office.
- Shift Timing – 01:00‑10:00 PM IST
Soft Skills
- Good analytical and problem‑solving skills with attention to detail.
- Excellent communication and interpersonal skills to collaborate effectively with cross‑functional teams.
- Ability to manage multiple tasks and prioritize workload in a fast‑paced environment.
- Eagerness to learn new technologies and stay updated with industry trends.
Skills desired
Value Realization
- Knowledge of value realization methods; ability to plan, execute, monitor and manage business activities and resources to determine and achieve the actual value from a business initiative as estimated in an associated business case.
- Describes the importance of Value Realization milestones for long‑term projects.
- Assists in measuring and attaining post‑project implementation value.
- Identifies process changes or business value integration in change initiative plans and activities.
- Contributes to project team communication regarding the linkage of a change initiative to its associated business goals and case.
Communicating Complex Concepts
- Knowledge of effective presentation tools and techniques to ensure clear understanding; ability to use summarization and simplification techniques to explain complex technical concepts in simple, clear language appropriate to the audience.
- Uses comparisons to familiar ideas when introducing a technical or novel feature.
- Separates fundamental concepts from supporting details in explaining a product or service.
- Explains products and services with little or no use of technical jargon or advanced vocabulary.
- Emphasises the most important facts or features of a product or service.
- Perceives lack of audience comprehension; further simplifies explanation when needed.
Agile Development
- Knowledge of agile methodologies and the agile development lifecycle; ability to utilise formal agile methodologies, disciplines, practices, and techniques for the delivery of new and enhanced applications.
- Explains a specific agile process and its associated checkpoints and deliverables.
- Works with a specific agile development methodology.
- Interprets metrics used to measure progress and effectiveness.
- Defines key selection and qualifying criteria for projects suitable for agile development.
- Applies major tools and techniques associated with the specific methodology.
Cloud Computing
- Knowledge of the concepts, technologies and services of cloud computing; ability to design, deploy and implement cloud computing solutions in various business environment.
- Works with a specific cloud service model, such as Infrastructure as a Service (IaaS) or Software as a Service (SaaS).
- Follows established procedures in defining client requirements for cloud services.
- Implements a specific cloud service model, such as IaaS, SaaS or Serverless computing.
- Carries out tasks, under supervision, to increase capacity or add capabilities through cloud computing.
- Documents and resolves basic problems related to cloud security.
Database Design (Physical)
- Knowledge of database systems; ability to establish a data model for designing an organization’s database that runs effectively and efficiently for better business outcome.
- Researches relationships between different business data sets.
- Participates in the establishment of data structures based on their relationships.
- Adheres to an organization’s database design policies.
- Utilises database design tools and techniques.
- Tests the effectiveness of the database before its release for business use.
ETL Process
- Knowledge of the extraction, transformation and loading (ETL) process; ability to develop a database through the ETL process.
- Ensures the accuracy and effectiveness of data provided for the warehouse.
- Loads selected data into the warehouse on a regular basis.
- Derives necessary business or technical information from original data.
- Acquires a large amount of data from different sources/systems.
- Solves technical and administrative problems during the ETL process.
Information Management
- Knowledge of an organization’s existing and planned Information Architecture and Information Management (IM) methodology; ability to collect and manage information from different sources, and distribute this information to enhance operational efficiency.
- Assists in the installation, configuration and support of information management software.
- Participates in developing back‑up, recovery and archival practices and procedures.
- Analyzes issues and requirements for managing information resources.
- Follows organizational guidelines and policies on information management.
- Makes recommendations for enhancements to centralized and distributed databases.
Modeling
- Data, Process, Events, Objects: Knowledge of data, process and events; ability to use tools and techniques for analysing and documenting logical relationships among data, processes or events.
- Defines common types of data, processes or event models.
- Participates in model reviews and walkthroughs.
- Explains existing models to clients and describes their associated business processes.
- Focuses on modifying Data Structure, Access and Entity Relationship Diagrams.
- Prepares reports to recap current findings and issues in data, processes or event models.
What you will get (Benefits)
- Work Life Harmony
- Earned and medical leave.
- Relocation assistance
Holistic Development
- Personal and professional development through Caterpillar’s employee resource groups across the globe.
- Career developments opportunities with global prospects.
Health and Wellness
- Medical coverage – Medical, life and personal accident coverage.
- Employee mental wellness assistance program.
Financial Wellness
- Employee investment plan.
- Pay for performance – Annual incentive Bonus plan.
Posting Dates
December 2, 2025 – December 15, 2025
Caterpillar is an Equal Opportunity Employer. Qualified applicants of any age are encouraged to apply.
Not ready to apply? Join our Talent Community.