The Associate Data Engineer designs, builds, and maintains data solutions for collecting, storing, processing, and analyzing large volumes of data efficiently and accurately. You will also drive design and implementation of specific data models, promote effective use of data querying APIs, deploy machine learning models, and ensure consistent data flow through the organization.
Position Responsibilities
- Designs, builds, and maintains reliable, efficient and scalable data infrastructure for data collection, storage, transformation, and analysis.
- Implements data orchestration pipelines, data sourcing, cleansing, augmentation, and quality control processes.
- Works with business and technology collaborators to understand current and future data infrastructure needs.
- Designs, builds, and maintains scalable data solutions, including data pipelines, data models, and applications, for efficient and reliable data workflows, including those tailored specifically to machine learning.
- Builds, implements, and maintains current and upcoming data platforms, such as data warehouses and repositories for structured and unstructured data.
- Collaborates with Data Scientists and Engineers to create features and pre-process data for ML models and move data analysis models into production.
- Designs and develops analytical tools, algorithms, data landscape modernization roadmaps, and programs to support Data Engineering activities like writing scripts and automating tasks.
- Applies a variety of data interchange formats to ensure data requirements are met and continuously monitors data integrity across the organization.
- Integrates machine learning algorithms into current production systems and workflows, taking into account compatibility with other systems, data sources, and APIs.
- Builds and advocates for efficient utilization of data querying APIs to ensure seamless access to organizational data sources.
- Evaluates, integrates, and manages tools and frameworks within the data engineering ecosystem, ensuring compatibility and efficiency in model development and deployment.
- Designs and promotes data versioning and lineage tracking, including transparency and traceability for data used in ML model training and inference.
Required Qualifications
- Knowledge of database systems, data lakes, and NoSQL databases
- Knowledge of data warehouse concepts and architectures (e.g., Synapse)
- Familiarity with data quality and data modelling tools
- Proficiency in using version control systems like Git to manage codebases
- Experience with cloud-native data tools and services such as PySpark, Scala, Azure Data Factory, and Databricks
- Practical experience with big data processing frameworks and techniques such as HDFS, MapReduce, storage formats (Avro, Parquet), and stream processing
- Experience integrating with back-end/legacy environments
- Knowledge of AI model deployment in production environments
- Experience handling real-time data for AI applications
- Ability to build and deploy DataOps and MLOps pipelines in cloud-native environments
When you join our team
- We’ll empower you to learn and grow the career you want.
- We’ll recognize and support you in a flexible environment where well-being and inclusion are more than just words.
- As a member of our distributed team, we’ll assist you in crafting the future you envision.
About Manulife and John Hancock
Manulife Financial Corporation is a leading international financial services provider, helping people make their decisions easier and lives better. To learn more about us, visit the Manulife site.
Equal Opportunity Employer
Manulife/John Hancock embraces diversity and is committed to fair recruitment, retention, advancement and compensation without discrimination on the basis of race, ancestry, place of origin, colour, ethnic origin, citizenship, religion or religious beliefs, creed, sex, sexual orientation, gender identity or expression, age, marital status, disability, or any other ground protected by applicable law. A Human Resources representative will assist applicants who request a reasonable accommodation during the application process. To request an accommodation, contact recruitment at Manulife.
Referenced Salary Location: Toronto, Ontario
Working Arrangement: Hybrid
Salary range is expected to be between $60,900 CAD and $113,100 CAD. If applying outside the primary location, please contact recruitment for the salary range in your location. Actual salary varies by market conditions, geography and factors such as knowledge, skills, and experience. Eligible employees may participate in incentive programs.
Manulife offers a wide array of benefits, including health, dental, vision, disability, life and AD&D coverage, retirement savings plans, and leaves of absence. For U.S. applicants, contact recruitment for U.S.-specific PTO information.