This role is based remotely but if you live within a 50-mile radius of an office [Atlanta, Austin, Detroit, Warren, Milford or Mountain View], you are expected to report to that location three times a week, at minimum.
The Role
The Marketing Applied Sciences team is responsible for developing analytics-driven solutions to help GM Marketing Organizations achieve their business goals. The Staff Data Engineer role is responsible for designing, developing and maintaining data pipelines, databases and data infrastructure to enable the efficient data collection, storage and analysis in support of marketing strategies. This role requires collaboration with data scientists, data infrastructure architects and data consumption stakeholders to ensure the availability of high-quality data for insights and decision making is available and will deliver value to the GM strategic vision for the future and meet prioritized business needs. This role drives to deliver innovative solutions with technical fidelity and thrives to work effectively across a cross-functional data and stakeholder ecosystem.
What You’ll Do
- Design, construct, install and maintain data architectures, including database and large-scale processing systems.
- Develop and maintain ETL (Extract, Transform, Load) processes to collect, cleanse and transform data from various sources inclusive of cloud.
- Design and implement data pipelines to collect, process and transfer data from various sources to storage systems (data warehouses, data lakes, etc)
- Implement security measures to protect sensitive data and ensure compliance with data privacy regulations.
- Build data solutions that ensure data quality, integrity and security through data validation, monitoring, and compliance with data governance policies.
- Administer and optimize databases for performance and scalability.
- Maintain Master Data, Metadata, Data Management Repositories, Logical Data Models, and Data Standards.
- Troubleshoot and resolve data-related issues affecting data quality fidelity.
- Document data architectures, processes and best practices for knowledge sharing across the GM data engineering community.
- Participate in the evaluation and selection of data related tools and technologies.
- Collaborate across other engineering functions within EDAI, Marketing Technology, and Software & Services.
Your Skills & Abilities (Required Qualifications)
- 7+ years of hands-on experience.
- Bachelor's degree (or equivalent work experience) in Computer Science, Data Science, Software Engineering, or a related field.
- Strong understanding and ability to provide mentorship in the areas of data ETL processes and tools for designing and managing data pipelines.
- Proficient with big data frameworks and tools like Apache Hadoop, Apache Spark, or Apache Kafka for processing and analyzing large datasets.
- Hands on experience with data serialization formats like JSON, Parquet and XML.
- Consistently models and leads in best practices and optimization for scripting skills in languages like Python, Java, Scala, etc for automation and data processing.
- Proficient with database administration and performance tuning for databases like MySQL, PostgresSQL or NoSQL databases.
- Proficient with containerization (e.g., Docker) and orchestration platforms (e.g., Kubernetes) for managing data applications.
- Experience with cloud platforms and data services for data storage and processing.
- Consistently designs solutions and builds data solutions that are highly automated, performant, with quality checks that provide data consistency and accuracy outcomes.
- Experienced at actively managing large-scale data engineering projects, including planning, resource allocation, risk management, and ensuring successful project delivery.
- Understands data governance principles, data privacy regulations, and experience implementing security measures to protect data.
- Able to integrate data engineering pipelines with machine learning models and platforms.
- Strong problem-solving skills to identify and resolve complex data engineering issues efficiently.
- Ability to work effectively in cross-functional teams, collaborate with data scientists, analysts, and stakeholders to deliver data solutions.
- Ability to lead and mentor junior data engineers, providing guidance and support in complex data engineering projects.
- Influential communication skills to effectively convey technical concepts to non-technical stakeholders and document data engineering processes.
- Models a mindset of continuous learning, staying updated with the latest advancements in data engineering technologies, and a drive for innovation.
PREFERRED QUALIFICATIONS:
- Master’s Degree or Ph.D. in Computer Science, Engineering, Marketing Science, Econometrics, Statistics, Psychometrics, (Bio)Statistics, Applied Mathematics, Operations Research, or another quantitative field.