We are establishing a new Data Engineer role focused on implementing and evolving our data classification capabilities. This position is critical in ensuring that our data assets are properly identified, categorized, and protected across systems. The role will support our commitment to data privacy, compliance, and governance as we scale. You will be instrumental in building foundational frameworks that enable secure, compliant data use. By driving automation and consistency in classification, you'll help reduce risk and improve data visibility. This is a strategic opportunity to shape a new function and influence how data is managed across the organization. To start with this will be reporting to the CIO until the function is fully staffed.
ROLE PURPOSE
We are establishing a new Data Engineer role focused on implementing and evolving our data classification capabilities. This position is critical in ensuring that our data assets are properly identified, categorized, and protected across systems. The role will support our commitment to data privacy, compliance, and governance as we scale. You will be instrumental in building foundational frameworks that enable secure, compliant data use. By driving automation and consistency in classification, you'll help reduce risk and improve data visibility. This is a strategic opportunity to shape a new function and influence how data is managed across the organization. To start with this will be reporting to the CIO until the function is fully staffed.
Responsibilities- Data Classification Implementation: Design and implement data classification frameworks and processes to categorize sensitive and non-sensitive data across multiple sources and systems based on the policies from InfoSec team.
- Data Governance & Compliance: Ensure that the classification system adheres to industry standards, such as GDPR, HIPAA, and CCPA, and works closely with legal and compliance teams to mitigate risks.
- Data Classification Automation: Develop and maintain automated pipelines for classifying incoming and existing data, ensuring data is categorized consistently and efficiently.
- Data Classification Tools Integration: Integrate classification tools and platforms into the organization’s data ecosystem to streamline the classification process.
- Collaboration with Cross-functional Teams: Work closely with data scientists, analysts, and security teams to ensure accurate data classification, and to provide them with the necessary resources and tools.
- Data Security: Implement security measures to protect classified data, ensuring that sensitive information is only accessible to authorized personnel based on classification.
- Continuous Improvement: Monitor and review the effectiveness of the data classification process regularly, identifying opportunities for improvements and staying up-to-date with evolving data privacy regulations and industry best practices.
Qualifications- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field; strong academic background in data systems and software engineering. Proven experience with data classification tools and frameworks; hands-on experience with platforms and solutions like Apache Atlas, Collibra, Informatica, or Microsoft Purview (we're a M365 and Azure company).
- Strong programming and data pipeline development skills; proficiency in Python, SQL, and experience with ETL/ELT processes and tools like Apache Airflow, dbt, or similar.
- Knowledge of data privacy and compliance regulations; familiarity with GDPR, HIPAA, CCPA, and data governance best practices.
ADDITIONAL REQUIREMENTSSolid understanding of cybersecurity principles, compliance, and risk management. Ability to influence and drive change across business and technical teams.