Job Description: Cloud Engineer
We are passionate about building software that solves problems. Our Site Reliability Engineers (SREs) empower users with rich features, high availability, and stellar performance to pursue their missions. We are seeking a public cloud experienced engineer to plan, design, and implement next-generation cloud infrastructure solutions. The Cloud Engineer will be part of the Engineering team, requiring strong knowledge of application monitoring, infrastructure monitoring, automation, maintenance, and service reliability improvements.
We look for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences.
Reasons to Join Ford:
- Join a leading global automotive company with over 120 years of market presence.
- Full-time fixed contract with competitive compensation and benefits (restaurant card discounts, etc.).
- Work-life balance: 33 vacation days and a hybrid work model (2-3 days a week).
- Career development through high-impact projects to enhance your technical skills.
Responsibilities:
- Design, automate, and manage a highly available and scalable cloud deployment for development teams.
- Collaborate with engineering and architecture teams to evaluate and identify optimal cloud solutions, leveraging scalability, high performance, and security.
- Modernize existing on-premises solutions and improve existing systems.
- Automate deployments and manage applications in GCP.
- Develop and maintain cloud solutions following best practices.
- Ensure efficient data storage and processing in compliance with security policies and best practices.
- Work with engineering teams to identify optimization strategies and develop self-healing capabilities.
- Develop strong observability capabilities.
- Identify, analyze, and resolve infrastructure vulnerabilities and deployment issues.
- Review existing systems regularly and recommend improvements.
Qualifications:
- Strong understanding of data visualization techniques and creating clear dashboards.
- Familiarity with DORA metrics (Deployment Frequency, Lead Time for Changes, Change Failure Rate, Time to Restore Service).
- Knowledge of CI/CD integration with observability, correlating pipeline events with performance and error data.
- Experience working with RESTful APIs, including consuming and building them.
- Understanding of API specifications (Swagger/OpenAPI).
- Experience building dashboards in Grafana connecting to various data sources, including GCP.
- Experience implementing and managing SLOs using Nobl9.
- Automating data collection and processing with GCP services like Cloud Scheduler.
- Proficiency with Dynatrace for application performance monitoring.
- Experience with Nobl9 for reliability management, defining and monitoring SLOs and error budgets.
- Strong skills in Grafana dashboard creation and visualization.
- Understanding of Prometheus metrics collection and architecture.
- Knowledge of OpenTelemetry Collectors configuration pipelines.
- Experience with API gateway (e.g., Apigee) is a plus.
- Experience with package configuration and deployment tools like Helm, Kustomize, ArgoCD.
- Proficiency in scripting languages such as Python, Go, Java, JavaScript, Node.js.
- Exposure to cloud monitoring and logging.
- Experience with distributed storage technologies (NFS, HDFS, Ceph, S3) and resource management frameworks (Kubernetes, Mesos, Yarn).
- Automation tools experience is a priority.
- English proficiency at an advanced/high level.
Additional Information:
Ford is committed to diversity and equal opportunity for all, opposing any form of discrimination or harassment based on gender, marital status, race, ethnicity, disability, sexual orientation, religion, age, or caring responsibilities.
LI-Hybrid
LIAH2