Enable job alerts via email!

IoT Site Reliability Engineer

Talent Insider

Jakarta Selatan

On-site

IDR 300.000.000 - 400.000.000

Full time

Today
Be an early applicant

Job summary

A leading HR Consultancy Service in Jakarta is seeking an experienced IoT Site Reliability Engineer to ensure system reliability and scalability. The ideal candidate will have a technical Bachelor's degree and at least 4 years of experience in a similar role. Responsibilities include developing automation tools, optimizing system performance, and participating in on-call support. Proficiency in scripting and familiarity with cloud platforms are essential for this role.

Qualifications

  • Minimum 4 years proven experience as a Site Reliability Engineer or in a similar role.
  • Proficiency in scripting and automation with languages such as Python and Bash.
  • Familiarity with cloud platforms and strong knowledge of containerization technologies.

Responsibilities

  • Implement and maintain best practices for system reliability and availability.
  • Develop and enhance automation tools for monitoring and recovery processes.
  • Identify and resolve performance bottlenecks proactively.

Skills

Scripting and automation with Python
Containerization and orchestration technologies
Monitoring tools and practices
Knowledge of security best practices
Experience in IoT

Education

Bachelor's degree in Software Engineering or related field

Tools

AWS
Azure
GCP
Docker
Kubernetes
Terraform
Ansible
Job description
About the job IoT Site Reliability Engineer
About the Company:

Talent Insider is an upcoming HR Consultancy Service, founded in 2021. Our clients have been some of the leading brands in Indonesia, and this service continues to expand. Registered in Singapore & Indonesia, we can assist with your growth plans and strategies, and continue to expand our regional presence with strong regional partners to assist our client in recruitment and branding strategy.

Job Description:
  • Implement And Maintain Best Practices For System Reliability, Availability, And Scalability While Minimizing Downtime And Disruptions For Both Software And Hardware Systems
  • Develop And Enhance Automation Tools And Scripts For System Monitoring, Deployment, And Recovery To Streamline Operational Processes
  • Identify And Resolve Performance Bottlenecks, Proactively Optimizing System Components To Ensure Optimal Response Times And Resource Utilization
  • Participate In On-call Rotations To Respond To And Resolve System Incidents Promptly And Efficiently, Ensuring Minimal Impact On End-users. Site Visits And On-site Debugging Will Be Needed.
  • Use Infrastructure As Code Tools To Manage And Version Infrastructure, Making It More Predictable And Reproducible
  • Set Up And Maintain Robust Monitoring, Alerting, And Logging Systems To Detect And Mitigate Issues Before They Impact The User Experience
Job Requirements:
  • Bachelor's degree in a technical or scientific field such as Software Engineering, Computer Science, Electrical Engineering or IT preferred
  • Minimum 4 years proven experience as a Site Reliability Engineer or in a similar role
  • Proficiency in scripting and automation with languages such as Python and Bash
  • Familiarity with cloud platforms (e.g., AWS, Azure, GCP)
  • Strong knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes)
  • Experience with Infrastructure as Code tools (e.g., Terraform, Ansible)
  • Solid understanding of monitoring tools and practices
  • Knowledge of security best practices and incident response
  • Experience and knowledge of IoT (eg. sensors, Raspberry Pi, device management)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.