Job Search and Career Advice Platform

Enable job alerts via email!

Senior Software Engineer, Platform

FIRMUS METAL INTERNATIONAL PTE. LTD.

Singapore

On-site

SGD 80,000 - 120,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A tech company located in Singapore is looking for a Senior Software Engineer focused on platform engineering. The ideal candidate will enhance observability capabilities and internal tooling to boost productivity. With a requirement of 7+ years of experience, proficiency in modern development frameworks, and cloud platforms, this role is perfect for self-starters passionate about innovation. The job will also involve collaboration with AI/ML engineers and improving automated test coverage.

Qualifications

  • 7+ years of experience as Software Engineer, with 3 years in Platform/Observability engineering.
  • Strong proficiency in modern development frameworks and observability tools.
  • Working knowledge of AI-augmented development tools and CI/CD.

Responsibilities

  • Drive collaboration with AI/ML engineers for monitoring and observability.
  • Develop Prometheus exporters for low-level monitoring.
  • Enhance internal tooling for productivity automation.

Skills

Modern application development frameworks and languages (e.g., Go, Python, Node.js)
Advanced querying and optimization using SQL, PromQL, LogQL, GraphQL
Observability stack (e.g., Loki, Grafana, Tempo, Prometheus, Thanos, ClickHouse)
Data streaming (e.g., Kafka, Pulsar)
Automated testing frameworks (e.g., Pytest, JUnit, K6)
Cloud platforms (e.g., AWS, Azure, or GCP)
Containerization technologies (e.g., Docker)
CI/CD practices (e.g., Ansible, GitHub Actions, Jenkins)
Clear and effective English communication

Education

Bachelor's degree in computer science or a related field
Job description
ROLE SUMMARY

Firmus Technologies is seeking a Senior Software Engineer focussing on Platform Engineering to join our Engineering and Technology team. You will drive the enhancement of our observability capabilities to achieve ClusterMAX Platinum tier recognition from SemiAnalysis. You will also enhance internal tooling to improve developer and operations productivity. This role is ideal for a self-starter with passion for building things from first principles. You naturally break down complex problems into their fundamental truths to uncover novel and elegant solutions—rather than relying on conventional patterns.

KEY RESPONSIBILITIES
  • Drive and collaborate with AI/ML engineers to develop and integrate AI/ML application-level monitoring from the ground up, including model accuracy tracking and performance observability.
  • Develop purpose-built Prometheus exporters to provide necessary granularity for robust low-level components and interconnect fabric monitoring.
  • Build and enhance internal tooling to automate workflows, improve developer and operations productivity, and streamline platform operations (e.g., dashboards, CLI tools, automation scripts, self-service portals).
  • Continuously improve automated test coverage and effectiveness by adopting new testing frameworks, tools, and best practices.
  • Own net-new product experiments (e.g., VR with Meta Quest), driving innovation from concept to production deployment and mass adoption.
  • Contribute to the adoption and integration of AI-augmented development tools and workflows.
SKILLS AND EXPERIENCE
  • Bachelor's degree in computer science or a related technical field.
  • 7+ years of experience as Software Engineer, with a minimum of 3 years in a dedicated Platform/Observability engineering focus role.
  • Demonstrated strong proficiency: Modern application development frameworks and languages (e.g., Go, Python, Node.js).
  • Demonstrated strong proficiency: Advanced querying and optimization using SQL, PromQL, LogQL, GraphQL.
  • Demonstrated strong proficiency: Observability stack (e.g., Loki, Grafana, Tempo, Prometheus, Thanos, ClickHouse).
  • Demonstrated strong proficiency: Data streaming (e.g., Kafka, Pulsar).
  • Demonstrated strong proficiency: Automated unit, integration, security, load and end-to-end testing frameworks (e.g., Pytest, JUnit, K6, Go test, Cypress) and integrating tests into CI/CD pipelines.
  • Demonstrated strong proficiency: Cloud platforms (e.g., AWS, Azure, or GCP).
  • Demonstrated strong proficiency: Containerization technologies (e.g., Docker).
  • Experience with AI-augmented development tools and workflows.
  • Working knowledge on configuration management and CI/CD (e.g., Ansible, GitHub Actions, Jenkins, ArgoCD)
  • Clear and effective English communication, written and spoken.
  • Bonus Points:
  • Familiarity with Linux internals, networking stacks, distributed storage and high-performance computing.
  • Experience in high-growth startups or regulated industries with robust security and data privacy requirements, including SOC 2 Type 2 and ISO 27001.
KEY COMPETENCIES
  • Strong systems thinking with a proven ability to solve complex problems.
  • Deep expertise in observability and platform engineering.
  • High sense of ownership and ability to execute autonomously.
  • Effective cross-functional communication and collaboration.
  • Commitment to quality, reliability, and maintainability.
  • Continuous learner with a bias for innovation and first-principles thinking.
SUCCESS METRICS
  • Improved observability coverage and operation insights.
  • Progress toward achieving ClusterMAX Platinum criteria.
  • Increased adoption and effectiveness of internal tooling.
  • Higher automated test coverage and reduced regressions.
  • Measurable improvements in developer and operations teams' productivity.
  • Successful delivery of experimental projects and AI-augmented workflows.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.