A leading technology company in Singapore is looking for a Chaos Engineering Specialist to design and implement chaos experiments. You will be responsible for simulating real-world failures using tools such as Chaos Monkey and AWS Fault Injection Simulator. The ideal candidate should manage chaos engineering tools proficiently, document procedures, and present findings to stakeholders. This role is essential in ensuring the reliability and resilience of systems through thorough experimentation.
As a Chaos Engineering Specialist, your primary responsibility will be to design and run chaos experiments that simulate real-world failures and expose vulnerabilities in our systems. For each experiment, you will define clear objectives, hypotheses, and success metrics, and you will thoroughly document procedures, results, and lessons learned.
A key requirement for this role is hands-on experience with chaos engineering tools. You should be proficient in managing tools such as Chaos Monkey, Gremlin, Toxiproxy, Chaos Mesh, ChaosBlade, Azure Chaos Studio, and AWS Fault Injection Simulator. Proficiency in more than one of these tools will be an added advantage.
Chaos engineering on cloud platforms will also be a significant part of your role. You will design and execute chaos experiments in cloud environments, specifically Azure, AWS, and GCP, using tools such as Azure Chaos Studio and AWS Fault Injection Simulator, among others.
Documentation and reporting are also central to this position. You will maintain detailed documentation of chaos experiments, procedures, and results, and you will generate reports and present findings to stakeholders, highlighting how each experiment affected the system.
Overall, as a Chaos Engineering Specialist, you will play a vital role in ensuring the reliability and resilience of our systems through the implementation of chaos experiments and the analysis of their outcomes.