Enable job alerts via email!

Sr. Reliability Engineer

Verint

United States

On-site

USD 100,000 - 140,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Verint is seeking a Reliability Engineer to enhance customer experience through reliable cloud-based systems. The ideal candidate will have extensive software engineering knowledge, strong experience in continuous deployment practices, and a passion for driving operational excellence in a fast-paced environment. Join a collaborative team and contribute to impactful results for global brands while enjoying opportunities for personal and professional growth.

Qualifications

  • 5 years of experience in software development and/or site reliability engineering.
  • Proficiency with cloud platforms (AWS or Azure).
  • Experience with CI/CD processes.

Responsibilities

  • Ensure the scalability, availability, and reliability of cloud-based systems.
  • Design, develop, and maintain scalable software systems.
  • Lead incident response and participate in postmortem analysis.

Skills

Relational databases
REST API
Microservices
Scripting (Bash, Python)
Monitoring tools (Prometheus, Grafana)

Education

Bachelor’s degree in Computer Science or Engineering

Tools

Jenkins
Docker
Kubernetes
Terraform

Job description

At Verint, we believe customer engagement is the core of every global brand. Our mission is to help organizations elevate Customer Experience (CX) and increase workforce productivity by delivering CX Automation. We hire innovators with the passion, creativity, and drive to answer constantly shifting market challenges and deliver impactful results for our customers. Our commitment to attracting and retaining a talented, diverse, and engaged team creates a collaborative environment that openly celebrates all cultures and affords personal and professional growth opportunities. Learn more at www.verint.com.

Overview of Job Function:

Verint’s Reliability Engineer is responsible for all aspects of the development and operational reliability of platforms and applications. In this highly skilled, hands-on role, our Reliability Engineer ensures the scalability, availability, performance, and reliability of cloud-based systems and participates in and leads the design, development, testing, deployment, monitoring, and support of cloud-native solutions, while also serving as a subject matter expert for customer implementation and cloud platform support.

This Reliability Engineer works closely with a global team of engineers to build robust, observable, and resilient systems that meet business objectives, following DevOps and SRE best practices. The engineer also contributes to continuous integration and deployment (CI/CD) processes, incident response, and postmortem analysis, while mentoring junior engineers and driving process improvements.

Principal Duties and Essential Responsibilities:

  • Ongoing evaluation (test) of feature design – proactively work with others to identify issues or potential risk areas with the architecture (performance etc.)
  • Distill requirements from feature level into implementation level tasks
  • Ensure design and implementation work meets the stakeholder’s requirements
  • Ensure that the feature design is correct for operations, as well as deployment and sustainability
  • Support departmental and team initiatives by being the first one to take a call when high severity issues come up
  • Work with Team Technical Architect and Lead to define, document, and communicate coherent feature design
  • Design, develop, and maintain scalable and reliable software systems with a focus on operational excellence.
  • Implement and maintain observability tools (monitoring, logging, alerting) to ensure system health and performance.
  • Participate in on-call rotations, incident response, and root cause analysis to improve system reliability.
  • Collaborate with development and operations teams to define and implement SLOs, SLIs, and error budgets.
  • Automate infrastructure provisioning, configuration, and deployment using Infrastructure as Code (IaC) tools.
  • Proactively identify and address system bottlenecks, performance issues, and reliability risks.
  • Ensure the right work is being done, including unit tests, automation, throughput, capacities, security, and performance.
  • Support the design and implementation of fault-tolerant, self-healing systems.
  • Lead and contribute to blameless postmortems and continuous improvement initiatives.
  • Guide and mentor team members on SRE principles, tooling, and best practices.
  • Communicate relevant risks and issues to stakeholders and ensure alignment with business goals.
  • Support planning with estimates, dependencies, risk areas, and sequencing tasks.
  • Advocate for and implement solutions to reduce technical debt and improve system maintainability.

Minimum Requirements:

  • Bachelor’s degree in Computer Science, Engineering or other related field or equivalent work experience
  • 5 years of experience in software development and/or site reliability engineering.
  • Proficiency with relational databases
  • Strong experience with REST API/microservice applications
  • Experience with CI/CD tools like Jenkins and/or Harness.
  • Experience with observability tools like Prometheus, Grafana, ELK, AppDynamics.
  • Familiarity with container orchestration: Docker, Kubernetes.
  • Experience with cloud platforms: AWS and/or Azure.
  • Knowledge of Infrastructure as Code tools: Terraform, CloudFormation, or similar.
  • Strong scripting skills (e.g., Bash, Python).
  • Experience with version control systems (e.g., GitHub).
  • Experience of continuous integration systems e.g. Jenkins, Harness
  • Ability to work in a fast-paced, high-energy environment and take ownership of assignments.
  • Successful completion of a background screening process including, but not limited to, employment verifications, criminal search, OFAC, SS Verification, as well as credit and drug screening, where applicable and in accordance with federal and local regulations

Preferred Requirements:

  • Experience with incident management and postmortem processes.
  • Experience with Java, C#, Spring Boot, Spring Cloud.
  • Familiarity with SRE principles and practices (e.g., SLOs, SLIs, error budgets).
  • Knowledge of security best practices in cloud environments.
  • Experience with behavior-driven development (BDD) and test automation tools such as Selenium and Cucumber.

#LI-KD1

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

Zeektek

null null

Remote

Remote

USD 130,000 - 160,000

Full time

Today
Be an early applicant

Senior Site Reliability Engineer

MongoDB

null null

Remote

Remote

USD 127,000 - 249,000

Full time

14 days ago

Senior Site Reliability Engineer (AWS, AI/ML, & APM)

Davita Inc.

null null

Remote

Remote

USD 120,000 - 160,000

Full time

5 days ago
Be an early applicant

Senior Site Reliability Engineer

Roadie

null null

Remote

Remote

USD 120,000 - 160,000

Full time

7 days ago
Be an early applicant

Remote - Senior Site Reliability Engineer (SRE)

Green Dot Corporation

null null

Remote

Remote

USD 87,000 - 132,000

Full time

4 days ago
Be an early applicant

Senior Site Reliability Engineer New United States (Remote)

Upgrade, Inc.

null null

Remote

Remote

USD 120,000 - 160,000

Full time

5 days ago
Be an early applicant

Remote - Senior Site Reliability Engineer (SRE)

Green Dot

null null

Remote

Remote

USD 87,000 - 132,000

Full time

5 days ago
Be an early applicant

Senior Site Reliability Engineer

Credit Acceptance

null null

Remote

Remote

USD 117,000 - 174,000

Full time

26 days ago

Senior Site Reliability Engineer ( Remote - US)

Jobgether

null null

Remote

Remote

USD 120,000 - 160,000

Full time

10 days ago