Skip to main content

Site Reliability Engineer (SRE) - Consultant

JD 3

Site Reliability Engineer - JD :

Location - Chennai and Hyderabad

Experience - 9 to 14 Years

Key Responsibilities:

· Design, Implement and/or refine Service Management processes. (Monitoring, Incident, Problem, Capacity, Change & Releases and Service Level Management)

· Track system health, performance and reliability via monitoring, observability platforms, implement proactive alerting mechanisms to detect anomalies and respond swiftly to incidents.

· Act as a point of escalation for complex incidents, collaborating with senior engineers and management to ensure effective resolution.

· Establish and enforce change control and release management processes to ensure smooth and controlled deployment of system changes.

· Conduct post-incident analyses to identify root causes and implement actions to prevent recurrence and improve system resilience.

· Perform regular system testing to identify vulnerabilities and validate disaster recovery plans.

· Partner with development teams to improve services through rigorous testing and release procedures.

· Participate in system design consulting, platform management, and capacity planning.

· Integrate reliability practices into CI/CD pipelines to automate testing, quality assurance, and deployment processes.

· Foster a culture of collaboration between development and operations teams, promoting shared ownership and accountability for system reliability.

· Create sustainable systems and services through automation and uplifts.

· Balance feature development speed and reliability with well-defined service-level objectives

· Continuously evaluate and enhance system reliability, scalability and performance. Identify areas for improvement and implement solutions to optimize processes and reduce manual toil.

· Define, track, and monitor SLAs/ SLOs to measure and improve system reliability.

  • Collaborate with cross-functional teams to ensure scalable and adequate resource allocations and optimize cost efficiency.

Required skills and qualifications

· Bachelor’s degree (or equivalent) in computer science or related discipline

· Proven Process definition and Implementation experience, leveraging ITIL best practices

· Minimum ITIL V3 Intermediate / Expert certified - Mandatory

· Implementation experience of ITSM / ESM tools (e.g., SNOW, Remedy, JIRA)

· Strong DevSecOps skills with implementation experience – Foundation / Practitioner certification will be an advantage.

· Coding experience beyond simple scripts – Python, Java, C/C++ and JavaScript

· Knowledge of Linux/ Unix systems administration and troubleshooting skills

· Knowledge of relational and NoSQL databases and distributed storage systems Proficiency in database administration, query optimization, and data replication.

· Familiarity with Incident management and collaboration tools such as JIRA, PagerDuty, Slack, or ServiceNow.

· Expertise in performance monitoring and analysis tools such as New Relic, AppDynamics, or Datadog.

· Familiarity with configuration management tools like Ansible, Puppet, or Chef

· Knowledge of Observability (e.g, Dynatrace, SolarWinds) and monitoring systems (e.g., Prometheus, Nagios) and log management tools (e.g., ELK stack, Splunk).

· Strong analytical thinking and problem-solving abilities to identify patterns, troubleshoot issues, and propose effective solutions.

· Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

· Previous success in technical engineering

The Cognizant community:
We are a high caliber team who appreciate and support one another. Our people uphold an energetic, collaborative and inclusive workplace where everyone can thrive.

  • Cognizant is a global community with more than 300,000+ associates around the world.
  • We don’t just dream of a better way – we make it happen.
  • We take care of our people, clients, company, communities and climate by doing what’s right.
  • We foster an innovative environment where you can build the career path that’s right for you.

About us:
Cognizant is one of the world's leading professional services companies, transforming clients' business, operating, and technology models for the digital era. Our unique industry-based, consultative approach helps clients envision, build, and run more innovative and efficient businesses. Headquartered in the U.S., Cognizant (a member of the NASDAQ-100 and one of Forbes World’s Best Employers 2024) is consistently listed among the most admired companies in the world. Learn how Cognizant helps clients lead with digital at www.cognizant.com

Our commitment to diversity and inclusion:
Cognizant is an equal opportunity employer that embraces diversity, champions equity and values inclusion. We are dedicated to nurturing a community where everyone feels heard, accepted and welcome. Your application and candidacy will not be considered based on race, color, sex, religion, creed, sexual orientation, gender identity, national origin, disability, genetic information, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.

If you have a disability that requires reasonable accommodation to search for a job opening or submit an application, please email [email protected] with your request and contact information.

Disclaimer:
Compensation information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview.

Join our talent community

Haven’t found the right opportunity yet? Receive the latest updates on job opportunities, recruitment events and company news tailored just for you.

Sign up