Skip to main content

Infra Dev Specialist

00068803962



Job Summary

Infra Dev Specialist responsible for designing automating and optimizing large scale monitoring and observability for enterprise infrastructure using AWS CloudWatch LogicMonitor and Splunk in a hybrid work model enabling reliable systems actionable insights and rapid incident detection for business critical services while collaborating closely with cross functional technology teams.


Responsibilities

  • Design and implement robust infrastructure monitoring solutions using AWS CloudWatch LogicMonitor and Splunk to provide deep visibility into application and platform health for business critical environments.
  • Develop reusable monitoring templates dashboards and alerting policies that standardize observability across cloud and on premises systems while reducing manual configuration effort.
  • Build and maintain automated ingestion pipelines that collect normalize and route logs metrics and events into Splunk and related observability tools to support rapid troubleshooting.
  • Configure intelligent alert thresholds and noise reduction strategies in AWS CloudWatch and LogicMonitor to minimize false positives while ensuring timely notification of genuine service degradation.
  • Collaborate closely with application infrastructure and security teams to translate functional and nonfunctional requirements into monitoring specifications that align with enterprise standards.
  • Conduct detailed root cause analysis using Splunk searches correlation rules and visualizations to identify performance bottlenecks and recurring incidents that impact user experience.
  • Optimize cost and performance of monitoring implementations by refining data retention sampling strategies and metric collection policies for various environments.
  • Create and maintain clear operational runbooks that describe alert meaning diagnostic steps and remediation guidelines so support teams can respond consistently and efficiently.
  • Implement infrastructure as code approaches for monitoring configurations using automation tools to enable repeatable deployments version control and environment consistency.
  • Perform capacity and trend analysis using historical metrics and log data to forecast resource needs prevent outages and support data driven infrastructure planning.
  • Partner with reliability and operations teams to define service level indicators and service level objectives and to align monitoring coverage with agreed reliability targets.
  • Drive continuous improvement of monitoring quality by reviewing incident reports identifying visibility gaps and implementing targeted enhancements that reduce mean time to detect and mean time to resolve.
  • Document monitoring architectures data flows and configuration standards in a concise and accessible manner to support knowledge sharing across global teams.

  • Qualifications

  • Apply a strong background in AWS services with hands on expertise in AWS CloudWatch features including custom metrics logs and alarms to design effective monitoring solutions.
  • Leverage deep practical experience with Splunk including data onboarding index design search optimization and dashboard creation to deliver actionable operational insights.
  • Use proven skills with LogicMonitor or similar platforms to configure device discovery metric collection and alert routing for diverse infrastructure components.
  • Bring seven to eight years of overall infrastructure or operations experience with significant focus on observability monitoring engineering and incident management in enterprise settings.
  • Demonstrate proficiency in at least one scripting language such as Python or PowerShell to automate monitoring deployment data transformations and routine maintenance tasks.
  • Apply knowledge of networking operating systems and common enterprise platforms so that monitoring strategies accurately reflect dependencies and failure modes.
  • Exhibit strong analytical and problem solving abilities with a track record of reducing incident frequency and improving system stability through data driven decisions.
  • Communicate clearly with both technical and nontechnical stakeholders explaining monitoring metrics dashboards and alerts in understandable terms that support sound decisions.
  • Adapt effectively to a hybrid work model by collaborating through digital channels documenting work thoroughly and maintaining high coordination with distributed teams.
  • Maintain familiarity with security and compliance considerations related to log and metric data handling to ensure observability solutions meet organizational governance needs.

  • Certifications Required

    AWS Certified SysOps Administrator or AWS Certified DevOps Engineer and Splunk Core Certified Power User or equivalent observability certification.


    What we offer

    • The chance to work with impact. Here, you’re empowered to bring your biggest thinking to help our company and clients improve everyday life.
    • Ownership over your career. Stay at the top of your game through our award-winning learning and development ecosystem. And when your ambitions change or we offer new opportunities, we help you pivot by providing reskilling, on-the-job learning and guidance to find new roles that might be a better fit.
    • The opportunity to thrive on a high caliber team with heart. We celebrate each other’s experiences and perspectives and promote a sense of belonging through our affinity groups and diversity and inclusion initiatives.
    • A comprehensive total rewards package, including a competitive salary and a pension plan with matching contributions.
    • Flexible health and financial benefits to support you and your eligible dependents—from day one.
    • True work-life balance. Be at your best through paid time off, flexible work arrangements, volunteering opportunities, social events, and so much more.  

    About us
    Cognizant (Nasdaq: CTSH) is an AI Builder and technology services provider, building the bridge between AI investment and enterprise value by building full-stack AI solutions for our clients. Our deep industry, process and engineering expertise enables us to build an organization’s unique context into technology systems that amplify human potential, realize tangible returns and keep global enterprises ahead in a fast-changing world. See how at www.cognizant.com or @cognizant.

    Other employment-related information
    Cognizant is an equal opportunity employer. Your application and candidacy will not be considered based on race, color, sex, religion, creed, sexual orientation, gender identity, national origin, disability, genetic information, pregnancy, veteran status or any other characteristic protected by federal, provincial or local laws.

    If you have a disability that requires reasonable accommodation to search for a job opening or submit an application, please email [email protected] with your request and contact information.

    Language requirements vary depending on roles, but we ask that all candidates have basic English proficiency for company-wide communications purposes. For roles based in Quebec, professional English proficiency is required, as you’ll deliver services to and collaborate with stakeholders outside the province who may not speak French.

    Your path to Cognizant

    Wondering what to expect after you apply? Here’s a peek at our recruitment process—and keep in mind that not all candidates advance through every step and the process may vary depending on your role and location.

    Your Application Midnight Blue RGB

    Step 1: Application

    Find an open role that aligns with your skills and career goals and show us why you’re the person for the job. Consider joining our Talent Community if you don’t find the right opportunity.

    Phone Call Midnight Blue RGB

    Step 2: Recruiter call

    If one of our recruiters sees a fit, they’ll set up a short introductory call to learn more about you and how your experiences and skills align with the role.

    Step 3: Interview(s)

    If you and our team would like to continue the process, you’ll meet with one of our hiring managers. Some roles may also require technical assessments and/or client interviews.

    Step 4: Final decision

    Our hiring team will then review each candidates’ potential to succeed in the role. This process may take some time because we want to get it right—but you can count on us to keep you updated.

    Benefits that help you thrive and grow

    Our teams achieve incredible things when they feel fully supported. That’s why our benefits program is built around the diverse needs of our people—so they can enjoy a fulfilling, balanced and healthy life.

    Untitled Design 49
    Financial wellbeing

    Financial wellbeing

    We regularly review market data to ensure compensation is competitive and reflects the value you bring. Your benefits extend beyond pay and may include retirement plans, financial education, discount programs, etc.

    1 (1)
    Physical and mental wellbeing

    Physical and mental wellbeing

    We empower you to prioritize your wellbeing through paid time off, flexible working where possible, healthcare plans, counselling, our Mental Health Allyship program and more. 

    Your Career, Your Way
    Your career, your way

    Your career, your way

    With 90% of our associates building skills through GenAI training, job shadowing, industry certifications and more, you have everything you need to build a full career.

    Professionals
    Real-world impact

    Real-world impact

    Think about the biggest brands you rely on. Chances are, they rely on us to help strengthen their business. Here, you’ll turn bold ideas into solutions that improve lives everywhere.

    Haven't yet found the right opportunity?

    Receive the latest updates on job opportunities, recruitment events and company news—tailored just for you!

    Get the latest updates