Job Summary
Serve as an Infra Dev Specialist with a primary focus on LogicMonitor and Splunk to design, implement and optimize enterprise monitoring and observability solutions for a global organization in a hybrid work model. Collaborate with cross-functional teams to enhance platform reliability, automate workflows and deliver actionable insights that improve service uptime and operational excellence, working in rotational shifts.
Responsibilities
- Design and implement scalable monitoring solutions using LogicMonitor to ensure high availability and performance of critical infrastructure across data center and cloud environments.
- Configure, maintain and optimize Splunk-based observability, including data onboarding, dashboards and alerts, to provide timely and actionable insights for incident response and problem management.
- Develop automation scripts and reusable components for monitoring configuration, deployment and maintenance to reduce manual effort and improve consistency across environments.
- Collaborate with application, infrastructure and security teams to define monitoring requirements and translate them into LogicMonitor and Splunk configurations that align with enterprise standards.
- Troubleshoot complex issues in monitoring pipelines, including data collection, alert noise and dashboard accuracy, while driving sustainable fixes and improvements.
- Implement and refine alerting strategies to minimize false positives and ensure rapid detection of real incidents, thereby contributing to improved mean time to detect and mean time to resolve.
- Document monitoring architectures, runbooks and standard operating procedures in clear and comprehensive form to support knowledge sharing and operational continuity across rotational shifts.
- Coordinate with service owners during platform changes and releases to validate monitoring coverage and ensure that new services are onboarded into LogicMonitor and Splunk in a timely manner.
- Analyze trends in infrastructure performance and event data to identify capacity risks, reliability gaps and opportunities for optimization that support business continuity and cost efficiency.
- Support compliance and audit activities by ensuring that monitoring configurations, logs and dashboards adhere to internal policies and external regulatory expectations.
- Work in a hybrid and rotational shift model to provide continuous coverage for monitoring operations and incident support while collaborating effectively across time zones and regions.
- Contribute to continuous improvement initiatives by evaluating new features of LogicMonitor and Splunk, proposing enhancements and driving small proof-of-concept efforts that deliver measurable value.
- Engage with vendor support and internal stakeholders to resolve advanced product issues and to stay aligned with recommended practices for LogicMonitor and Splunk deployment and usage.
Qualifications
- Possess seven to eight years of hands-on experience in infrastructure monitoring, with strong and demonstrable expertise in implementing and administering LogicMonitor in enterprise environments.
- Demonstrate advanced proficiency in Splunk, including data ingestion, configuration of forwarders, creation of searches, dashboards and alerts, and use of Splunk apps relevant to infrastructure observability.
- Apply a solid understanding of network, server, database and cloud infrastructure concepts to design meaningful metrics and alerts that reflect real service health and user experience.
- Utilize scripting experience in areas such as PowerShell or Python to automate monitoring tasks, integrate external systems and streamline routine operational activities.
- Exhibit strong analytical and problem-solving skills, with the ability to interpret large volumes of monitoring data and logs to derive clear root causes and propose effective remediation options.
- Communicate clearly in verbal and written form to collaborate with distributed teams, document procedures and influence stakeholders on monitoring best practices without formal authority.
- Adapt effectively to a hybrid work model and rotational shifts while maintaining high standards of reliability, accountability and focus on service level objectives.
Certifications Required
Preferred certifications include LogicMonitor Certified Professional and Splunk Certified Power User or Splunk Certified Administrator.
The Cognizant community:
We are a high caliber team who appreciate and support one another. Our people uphold an energetic, collaborative and inclusive workplace where everyone can thrive.
- Cognizant is a global community with more than 300,000 associates around the world.
- We don’t just dream of a better way – we make it happen.
- We take care of our people, clients, company, communities and climate by doing what’s right.
- We foster an innovative environment where you can build the career path that’s right for you.
About us:
Cognizant (Nasdaq: CTSH) is an AI Builder and technology services provider, building the bridge between AI investment and enterprise value by building full-stack AI solutions for our clients. Our deep industry, process and engineering expertise enables us to build an organization’s unique context into technology systems that amplify human potential, realize tangible returns and keep global enterprises ahead in a fast-changing world. See how at www.cognizant.com or @cognizant.
Cognizant is an equal opportunity employer. Your application and candidacy will not be considered based on race, color, sex, religion, creed, sexual orientation, gender identity, national origin, disability, genetic information, pregnancy, veteran status or any other characteristic protected by federal, state or local laws.
Disclaimer:
Compensation information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.
Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview.