Skip to main content

Lead Infra. Architect

00066127211


Job Summary

15+ years of experience in DevOps Infrastructure Automation and Kubernetes administration.

Proven leadership in managing on-prem container orchestration platforms at scale.

Architectural understanding of microservices distributed systems and secure automation frameworks.

Deep expertise in Docker Kubernetes OpenShift and CI/CD tooling.

Experience with Helm GitOps and secure credential management.

Strong proficiency in Linux administration Shell scripting and Python.


Responsibilities

Kubernetes Cluster Leadership

Architect administer and scale enterprise-grade Kubernetes clusters in on-prem datacentre.

  • Lead cluster lifecycle management: provisioning upgrades patching node pools and capacity planning.

Define and enforce multi-tenant governance using RBAC network policies Pod-Security Policies and Namespaces.

Implement and optimize Ingress controllers service meshes and API gateways for secure traffic routing.

Establish high availability disaster recovery and backup strategies for cluster components and workloads.

Drive root cause analysis and resolution of complex cluster-level issues.

Containerization & Orchestration Strategy

Oversee containerization standards using Docker Compose and private registries.

Lead deployment and orchestration of microservices via Kubernetes Helm.

Define resource optimization strategies including autoscaling affinity rules and quota enforcement.

CI/CD Architecture

Architect and govern CI/CD pipelines

Standardize build and release processes across diverse tech stacks

Design reusable pipeline frameworks and automation templates for rapid onboarding and delivery.

Integrate CI/CD with Kubernetes for seamless rollout rollback and canary deployments.

AI Workflow Enablement (ClearML)

Lead integration of ClearML for experiment tracking model versioning and pipeline orchestration.

Collaborate with AI/ML teams to containerize models and automate GPU job scheduling.

Build and maintain custom ClearML agents and workflows for reproducible experimentation and deployment.

Scripting & Tooling

Develop robust automation scripts in Shell Python

Build internal tools and dashboards to enhance infrastructure observability and operational efficiency.

Understanding of NIM services CUDA frameworks & libraries/models from OpenAI/Huggingface are good-to-have from Infrastructure perspective.


Certifications Required

as applicable


About us
Cognizant (Nasdaq: CTSH) is an AI Builder and technology services provider, building the bridge between AI investment and enterprise value by building full-stack AI solutions for our clients. Our deep industry, process and engineering expertise enables us to build an organization’s unique context into technology systems that amplify human potential, realize tangible returns and keep global enterprises ahead in a fast-changing world. See how at www.cognizant.com or @cognizant.

Additional employment information
Compensation information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

Language requirements vary depending on roles, but we ask that all candidates have basic English proficiency for company-wide communications purposes. For roles based in Quebec, professional English proficiency is required, as you’ll deliver services to and collaborate with stakeholders outside the province who may not speak French.

Cognizant is an equal opportunity employer. Your application and candidacy will not be considered based on race, color, sex, religion, creed, sexual orientation, gender identity, national origin, disability, genetic information, pregnancy, veteran status or any other characteristic protected by federal, provincial or local laws.

If you have a disability that requires reasonable accommodation to search for a job opening or submit an application, please email [email protected] with your request and contact information.

Benefits that help you thrive and grow

Our benefits program is built with you in mind—so you can enjoy a fulfilling, balanced and healthy life.

a blue line drawing of a plant with leaves

Financial wellbeing

We regularly review market data to ensure compensation reflects the value you bring. Your benefits extend beyond pay and may include financial education, a pension plan with matching contributions, etc.

Stay Healthy Midnight Blue RGB

Physical and mental health

We empower you to prioritize your wellbeing through paid time off, flexible working where possible, healthcare plans, counselling, our Mental Health Allyship program and more. 

Build The Career You Want Midnight Blue RGB

Your career, your way

With 350,000+ roles at Cognizant, you’ll have opportunities explore new technologies, industries and locations—and build the skills you need to grow your career.

Making A Meaningful Impact Midnight Blue RGB

Real-world impact

Think about the biggest brands you rely on. Chances are, they rely on us to help strengthen their business. Here, you’ll turn bold ideas into solutions that improve lives everywhere.

Haven't yet found the right opportunity?

Get the latest updates on job opportunities, recruitment events and company news—tailored just for you!

Be in the know