Job Title:
ITOPS / Observability AI Architect
Company: Themesoft Inc.
Location: Bangalore, Karnataka
Created: 2026-03-25
Job Type: Full Time
Job Description:
Job Title: ITOPS / Observability AI ArchitectJob SummaryWe are seeking a highly experienced IT Operations (ITOPS) and Observability AI Architect tolead the design, development, and implementation of advanced observability and AIOpssolutions for enterprise clients. The ideal candidate should have 10+ years of experience intechnical development and architecture roles, with deep expertise in observabilityplatforms, AIOps tools, cloud-native architectures (Azure/AWS), containerization,orchestration, and automation. This role requires a strong understanding of modernobservability technologies, AI-driven operations, and the ability to architect scalable,intelligent systems that enhance operational efficiency and resilience.Key Responsibilities• Develop end-to-end observability and AIOps architectures for large-scale enterpriseenvironments.• Define standards and best practices for monitoring, alerting, and automatedremediation.• Drive the deployment and integration of observability platforms and AIOps toolsacross hybrid and multi-cloud environments.• Ensure seamless integration with ITSM, DevOps, and CI/CD pipelines.• Evaluate emerging technologies in observability and AIOps to recommend strategicadoption.• Design AI/ML-driven predictive analytics for proactive incident management androot cause analysis.• Work closely with clients, operations, and business teams to align architecture withorganizational goals.• Mentor technical teams on observability and AIOps best practices.• Optimize system performance through advanced telemetry, distributed tracing, andanomaly detection.• Implement automated workflows for incident prevention and resolution.Experience & Qualifications• MCA, B.E degree in Computer Science, or related field.• 10+ years in IT architecture or technical leadership roles.• Proven expertise in observability tools (e.g., Dynatrace, Datadog, New Relic,Prometheus, Grafana) and AIOps platforms (e.g., Moogsoft, BigPanda, ServiceNowAIOps).• Strong experience with Azure/AWS cloud architectures, containerization (Docker),and orchestration (Kubernetes).• Hands-on experience with automation frameworks and infrastructure-as-code(Terraform, Ansible).• Hands-on experience in IT operations preferably with IT infrastructure andapplications servicesSkills:• Deep understanding of monitoring, logging, distributed tracing, and telemetry.• Knowledge of AI/ML concepts applied to IT operations.• Excellent problem-solving, communication, and leadership skills• Good understanding and exposure to ITIL frameworksPreferred:• Certifications in cloud platforms (AWS/Azure), Kubernetes, or observability tools.• Experience in designing self-healing systems and predictive analytics for IToperations.