Job Title:
Grafana Enterprise Engineer
Company: Aptimized
Location: Udaipur, Rajasthan
Created: 2025-11-20
Job Type: Full Time
Job Description:
Job Title: Grafana Enterprise Engineer / Observability EngineerLocation: Hyderabad (Office-based)Experience: 5–10 yearsEmployment Type: Full-timeOverviewWe are looking for a highly skilled Grafana Enterprise Engineer with strong experience in observability, monitoring, performance optimization, and dashboarding across large-scale distributed systems. The ideal candidate will have deep hands-on expertise with Grafana Enterprise, Prometheus, Loki, Tempo, and other observability tools.Key ResponsibilitiesGrafana Enterprise AdministrationInstall, configure, and manage Grafana Enterprise environments (on-prem / cloud).Manage users, roles, permissions, and Grafana Enterprise features such as:Enterprise pluginsReportingSSO integrationsAlerting & Incident managementOptimize performance of Grafana back-end services.Observability Stack ManagementDeploy, manage, and scale:Prometheus / MimirLoki (log aggregation)Tempo (tracing)AlertmanagerDevelop scraping strategies, retention policies, sharding, and federation setups.Integrate data sources including InfluxDB, Elasticsearch, CloudWatch, Azure Monitor, and others.Dashboards & AlertsArchitect and create advanced Grafana dashboards with templating, variables, and drill-down capabilities.Build actionable alerting rules, automate alert routing/notifications, and reduce noise.Work with teams to define SLIs/SLOs, performance metrics, and monitoring standards.Automation & SRE PracticesImplement automation using Terraform, Helm, Ansible, or similar tools.Develop monitoring-as-code templates for scalable deployments.Participate in SRE practices including:Incident responseRoot cause analysis (RCA)Performance tuningCapacity planningCollaborationWork closely with application, DevOps, and cloud teams to onboard services into the monitoring ecosystem.Train internal teams on dashboard usage, alerting, and observability best practices.Required Skills3+ years hands-on Grafana Enterprise experience.Strong expertise in Prometheus, Loki, Tempo, or similar observability tools.Strong skills in Dashboards, Alerting, Metrics, Logs.Experience building and supporting high-availability observability platforms.Strong Linux and scripting skills (Shell, Python).Experience with Docker, Kubernetes, CI/CD tools.Knowledge of cloud platforms: AWS / Azure / GCP.Experience with SSO, LDAP, OAuth, or SAML integrations.Preferred SkillsExperience with Grafana Mimir, Grafana Cloud, or Enterprise Metrics.Exposure to log pipelines like Promtail, Fluentd, Fluent Bit, Vector.Infrastructure-as-Code (Terraform) experience.Certification in Cloud or Observability tools.