Job Title:
Freelance Opportunity: Monitoring & Automation Specialist (Grafana, PromQL, GCP, Loki, Tempo, SNOW)
Company: ThreatXIntel
Location: Lucknow, Uttar pradesh
Created: 2025-11-19
Job Type: Full Time
Job Description:
Company DescriptionThreatXIntel is a startup cyber security company committed to helping businesses and organizations safeguard their digital assets from cyber threats. Our expert team provides tailored and cost-effective solutions, including cloud security, web and mobile security testing, cloud security assessment, and DevSecOps. With a proactive approach to identifying vulnerabilities, we ensure businesses are equipped to protect their critical digital environments. Our mission is to empower organizations of all sizes with high-quality, affordable cyber security services so they can confidently focus on growth and innovation.Role DescriptionWe are seeking a Monitoring & Automation Specialist to work on building and optimizing dashboards, query writing, log management, distributed tracing, and automated workflows. The ideal candidate will have hands-on experience with Grafana, PromQL, GCP Metrics Explorer, Loki, Tempo (OpenTelemetry), and SNOW (ServiceNow) automation for incident management. This role will involve creating scalable monitoring solutions, writing queries to support Service Level Objectives (SLO), and streamlining alerting workflows.Key ResponsibilitiesGrafana Dashboard Creation: Design and implement dashboards to visualize business metrics and system performance.PromQL Query Writing: Write efficient PromQL queries to aggregate metrics, calculate SLOs, and define alerting conditions.GCP Metrics Explorer Setup: Set up monitoring for GCP services and establish alerting policies and escalation procedures.Loki Log Management: Implement structured logging with Loki, manage log correlations, and troubleshoot using logs to identify issues.Tempo Distributed Tracing: Use Tempo (OpenTelemetry) to monitor distributed traces, identify performance bottlenecks, and optimize performance.Automation & Alerts with SNOW: Integrate ServiceNow (SNOW) for incident creation and automate workflows, ensuring timely responses to alerts.Optimize Incident Management: Set up automated escalation procedures, monitor KPIs, and ensure continuous improvement in the monitoring pipeline.Required SkillsGrafana: Expertise in creating and configuring dashboards to visualize business and performance metrics.PromQL: Advanced knowledge in writing queries, aggregating metrics, and setting up alerting conditions.GCP Metrics Explorer: Experience with GCP monitoring setup, alerting, and escalation procedures for cloud infrastructure.Loki: Proficiency in log management, structured logging, and log correlation to diagnose issues effectively.Tempo: Solid experience in distributed tracing with OpenTelemetry, identifying bottlenecks and optimizing system performance.SNOW Automation: Experience in integrating ServiceNow for incident creation, automating workflows, and managing alerting policies.Nice-to-Have SkillsExperience with Kubernetes or containerized environments for monitoring.Prometheus setup and management for metrics aggregation.Knowledge of CI/CD tools like Jenkins or GitLab for automating workflows.Familiarity with other log management or observability tools such as Elasticsearch or Datadog.