Skill: APM, RUM, Logs, Metrics, Synthetics, Dashboards, SLOs, Service Maps
Exp: 6 to 12Yrs
Location: Gurgaon
Work Mode: 5 days WFO
Notice Period: Immediate (Feb joiners)
JD:
3+ years hands on in Datadog
– Datadog fundamentals
– Basic monitoring and dashboards
– Alerting basics
– SLOs, Service Maps
– exposure towards Kubernetes-based microservice monitoring
– Datadog fundamentals
– Basic monitoring and dashboards
– Alerting basics
– SLOs, Service Maps
– exposure towards Kubernetes-based microservice monitoring
– Integration setup
– Exposure to Dynatrace, New Relic, SolarWinds, Cloud Native tools (not must have)
Key Responsibilities : Platform Ownership: Design, implement, and manage the Observability Platform (Datadog primary; Dynatrace/New Relic secondary). End-to-End Observability: Configure and maintain APM, RUM, Logs, Metrics, and Synthetics for distributed systems and APIs. Root Cause Analysis (RCA): Lead RCA for high latency, error spikes, and resource utilization issues across microservices and strong problem-solving and analytical skills Automation & Integration: Integrate observability tools with ServiceNow, PagerDuty, Slack, and CI/CD pipelines for automated alerting and incident workflows. Dashboarding & Reporting: Develop unified dashboards and executive reports for KPIs, SLOs, and system-health. Tagging Governance: Define and enforce consistent tagging, naming, and cost-allocation practices across environments

contactus@briskwinit.com