Observability Team Lead
- רמת גן
- הכשרה
- משרה מלאה
- Lead the design, implementation, and maintenance of our observability infrastructure which manages billions of observability events (logs, traces, metrics) per day.
- Set vision & roadmap - Define plans for observability for the entire company to be aligned with our production strategy.
- Own the platform end-to-end - Operate and evolve Datadog, Coralogix, OpenTelemetry, PagerDuty and more while keeping initiatives for internal systems.
- Enable R&D teams - Provide auto-discovered dashboards, golden-signal templates, and tooling so every service ships with standard monitoring from day one
- Champion best practices - Run internal workshops, publish monthly insights, and contribute to monday's R&D.
- 5+ years building or operating large-scale observability / SRE platforms, including 3+ years in a leadership role.
- Deep hands-on experience with Datadog (or similar), distributed tracing, log pipelines, and observability tooling.
- Familiarity with Kubernetes, and microservice architectures (cells or multi-cluster experience a plus).
- Excellent communication skills. Able to translate dashboards into stories that matter to engineers and executives alike.
- Software-engineering background. Go/Python/TypeScript and IaC familiarity (Terraform/CDK).
- ניהול תש...
Mploy