Site Reliability Engineer

Site Reliability Engineer

Link Group

Warszawa
Poznań
Olsztyn
Białystok
Kraków
Szczecin
Lublin
Gdańsk
Wrocław
Rzeszów
Praca zdalna
Site Reliability Engineering
Azure DevOps
Kubernetes
Datadog
CI/CD
incident management
AI
root cause analysis
monitoring
performance optimization

Hexjobs Insights

Poszukujemy Senior Site Reliability Engineer, odpowiedzialnego za niezawodność aplikacji AI. Wymagana znajomość Azure, Kubernetes, oraz doświadczenie w zarządzaniu incydentami.

Słowa kluczowe

Site Reliability Engineering
Azure DevOps
Kubernetes
Datadog
CI/CD
incident management
AI
root cause analysis
monitoring
performance optimization

About the RoleWe are looking for a Senior Site Reliability Engineer who will take end-to-end ownership of reliability for AI-driven applications and pipelines. This is a hands-on engineering role, not a coordination or ticket-driven position. The ideal candidate actively diagnoses, resolves, and automates production issues rather than only designing solutions.Requirements5+ years as SRE / Production / Platform EngineerStrong incident management & RCA experienceHands-on with: Azure DevOps, Kubernetes, Datadog, Azure, CI/CDProactive, ownership mindset, self-drivenExperience in production environmentsNice to have: AI/LLM pipelines, GrafanaResponsibilitiesBuild and maintain monitoring, alerting, dashboardsLead incident response & root cause analysisEnsure reliability and performance of AI pipelinesStandardize telemetry (latency, failures, throughput)Optimize CI/CD and release qualityReduce recurring incidents with engineering teams

Wyświetlenia: 1
Opublikowana7 dni temu
Wygasaza 3 miesiące
Źródło
Logo
Logo

Podobne oferty, które mogą Cię zainteresować

Na podstawie "Site Reliability Engineer"

Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.