Senior Site Reliability Engineer (Praca zdalna)

Hexjobs ATS

Senior Site Reliability Engineer (Praca zdalna)

TQLO SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ

Warszawa

placeholder

B2B, PERMANENT

💼 B2B

PERMANENT

☁️ AWS

🚢 Kubernetes

Terraform

GitHub Actions

Cloudflare

Dynatrace

CI/CD

observability

SRE

🔄 DevOps

Podsumowanie

Poszukiwany Senior Site Reliability Engineer do zdalnej pracy. Odpowiedzialność za infrastrukturę AWS, Kubernetes oraz automatyzację procesów. Wymagana znajomość Terraform, GitHub Actions, oraz monitoringu.

Słowa kluczowe

AWSKubernetesTerraformGitHub ActionsCloudflareDynatraceCI/CDobservabilitySREDevOps

Benefity

•Stabilna, długoterminowa współpraca B2B
•Praca nad projektami o wysokiej skali z realnym wpływem
•100% pracy zdalnej, elastyczne godziny
•Współpraca w dojrzałej kulturze inżynieryjnej
•Dostęp do nowoczesnego stosu technologicznego

Opis stanowiska

Our Client is an international organization developing a modern, highly available digital platform used by millions of users.The project focuses on building and maintaining scalable cloud infrastructure, automating processes, improving reliability, and implementing Site Reliability Engineering (SRE) best practices.We are looking for an experienced Senior Site Reliability Engineer who will take ownership of production environments, enhance observability, and automate the entire application lifecycle.WORK MODE 100% remoteRESPONSIBILITIESDesigning, implementing, and scaling resilient infrastructure in AWS (multiple accounts, production and pre-production environments)Maintaining and evolving Kubernetes (EKS) environments using Helm, ArgoCD, and Terraform, ensuring predictable and auditable deployment processesCollaborating with product and platform teams on SRE best practices (SLIs/SLOs, error budgets, reliability reviews)Building and improving observability using Dynatrace, Grafana, cloud-native metrics, and open-source toolsOptimizing Cloudflare configuration (WAF, cache and routing rules, perimeter security) to improve performance and securityAutomating infrastructure, deployments, and routine tasks using GitHub Actions, Python, and BashParticipating in incident response, leading post-mortems, and turning lessons learned into tangible improvementsREQUIREMENTSMinimum 5 years of experience in an SRE/DevOps role in AWS-based production environments (AWS preferred, Azure acceptable)Strong proficiency with Terraform, Helm, ArgoCD, and GitHub ActionsExcellent knowledge of Kubernetes (EKS) – autoscaling, rollout strategies, troubleshooting, cluster architectureExperience building and maintaining observability pipelines (logs, metrics, traces, SLIs/SLOs, alerting)Ability to design high-availability and fault-tolerant systemsSolid understanding of CI/CD principles and GitOps practicesExperience with Cloudflare (DNS, CDN, WAF, rulesets)Hands-on experience with monitoring tools such as Dynatrace, Prometheus, and GrafanaVery good command of English (collaboration with teams in Europe and the US)Experience in incident response: on-call rotations, RCA, post-mortemsNice to haveExamples of improvements introduced in the areas of SLO/SLI management or alert fatigue reductionContributions to automation or observability toolingExperience leading reliability reviews and promoting a post-mortem cultureInterest in resilience engineering and knowledge sharing within the SRE communityWHY JOIN?Stable, long-term B2B cooperation directly with the end clientWork on high-scale projects with real impact on a platform used by millions of usersFull technical autonomy with real influence over architecture, solutions, and reliability standards100% remote work, flexible hours, and an async-friendly environmentMature engineering culture, partnership-based collaboration, and teamwork with experts from Europe and the USAccess to a modern tech stack: AWS, EKS, Terraform, ArgoCD, Cloudflare, Dynatrace, and cloud-native toolsTQLO Sp. z o.o. – Employment Agency (KRAZ No. 33580)Thank you for all applications. We will contact selected candidates.

Zaloguj się, aby zobaczyć pełny opis oferty

Wyświetlenia: 65

Zgłoś

Opublikowana	5 dni temu
Wygasa	za 20 dni
Rodzaj umowy	B2B, PERMANENT
Źródło

Podobne oferty, które mogą Cię zainteresować

Na podstawie "Senior Site Reliability Engineer"

Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.

Aplikacja mobilna

Zainstaluj aplikację Hexjobs, aby aplikować szybciej i otrzymywać powiadomienia.