Site Reliability Engineer

Site Reliability Engineer

EPAM Systems (Poland) sp. z o.o.

Kraków
Kraków, Lesser Poland
B2B
Site Reliability Engineering
Cloud Solutions
AWS
Azure
GCP
Python
CI/CD
Kubernetes
Monitoring Tools

Hexjobs Insights

Role: Site Reliability Engineer. Responsibilities include implementing SRE practices, designing cloud solutions, troubleshooting, and ensuring system reliability. Requirements: 3+ years experience in SRE, knowledge of cloud platforms, DevOps tools.

Słowa kluczowe

Site Reliability Engineering
Cloud Solutions
AWS
Azure
GCP
Python
CI/CD
Kubernetes
Monitoring Tools

Benefity

  • Flexible schedule and opportunity to work remotely within Poland
  • Outstanding career roadmap
  • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
  • Benefits package (health insurance, multisport, shopping vouchers)
  • Participation in the Employee Stock Purchase Plan

Technologies we use

About the project

Your responsibilities

  • Collaborate with development, security, quality, and operation teams to implement SRE practices and ensure system reliability
  • Define and support required level of reliability, availability, and performance for services and applications
  • Design and deliver Cloud-based solutions tailored to client needs
  • Troubleshoot, mitigate, and support fixing of the infrastructure and application issues in a timely manner
  • Implement a monitoring system for the infrastructure and application reliability
  • Communicate technical concepts clearly to both engineering teams and management stakeholders

Our requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • 3+ years of hands-on experience in Site Reliability Engineering or related roles
  • Proven experience in any cloud (AWS/GCP/Azure)
  • Experience with implementing SRE practices such as SLO/SLI, Error budgets, Postmortems, Reducing Toil, capacity planning, and Incident Management
  • Python or other scripting/programming language
  • Strong background in monitoring tools
  • Proficiency in CI/CD tools, infrastructure as code, and configuration management
  • Solid knowledge of container orchestration technologies (Kubernetes, Docker)
  • English language proficiency at an Upper-Intermediate level (B2) or higher

Optional

  • Expertise in deployment and management of LLMs, including technologies like RAG
  • Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
  • Proven experience in DevOps
  • Knowledge of managing and optimizing AI/ML models in production environments, including basic deployment, monitoring, and maintenance

This is how we work on a project

Development opportunities we offer

What we offer

  • Engineering community of industry professionals
  • Friendly team and enjoyable working environment
  • Flexible schedule and opportunity to work remotely within Poland
  • Chance to work abroad for up to 60 days annually
  • Business-driven relocation opportunities
  • Outstanding career roadmap
  • Leadership development, career advising, soft skills, and well-being programs
  • Certification (GCP, Azure, AWS)
  • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
  • English language classes
  • Stable income (Employment Contract or B2B)
  • Participation in the Employee Stock Purchase Plan
  • Benefits package (health insurance, multisport, shopping vouchers)
  • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
  • Referral bonuses
  • Corporate, social and well-being events

Benefits

Wyświetlenia: 1
Opublikowana4 dni temu
Wygasaza 26 dni
Rodzaj umowyB2B
Źródło
Logo
Logo

Podobne oferty, które mogą Cię zainteresować

Na podstawie "Site Reliability Engineer"

Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.