
Site Reliability Engineer
EPAM Systems (Poland) sp. z o.o.
Kraków
Kraków, Lesser Poland
B2B
Site Reliability Engineering
Cloud Solutions
AWS
Azure
GCP
Python
CI/CD
Kubernetes
Monitoring Tools
Status
Hexjobs Insights
Role: Site Reliability Engineer. Responsibilities include implementing SRE practices, designing cloud solutions, troubleshooting, and ensuring system reliability. Requirements: 3+ years experience in SRE, knowledge of cloud platforms, DevOps tools.
Słowa kluczowe
Site Reliability Engineering
Cloud Solutions
AWS
Azure
GCP
Python
CI/CD
Kubernetes
Monitoring Tools
Benefity
- Flexible schedule and opportunity to work remotely within Poland
- Outstanding career roadmap
- Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
- Benefits package (health insurance, multisport, shopping vouchers)
- Participation in the Employee Stock Purchase Plan
Technologies we use
About the project
Your responsibilities
- Collaborate with development, security, quality, and operation teams to implement SRE practices and ensure system reliability
- Define and support required level of reliability, availability, and performance for services and applications
- Design and deliver Cloud-based solutions tailored to client needs
- Troubleshoot, mitigate, and support fixing of the infrastructure and application issues in a timely manner
- Implement a monitoring system for the infrastructure and application reliability
- Communicate technical concepts clearly to both engineering teams and management stakeholders
Our requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field
- 3+ years of hands-on experience in Site Reliability Engineering or related roles
- Proven experience in any cloud (AWS/GCP/Azure)
- Experience with implementing SRE practices such as SLO/SLI, Error budgets, Postmortems, Reducing Toil, capacity planning, and Incident Management
- Python or other scripting/programming language
- Strong background in monitoring tools
- Proficiency in CI/CD tools, infrastructure as code, and configuration management
- Solid knowledge of container orchestration technologies (Kubernetes, Docker)
- English language proficiency at an Upper-Intermediate level (B2) or higher
Optional
- Expertise in deployment and management of LLMs, including technologies like RAG
- Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
- Proven experience in DevOps
- Knowledge of managing and optimizing AI/ML models in production environments, including basic deployment, monitoring, and maintenance
This is how we work on a project
Development opportunities we offer
What we offer
- Engineering community of industry professionals
- Friendly team and enjoyable working environment
- Flexible schedule and opportunity to work remotely within Poland
- Chance to work abroad for up to 60 days annually
- Business-driven relocation opportunities
- Outstanding career roadmap
- Leadership development, career advising, soft skills, and well-being programs
- Certification (GCP, Azure, AWS)
- Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
- English language classes
- Stable income (Employment Contract or B2B)
- Participation in the Employee Stock Purchase Plan
- Benefits package (health insurance, multisport, shopping vouchers)
- Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
- Referral bonuses
- Corporate, social and well-being events
Benefits
Wyświetlenia: 1
| Opublikowana | 4 dni temu |
| Wygasa | za 26 dni |
| Rodzaj umowy | B2B |
| Źródło |
Podobne oferty, które mogą Cię zainteresować
Na podstawie "Site Reliability Engineer"
Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.