
Senior Data Scientist /AI Engineer (RL)
ACAISOFT POLAND Sp. z o.o.
25200 - 38640 PLN / HOUR
Warszawa
Warszawa, Masovian
B2B
Python
AI frameworks
Langchain
Langraph
mcp-server
Machine Learning
Data Science
Reinforcement Learning
reward modeling
Status
Hexjobs Insights
Senior Data Scientist/AI Engineer role in Warszawa. Responsibilities include RL environments design, task generation pipelines, API design, and model evaluation. Requires 5+ years in Python and 3+ in ML.
Słowa kluczowe
Python
AI frameworks
Langchain
Langraph
mcp-server
Machine Learning
Data Science
Reinforcement Learning
reward modeling
Technologies we use
About the project
Your responsibilities
- Design and implement RL environments that support large-scale agent evaluation and reinforcement learning experiments.
- Build task generation pipelines, dynamic datasets, and scripted environments with controlled complexity and stochasticity.
- Develop verifiers and reward models to automatically score trajectories and evaluate model reasoning.
- Collaborate with infrastructure and systems engineers to ensure environments are scalable, reproducible, and instrumented for detailed telemetry.
- Design APIs and orchestration frameworks for running, resetting, and evaluating agents across environments.
- Optimize environment performance, logging, and reward reproducibility across distributed setups.
Our requirements
- 5+ years of experience in Python software engineering.
- Minimum 3 years in a Data Scientist, Machine Learning/Environment Engineering position.
- Being able to work 2 p.m. - 10 p.m.
- Practical knowledge of AI frameworks (Langchain, Langraph, mcp-server ).
- Extensive practical experience in working with AI, including prompt engineering and vibe coding.
Optional
- Knowledge of Codex or Claude Code.
- Experience in integrating AI with a system would be an advantage.
- Understanding of RL concepts - reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
- Familiarity with instrumentation, metrics, and data pipelines for RL evaluation.
- Expertise in planning your own work.
This is how we organize our work
This is how we work
Benefits
Wyświetlenia: 3
| Opublikowana | 22 dni temu |
| Wygasa | za 8 dni |
| Rodzaj umowy | B2B |
| Źródło |
Podobne oferty, które mogą Cię zainteresować
Na podstawie "Senior Data Scientist /AI Engineer (RL)"
Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.