T Hub - AI Expert

T-Mobile

Warszawa, Masovian
Hybrid
RAG systems
LLMs
Python
PyTorch
Hugging Face Transformers
Docker
Kubernetes
vector databases
Linux
MLOps

Hexjobs Insights

T-Mobile seeks an AI Expert to develop RAG systems, deploy LLM solutions, optimize on-prem infrastructure, and collaborate across teams. Requires 3+ years in ML/NLP roles, including 2+ years focused on RAG systems.

Benefits

  • Dynamic environment with diverse projects.
  • Opportunity to work with cutting-edge technologies.
  • Collaborative team setup supporting personal growth.
  • Room for individual technological exploration.

Your responsibilities

  • RAG System Development:
      • Architect and deploy end-to-end RAG pipelines, combining retrieval mechanisms (e.g., vector databases such as Qdrant) with generative models for enterprise use cases.
      • Fine-tune and optimize retrieval models to ensure high accuracy and low latency in on-prem environments.
  • Model Integration & Deployment:
      • Implement and customize inference servers using vLLM for efficient LLM serving and LiteLLM for lightweight model orchestration.
      • Integrate open-source LLMs with proprietary data sources and APIs.
  • On-Prem Infrastructure Management:
      • Design GPU-optimized, scalable infrastructure for LLM training and inference, ensuring compliance with security and data governance policies.
      • Collaborate with DevOps teams to containerize workflows using Docker/Kubernetes and automate MLOps pipelines.
  • Performance Optimization:
      • Apply techniques like quantization, pruning, and dynamic batching to maximize efficiency in resource-constrained on-prem setups.
      • Monitor system performance, troubleshoot bottlenecks, and ensure high availability.
  • Cross-Functional Collaboration:
      • Partner with data engineers to curate and preprocess domain-specific datasets for retrieval and generation tasks.
      • Translate business requirements into technical solutions for stakeholders in telco environments.

Our requirements

  • Bachelor’s, Master’s, or PhD degree in Computer Science, AI, or a related field.
  • 3+ years in ML/NLP roles, with 2+ years focused on RAG systems.
  • Proven experience deploying LLMs in on-prem or hybrid environments.
  • Proficiency with vLLM, LiteLLM, and open-source LLMs (e.g., Llama 3.2, DeepSeek, Mistral).
  • Strong Python expertise with frameworks like PyTorch, Hugging Face Transformers, and LangChain.
  • Experience with vector databases (e.g., Qdrant).
  • Familiarity with Linux-based systems and Red Hat OpenShift.
  • Ability to communicate complex AI concepts to non-technical stakeholders.
  • Strong problem-solving skills and adaptability in fast-paced environments.

What we offer

  • A dynamic environment where you’ll lead your own contributions across diverse projects.
  • Opportunity to become an expert in cutting-edge technologies such as conversational AI platforms and VoIP solutions.
  • A collaborative team setup that supports your growth in a customer-facing technical consulting role.
  • Room for individual technological exploration while shaping innovative enterprise solutions.

Views: 4
Published: 14 days ago
Expires: in 16 days
Work mode: Hybrid