
Generative AI Application Engineer
At I-Next Data, the AI innovation center of Tel Aviv Sourasky (Ichilov) Medical Center, we build and deploy production-ready LLM applications that integrate directly into clinical workflows. Our mission is to become one of the world’s most impactful healthcare AI hubs. We are a 35+ developer team focused on production, within a group of ~45 employees.
Role Overview
We’re seeking a generative AI application engineer for a new GenAI team. You will deploy and serve proprietary language models. In this role, you’ll automate data pipelines, embed evaluation frameworks, and own the observability stack, delivering low-latency, cost-efficient generative AI services that scale from prototype to production.
Key Responsibilities
- Deploy models via APIs, Bedrock, Azure OpenAI or self-hosted LLMs; containerize and monitor (Docker, K8s, LangFuse, Phoenix) with support from our dedicated DevOps team.
- Build user-facing systems (e.g., summarization tools, design assistants…).
- Orchestrate multi-step pipelines: prompt → retrieval (RAG) → generation → evaluation → feedback loop.
- Develop and integrate evaluation frameworks with custom metrics.
- Optimize latency vs accuracy, cost per 1K tokens, and context-window management.
Skills and Experience
- 4+ years of experience in software development and strong Python programming skills.
- Design thinking and problem-solving mindset.
- Frameworks & orchestration: LangChain / LangGraph, Docker/Kubernetes.
- Evaluation & benchmarking: G-Eval, LLM-as-a-Judge.
- Deployment: AWS Bedrock, Azure.
- Work with vector data bases, feature stores.
Similar jobs


