Generative AI Application Engineer

At I-Next Data, the AI innovation center of Tel Aviv Sourasky (Ichilov) Medical Center, we build and deploy production-ready LLM applications that integrate directly into clinical workflows. Our mission is to become one of the world’s most impactful healthcare AI hubs. We are a 35+ developer team focused on production, within a group of ~45 employees.

Role Overview

We’re seeking a generative AI application engineer for a new GenAI team. You will deploy and serve proprietary language models. In this role, you’ll automate data pipelines, embed evaluation frameworks, and own the observability stack, delivering low-latency, cost-efficient generative AI services that scale from prototype to production.

Key Responsibilities

Deploy models via APIs, Bedrock, Azure OpenAI or self-hosted LLMs; containerize and monitor (Docker, K8s, LangFuse, Phoenix) with support from our dedicated DevOps team.
Build user-facing systems (e.g., summarization tools, design assistants…).
Orchestrate multi-step pipelines: prompt → retrieval (RAG) → generation → evaluation → feedback loop.
Develop and integrate evaluation frameworks with custom metrics.
Optimize latency vs accuracy, cost per 1K tokens, and context-window management.

Skills and Experience

4+ years of experience in software development and strong Python programming skills.
Design thinking and problem-solving mindset.
Frameworks & orchestration: LangChain / LangGraph, Docker/Kubernetes.
Evaluation & benchmarking: G-Eval, LLM-as-a-Judge.
Deployment: AWS Bedrock, Azure.
Work with vector data bases, feature stores.

Apply for job

Similar jobs

I-NEXT Data

Generative AI Application Engineer

Staff Backend Engineer- Agent Experience

Senior Backend Engineer – AI & LLM Infrastructure

Data Engineer

I-NEXT Data