close menu

Generative AI Application Engineer

At I-Next Data, the AI innovation center of Tel Aviv Sourasky (Ichilov) Medical Center, we build and deploy production-ready LLM applications that integrate directly into clinical workflows. Our mission is to become one of the world’s most impactful healthcare AI hubs. We are a 35+ developer team focused on production, within a group of ~45 employees.

Role Overview

We’re seeking a generative AI application engineer for a new GenAI team. You will deploy and serve proprietary language models. In this role, you’ll automate data pipelines, embed evaluation frameworks, and own the observability stack, delivering low-latency, cost-efficient generative AI services that scale from prototype to production.

Key Responsibilities

  • Deploy models via APIs, Bedrock, Azure OpenAI or self-hosted LLMs; containerize and monitor (Docker, K8s, LangFuse, Phoenix) with support from our dedicated DevOps team.
  • Build user-facing systems (e.g., summarization tools, design assistants…).
  • Orchestrate multi-step pipelines: prompt → retrieval (RAG) → generation → evaluation → feedback loop.
  • Develop and integrate evaluation frameworks with custom metrics.
  • Optimize latency vs accuracy, cost per 1K tokens, and context-window management.

Skills and Experience

  • 4+ years of experience in software development and strong Python programming skills.
  • Design thinking and problem-solving mindset.
  • Frameworks & orchestration: LangChain / LangGraph, Docker/Kubernetes.
  • Evaluation & benchmarking: G-Eval, LLM-as-a-Judge.
  • Deployment: AWS Bedrock, Azure.
  • Work with vector data bases, feature stores.

 

Similar jobs