W&B: OPTIMIZING LLM APPS AT SCALE
LLM-based system evaluation, and our work on French and German LLM leaderboards for LLM evaluation.
For LLM-based system evaluation, we will focus on Wandbot, detailing our extensive configuration search across different models, embedding techniques, and settings to pinpoint the optimal RAG configurations. This part will also highlight how tools like Pydantic and LiteLLM aid in crafting production-ready systems, showcasing Wandbot's advancements in performance and functionality.
Registration link: