LlamaIndex Sessions: Practical Tips and Tricks for Productionizing RAG (feat. Sisil @ Jasper)

LlamaIndex February 18, 2024

About

Official YouTube Channel for LlamaIndex - the platform to build document agents

Video Description

In this video, Sisil Mehta (ML eng @ Jasper) walks through practical tips and tricks that his team implemented for productionizing a RAG system at Jasper.ai, backed by LlamaIndex abstractions. These tricks include the following:

1. Picking a proper PDF parser that can maintain semantic structure, parse text from tables/images, and represent the output as XML or Markdown.
2. Adding the right "layers" of metadata: besides global document context, also inject summary context from "sub-documents" to localize context more precisely.
3. Hybrid fusion between different retrieval methods.
4. LLM-powered reranking. Reduce token usage by reranking summaries that reference the underlying chunks rather than the chunks themselves.
5. Using XML and emotion prompting to get well-structured outputs free of hallucinations.
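
To make the hybrid fusion tip concrete, here is a minimal sketch of one common way to merge ranked results from two retrievers: reciprocal rank fusion (RRF). The description does not say which fusion method the Jasper team used, so RRF is an assumption here, as are the function name `reciprocal_rank_fusion`, the constant `k=60`, and the example chunk IDs. It is written in plain Python and does not use any LlamaIndex API.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of chunk IDs into one ranked list.

    Each chunk scores 1 / (k + rank) for every list it appears in,
    where rank starts at 1. `k` dampens the influence of top ranks;
    60 is a commonly used default, not a value taken from the video.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by chunk ID for determinism.
    return sorted(scores, key=lambda chunk_id: (-scores[chunk_id], chunk_id))


# Hypothetical results from a keyword retriever and a vector retriever.
keyword_hits = ["chunk_a", "chunk_b", "chunk_c"]
vector_hits = ["chunk_b", "chunk_d", "chunk_a"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)
```

Chunks that both retrievers return rise to the top, while chunks found by only one retriever are kept but ranked lower. Because RRF uses only rank positions, it needs no score normalization between retrievers whose raw scores are on different scales.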