A peer recently inspired me to see how quickly I could build a RAG (Retrieval Augmented Generation) system to implement a standard LLM chat interface with responses limited to content from a collection of private technical documents.
As luck would have it, an older TalkPython podcast had introduced me to