I gave a talk at the 7CTOs Colloquium in Austin on best practices for building Retrieval-Augmented Generation (RAG) pipelines, and on how we’re starting to see a clear architecture emerge for serious LLM-based applications.

Web Development Then, LLMs Now: Forecasting LLM Best Practices by Looking Back – and Ahead

In this talk, I cover:

I also share prompts, techniques, and design tradeoffs we’ve discovered while building real-world LLM applications at Scientist.com.

Thanks to 7CTOs for putting on the event and thank you to Howdy.com for hosting!