I’ve recently jumped in to understanding DSPy. Their tag line is “Programming—not prompting—Foundation Models”. My first impression is that it is a solid framework for coordinating LLM workflows in a completely new way. And the tag line, so far, “feels” very right!
These are some resources that helped me get an initial impression. Many of the docs are very clear and easy to follow. I’ll try to keep this list updated as I go:
DSPy Links
DSPy Github Repo - the readme is really great, don’t skip reading it :)
Intro Notebook - solid intro to concepts
DSPy - Compiling Declarative Language Model Calls into Self-Improving Pipelines (Oct 2023)
DSPy Assertions - Computational Constraints for Self-Refining Language Model Pipelines (Dec 2023)
In-Context Learning for Extreme Multi-Label Classification - really cool application of DSPy on extreme multi-label classification with thousands of classes, authors achieved SOTA results on several benchmarks…this was the result that got me initially interested in DSPy (Jan 2024)
Compiling for tricky tasks - very interesting notebook
DSPy Guides - looking forward to getting to these at some point
Announcing DSPy - tweet
DSPy paper reviewer comments - interesting read and discussion
Redefining Language Model interactions through DSPy - blog gives an overview
Introducing Assertions in DSPy - blog gives intro to concepts covered in the assertions paper
DSPy Docs - @lateinteraction provided the link to these docs, they’re also really good and thorough!
DSPy walkthrough by Kanav - initial insights from reading the paper and playing around with the tutorials
DSPy one-page cheat sheet - Dataloaders, Evaluation, Optimizers, Assertions and more
DSPy Videos
DSPy - solid intro video
DSPy Explained - by Connor Shorten, recommended
Omar talking about DSPy on ML Ops Podcast - ML Ops is great, Austin conference was 🔥
DSPy and ColBERT with Omar Khattab - Weaviate Podcast
Karel D’Oosterlinck discussing Extreme Multi-Label Classification - Weaviate Podcast