Recipes
Playbooks
Multi-step prompt recipes for specific outcomes. Each playbook delivers something concrete -- a RAG eval harness, an injection test suite, a code-review loop -- with the full prompt chain, failure modes, and adjacent variations.
2 playbooks
Techniques
Build a Classification Prompt That Beats Fine-Tuning
A production-ready classification prompt for your own labels, with self-consistency voting, structured output, and an eval harness -- typically reaches 90%+ accuracy without any fine-tuning.
Development
Token Budget & Cost Estimator for Production LLM Apps
A pre-flight cost calculator that tells you -- before you call -- how many tokens a given prompt + expected response will burn, across multiple models, so your app can choose the cheapest model that fits the window.
