What is Rejection Sampling?
Rejection sampling (in LLMs) is a data curation technique in which a generator model produces multiple candidate answers for each prompt, and a separate evaluator, such as a reward model, discards the low-quality ones. The remaining high-quality responses are then used for supervised fine-tuning.
Detailed Deep Dive
Rejection sampling is a data filtering technique widely used in alignment training (SFT and RLHF). A source model generates multiple candidate completions for each instruction in a set, and a separate evaluator (or reward model) scores them and filters out low-scoring or incorrect responses. The top-quality examples that remain are saved as a clean demonstration dataset, which lets the model bootstrap its capabilities without additional human labeling of each example.
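The generate, score, and filter loop described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: `rejection_sample`, `toy_generate`, `toy_score`, `n_candidates`, and `threshold` are hypothetical names, the toy generator stands in for an LLM, and the toy scorer stands in for a reward model or correctness checker.

```python
import random
from typing import Callable, Dict, List


def rejection_sample(
    prompts: List[str],
    generate: Callable[[str], str],
    score: Callable[[str, str], float],
    n_candidates: int = 4,
    threshold: float = 0.5,
) -> List[Dict[str, str]]:
    """Sample n_candidates completions per prompt and keep those scoring >= threshold."""
    kept = []
    for prompt in prompts:
        for _ in range(n_candidates):
            completion = generate(prompt)
            if score(prompt, completion) >= threshold:
                kept.append({"prompt": prompt, "completion": completion})
    return kept


# Toy stand-ins (assumptions for illustration, not a real model or reward model).
rng = random.Random(0)


def toy_generate(prompt: str) -> str:
    """Pretend generator: answers 'a+b' prompts, sometimes off by one."""
    a, b = (int(x) for x in prompt.split("+"))
    return str(a + b + rng.choice([-1, 0, 0, 1]))


def toy_score(prompt: str, completion: str) -> float:
    """Pretend evaluator: 1.0 if the sum is correct, else 0.0."""
    a, b = (int(x) for x in prompt.split("+"))
    return 1.0 if int(completion) == a + b else 0.0


# The surviving pairs would form the supervised fine-tuning dataset.
sft_data = rejection_sample(["2+3", "10+7"], toy_generate, toy_score, n_candidates=8)
```

In a real pipeline the threshold and the number of candidates per prompt are tuning choices: a stricter threshold yields cleaner but smaller datasets.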
Frequently Asked Questions
Q: How does rejection sampling improve LLM training?
It filters out responses with poor reasoning steps or incorrect outputs, so the model is fine-tuned only on demonstrations that the evaluator judged to be high-quality and correct.
Q: What is another name for this process?
It is often referred to as best-of-N sampling, because the best of N sampled candidates is kept, or as self-training with selection.
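The best-of-N variant mentioned above keeps the single highest-scoring candidate per prompt instead of applying a fixed threshold. A minimal sketch, assuming a hypothetical `best_of_n` helper and a toy scoring function in place of a real reward model:

```python
from typing import Callable, List


def best_of_n(
    prompt: str,
    candidates: List[str],
    score: Callable[[str, str], float],
) -> str:
    """Return the candidate with the highest score (the first one wins ties)."""
    return max(candidates, key=lambda c: score(prompt, c))


def closeness_score(prompt: str, completion: str) -> float:
    """Toy evaluator (an assumption): higher when the answer is nearer the true sum."""
    a, b = (int(x) for x in prompt.split("+"))
    return -abs(int(completion) - (a + b))


best = best_of_n("2+3", ["4", "5", "6"], closeness_score)
print(best)  # prints 5
```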
Quick Facts
- Category: Model Training
- Key Application: High-quality instruction dataset creation, code correctness filtering, and model bootstrapping