NAVIGATION

What is Rejection Sampling?

Definition

Rejection Sampling

Rejection Sampling (in LLMs) is a data curation technique where a generator model produces multiple candidate answers, and a separate evaluator model filters out low-quality outputs. The remaining high-quality responses are then used for supervised fine-tuning.

Detailed Deep Dive

Rejection sampling is a dataset purification technique widely used in alignment training (SFT and RLHF). A source model generates multiple candidate completions for a set of instructions, and a separate evaluator (or reward model) filters out low-scoring or incorrect responses. The remaining top-quality examples are saved to form clean demonstration datasets, bootstrapping model capabilities without human labeling.

Frequently Asked Questions

Q:How does rejection sampling improve LLM training?

It filters out poor reasoning steps or incorrect outputs, ensuring the model only learns from high-quality, correct demonstrations.

Q:What is another name for this process?

It is often referred to as best-of-N sampling or self-training with selection.

Quick Facts

  • CategoryModel Training
  • Key ApplicationHigh-quality instruction dataset creation, code correctness filtering, and model bootstrapping

Coverage Trend12 Weeks

12w agoToday

Rejection Sampling Media Coverage & Intelligence

No Direct Rejection Sampling News Today

We currently have no direct coverage articles matching "Rejection Sampling" in the database archive. Explore trending global AI topics below instead.

Trending AI Stories

VentureBeatJul 1, 2026

The Control Gap: Enterprise AI organizations have an ownership problem, not a technology problem - and most are governing it by hand

AI portfolios are expanding far faster than the ability to govern them across enterprises. Most organizations run a contested field of platforms, each claiming

TechCrunch AIJul 1, 2026

SpaceX has an AI device prototype, and it sure sounds phone-ish

SpaceX reportedly showed investors a "handset-like" AI device before going public. It could be another signal SpaceX wants to expand into wireless.

TechCrunch AIJul 1, 2026

Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller

The actor and investor is joining forces with Morgan Beller, who was previously a GP at NFX, to invest in early-stage startups.

WiredJul 1, 2026

You Can Now Sound the Alarm on AI Behaving Badly

Are you worried your AI chatbot is trying to build a bomb or leak personal information about you? There's a website for that.