Model Predictive Control

Test-Time Alignment for Large Language Models via Textual Model Predictive Control

Aligning Large Language Models (LLMs) with human preferences through finetuning is resource-intensive, motivating lightweight alternatives at test time. We address test-time alignment through the lens of sequential decision making, a perspective that …