NVIDIA Research Taiwan
Recursive RL
Recursive Think-Answer Process for LLMs and VLMs
Think-Answer reasoners such as DeepSeek-R1 have made notable progress by leveraging interpretable internal reasoning. However, despite the frequent presence of self-reflective cues like 'Oops!', they remain vulnerable to output errors during …