ChipNeMo: Domain-Adapted LLMs for Chip Design

ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we adopt the following domain adaptation techniques: custom tokenizers, domain-adaptive continued pretraining, supervised fine-tuning (SFT) with domain-specific instructions, and domain-adapted retrieval models. We evaluate these methods on three selected LLM applications for chip design: an engineering assistant chatbot, EDA script generation, and bug summarization and analysis. Our results show that these domain adaptation techniques significantly improve LLM performance over general-purpose base models across the three evaluated applications, allowing up to a 5x reduction in model size with similar or better performance on a range of design tasks. Our findings also indicate that there is still room for improvement between our current results and ideal outcomes. We believe that further investigation of domain-adapted LLM approaches will help close this gap in the future.

Authors

Mingjie Liu

Teo Ene (NVIDIA)

Robert Kirby (NVIDIA)

Chris Cheng (NVIDIA)

Nathaniel Pinckney

Rongjian Liang

Jonah Alben (NVIDIA)

Himyanshu Anand (NVIDIA)

Sanmitra Banerjee (NVIDIA)

Ismet Bayraktaroglu (NVIDIA)

Bonita Bhaskaran (NVIDIA)

Bryan Catanzaro (NVIDIA)

Arjun Chaudhuri (NVIDIA)

Sharon Clay (NVIDIA)

Bill Dally (NVIDIA)

Laura Dang (NVIDIA)

Parikshit Deshpande (NVIDIA)

Siddhanth Dhodhi (NVIDIA)

Sameer Halepete (NVIDIA)

Eric Hill (NVIDIA)

Jiashang Hu (NVIDIA)

Sumit Jain (NVIDIA)

Brucek Khailany

George Kokai (NVIDIA)

Kishor Kunal (NVIDIA)

Xiaowei Li (NVIDIA)

Charley Lind (NVIDIA)

Hao Liu (NVIDIA)

Stuart Oberman (NVIDIA)

Sujeet Omar (NVIDIA)

Sreedhar Pratty (NVIDIA)

Jonathan Raman (NVIDIA)

Ambar Sarkar (NVIDIA)

Zhengjiang Shao (NVIDIA)

Hanfei Sun (NVIDIA)

Pratik P Suthar (NVIDIA)

Varun Tej (NVIDIA)

Walker Turner

Kaizhe Xu (NVIDIA)

Haoxing (Mark) Ren (NVIDIA)

Publication Date

Monday, October 30, 2023

Research Area

Artificial Intelligence and Machine Learning

Circuits and VLSI Design

Generative AI

Uploaded Files

paper (690 KB)