Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(4)
2024
(10)
2023
(18)
2022
(20)
2021
(12)
2020
(12)
2019
(12)
2018
(6)
2017
(3)
2016
(6)
2015
(2)
2014
(1)
2013
(3)
2010
(1)
2007
(1)
Facet Publication Year
Research Areas
Circuits and VLSI Design
(32)
Artificial Intelligence and Machine Learning
(19)
Computer Architecture
(5)
Algorithms and Numerical Methods
(3)
Generative AI
(1)
High Performance Computing
(1)
Events
ISPD
(1)
NeurIPS
(1)
32 results found
Circuits and VLSI Design
Clear all
2022
2019
Circuits and VLSI Design
2022
Efficient Arithmetic Block Identification with Graph Learning and Network-flow
Ziyi Wang, Zhuolun He, Chen Bai,
Haoyu Yang
, Bei Yu
An Adversarial Active Sampling-based Data Augmentation Framework for Manufacturable Chip Design
Mingjie Liu
,
Haoyu Yang
, Zongyi Li, Kumara Sastry, Saumyadip Mukhopadhyay, Selim Dogru, Anima Anandkumar, David Z. Pan,
Brucek Khailany
,
Haoxing (Mark) Ren
NeurIPS
LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update
Jiawei Zhao,
Steve Dai
,
Rangharajan Venkatesan
,
Brian Zimmer
, Mustafa Ali,
Ming-Yu Liu
,
Brucek Khailany
,
William Dally
, Anima Anandkumar
Why are Graph Neural Networks Effective for EDA Problems?
Haoxing (Mark) Ren
, Siddhartha Nath,
Yanqing Zhang
, Hao Chen,
Mingjie Liu
TransSizer: A Novel Transformer-Based Fast Gate Sizer
Siddhartha Nath, Geraldo Pradipta, Corey Hu, Tian Yang,
Brucek Khailany
,
Haoxing (Mark) Ren
TAG: Learning Circuit Spatial Embedding from Layouts
Keren Zhu, Hao Chen,
Walker Turner
, George F. Kokai, Po-Hsuan Wei, David Z. Pan,
Haoxing (Mark) Ren
Photonic Circuits for Accelerated Computing Systems
Ben Lee
XT-PRAGGMA: Crosstalk Pessimism Reduction Accessible by GPU Gate-level Simulations and Machine Learning
Vidya Chhabria,
Ben Keller
,
Yanqing Zhang
, Sandeep Vollala, Sreedhar Patty,
Haoxing (Mark) Ren
,
Brucek Khailany
Placement Optimization via PPA-Directed Graph Clustering
Yi-Chen Lu, Tian Yang, Sung Kyu Lim,
Haoxing (Mark) Ren
From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus
Dian-Lun Lin,
Haoxing (Mark) Ren
,
Yanqing Zhang
,
Brucek Khailany
, Tsung-Wei Huang
A 0.297-pJ/bit 50.4-Gb/s/wire Inverter-Based Short-Reach Simultaneous Bidirectional Transceiver for Die-to-Die Interface in 5nm CMOS
Yoshinori Nishi
, John W. Poulton,
Xi Chen
,
Sanquan Song
,
Brian Zimmer
,
Walker Turner
,
Stephen Tell
,
Nikola Nedovic
,
John Wilson
,
William Dally
,
Tom Gray
A 17–95.6 TOPS/W Deep Learning Inference Accelerator with Per-Vector Scaled 4-bit Quantization for Transformers in 5nm
Ben Keller
,
Rangharajan Venkatesan
,
Steve Dai
,
Stephen Tell
,
Brian Zimmer
,
William Dally
,
Tom Gray
,
Brucek Khailany
Saving PAM4 Bus Energy with SMOREs: Sparse Multi-level Opportunistic Restricted Encodings
Mike O'Connor
,
Donghyuk Lee
,
Niladrish Chatterjee
,
Michael B. Sullivan
,
Steve Keckler
AutoCRAFT: Layout Automation for Custom Circuits in Advanced FinFET Technologies
Hao Chen,
Walker Turner
,
Sanquan Song
, Keren Zhu, George Kokai,
Brian Zimmer
,
Tom Gray
,
Brucek Khailany
,
Haoxing (Mark) Ren
ISPD
Routability-Aware Placement for Advanced FinFET Analog Circuits with Satisfiability Modulo Theories
Hao Chen,
Walker Turner
, David Z. Pan,
Haoxing (Mark) Ren
Generic Lithography Modeling with Dual-band Optics-Inspired Neural Networks
Haoyu Yang
, Zongyi Li, Kumara Sastry, Saumyadip Mukhopadhyay,
Mark Kilgard
, Anima Anandkumar,
Brucek Khailany
, Vivek Singh,
Haoxing (Mark) Ren
GATSPI: GPU Accelerated Gate-Level Simulation for Power Improvement
Yanqing Zhang
,
Haoxing (Mark) Ren
, Akshay Sridharan,
Brucek Khailany
Driving Down Link Energy and Driving Up Link Density in GPU Networks
Ben Lee
DeePattern: Layout Pattern Generation with Transforming Convolutional Auto-Encoder
Haoyu Yang
, Shuhe Li, Wen Chen, Piyush Pathak, Frank Gennari, Ya-Chieh Lai, Bei Yu
Best Paper Award
Machine Learning and Algorithms: Let Us Team Up for EDA
Haoxing (Mark) Ren
,
Brucek Khailany
,
Matt Fojtik
,
Yanqing Zhang
2019
MAGNet: A Modular Accelerator Generator for Neural Networks
Rangharajan Venkatesan
, Sophia Shao, Miaorong Wang,
Jason Clemons
,
Steve Dai
,
Matt Fojtik
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
, Priyanka Raina,
Yanqing Zhang
,
Brian Zimmer
,
William Dally
,
Joel Emer
,
Steve Keckler
,
Brucek Khailany
Simba: Scaling Deep-Learning Inference with Multi-Chip-Module-Based Architecture
Sophia Shao,
Jason Clemons
,
Rangharajan Venkatesan
,
Brian Zimmer
,
Matt Fojtik
,
Ted Jiang
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
, Priyanka Raina,
Stephen Tell
,
Yanqing Zhang
,
William Dally
,
Joel Emer
,
Tom Gray
,
Brucek Khailany
,
Steve Keckler
Best Paper award
IEEE Micro Top Picks in Computer Architecture (Honorable Mention)
A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator Designed with a High-Productivity VLSI Methodology
Rangharajan Venkatesan
, Sophia Shao,
Brian Zimmer
,
Jason Clemons
,
Matt Fojtik
,
Ted Jiang
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
, Priyanka Raina,
Stephen Tell
,
Yanqing Zhang
,
William Dally
,
Joel Emer
,
Tom Gray
,
Steve Keckler
,
Brucek Khailany
A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator with Ground-Reference Signaling in 16nm
Brian Zimmer
,
Rangharajan Venkatesan
, Sophia Shao,
Jason Clemons
,
Matt Fojtik
,
Ted Jiang
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
, Priyanka Raina,
Stephen Tell
,
Yanqing Zhang
,
William Dally
,
Joel Emer
,
Tom Gray
,
Steve Keckler
,
Brucek Khailany
DREAMPlace: Deep Learning Toolkit-Enabled GPU Acceleration for Modern VLSI Placement
Yibo Lin, Shounak Dhar, Wuxi Li,
Haoxing (Mark) Ren
,
Brucek Khailany
, David Z. Pan
DAC 2019 Best Paper Award
High Performance Graph Convolutional Networks with Applications in Testability Analysis
Yuzhe Ma,
Haoxing (Mark) Ren
,
Brucek Khailany
, Harbinder Sikka, Lijuan Luo, Karthikeyan Natarajan, Bei Yu
PRIMAL: Power Inference using Machine Learning
Yuan Zhou,
Haoxing (Mark) Ren
,
Yanqing Zhang
,
Ben Keller
,
Brucek Khailany
, Zhiru Zhang
Analog/Mixed-Signal Hardware Error Modeling for Deep Learning Inference
Angad S. Rekhi,
Brian Zimmer
,
Nikola Nedovic
, Nigxi Liu,
Rangharajan Venkatesan
, Miaorong Wang,
Brucek Khailany
,
William Dally
,
Tom Gray
A Fine-Grained GALS SoC with Pausible Adaptive Clocking in 16 nm FinFET
Matt Fojtik
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
,
Stephen Tell
,
Brian Zimmer
, Tezaswi Raja, Kevin Zhou,
William Dally
,
Brucek Khailany
ASYNC 2019 Best Paper Award
Voltage-Follower Coupling Quadrature Oscillator with Embedded Phase-Interpolator in 16nm FinFET
Xi Chen
,
Sanquan Song
, John Poulton,
Nikola Nedovic
,
Brian Zimmer
,
Stephen Tell
,
Tom Gray
Routability-Driven Macro Placement with Embedded CNN-Based Prediction Model
Yu-Hung Huang, Zhiyao Xie, Guan-Qi Fang, Tao-Chun Yu,
Haoxing (Mark) Ren
, Shao-Yun Fang, Yiran Chen, Jiang Hu
A 1.17-pJ/b, 25-Gb/s/pin Ground-Referenced Single-Ended Serial Link for Off- and On-Package Communication Using a Process- and Temperature-Adaptive Voltage Regulator
John Poulton,
John Wilson
,
Walker Turner
,
Brian Zimmer
,
Xi Chen
,
Sudhir Kudva
,
Sanquan Song
,
Stephen Tell
,
Nikola Nedovic
, Wenxu Zhao, Sunil Sudhakaran,
Tom Gray
,
William Dally