  Stephen Tell  

 



  ![](/sites/default/files/person/Stephen_Tell.JPG)

  

 Stephen G. Tell joined NVIDIA's Circuits Research Group in April 2009. Prior to joining NVIDIA, he has worked on a variety of high-performance computation and interconnect projects at Rambus, Velio Communications, and the UNC Microelectronics Systems Lab.



   Research Area(s)

[Circuits and VLSI Design](/index.php/research-area/circuits)

 

 

  

 

 

 



 ### Publications

 

### 2026 

[Alpha-Vision: A Real-Time Always-on Vision Processor with 787µs Face Detection Latency in &lt;5mW](/publication/2026-02_alpha-vision-real-time-always-vision-processor-787ms-face-detection-latency)

[Ben Keller](/person/ben-keller), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Steve Dai](/person/steve-dai), [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Muya Chang](/person/muya-chang), Thierry Tambe, [Nathaniel Pinckney](/person/nathaniel-pinckney), [Stephen Tell](/person/stephen-tell), [Qijing Jenny Huang](/person/qijing-jenny-huang), [Shalini De Mello](/person/shalini-de-mello), [Brucek Khailany](/person/brucek-khailany)



[ISSCC 2026](https://www.isscc.org/)









### 2024 

[A 0.190-pJ/bit 25.2-Gb/s/wire Inverter-Based AC-Coupled Transceiver for Short-Reach Die-to-Die Interfaces in 5-nm CMOS](/publication/2024-04_0190-pjbit-252-gbswire-inverter-based-ac-coupled-transceiver-short-reach-die)

[Yoshinori Nishi](/person/yoshi-nishi), John W. Poulton, [Xi Chen](/person/xi-chen), [Sanquan Song](/person/sanquan-song), [Brian Zimmer](/person/brian-zimmer), [Walker Turner](/person/walker-turner), [Stephen Tell](/person/stephen-tell), [Nikola Nedovic](/person/nikola-nedovic), [John Wilson](/person/john-wilson), [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



[IEEE Journal of Solid-State Circuits (JSSC) (Volume: 59, Issue: 4, April 2024)](https://ieeexplore.ieee.org/document/10185334)









### 2023 

[A 0.190-pJ/bit 25.2-Gb/s/wire Inverter-Based AC-Coupled Transceiver for Short-Reach Die-to-Die Interfaces in 5-nm CMOS](/publication/2023-06_0190-pjbit-252-gbswire-inverter-based-ac-coupled-transceiver-short-reach-die)

[Yoshinori Nishi](/person/yoshi-nishi), John W. Poulton, [Xi Chen](/person/xi-chen), [Sanquan Song](/person/sanquan-song), [Brian Zimmer](/person/brian-zimmer), [Walker Turner](/person/walker-turner), [Stephen Tell](/person/stephen-tell), [Nikola Nedovic](/person/nikola-nedovic), [John Wilson](/person/john-wilson), [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



[2023 IEEE SYMPOSIUM ON VLSI TECHNOLOGY &amp; CIRCUITS](https://ieeexplore.ieee.org/abstract/document/10185334)









[A 0.297-pJ/Bit 50.4-Gb/s/Wire Inverter-Based Short-Reach Simultaneous Bi-Directional Transceiver for Die-to-Die Interface in 5-nm CMOS](/publication/2023-04_0297-pjbit-504-gbswire-inverter-based-short-reach-simultaneous-bi-directional)

[Yoshinori Nishi](/person/yoshi-nishi), John W. Poulton, [Walker Turner](/person/walker-turner), [Xi Chen](/person/xi-chen), [Sanquan Song](/person/sanquan-song), [Brian Zimmer](/person/brian-zimmer), [Stephen Tell](/person/stephen-tell), [Nikola Nedovic](/person/nikola-nedovic), [John Wilson](/person/john-wilson), [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



[IEEE Journal of Solid-State Circuits ( Volume: 58, Issue: 4, April 2023)](https://ieeexplore.ieee.org/document/10011563)









[A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm](/publication/2023-01_956-topsw-deep-learning-inference-accelerator-vector-scaled-4-bit-quantization)

[Ben Keller](/person/ben-keller), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Steve Dai](/person/steve-dai), [Stephen Tell](/person/stephen-tell), [Brian Zimmer](/person/brian-zimmer), [Charbel Sakr](/person/charbel-sakr), [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray), [Brucek Khailany](/person/brucek-khailany)



[Journal of Solid-State Circuits](https://ieeexplore.ieee.org/document/10019275)









### 2022 

[A 0.297-pJ/bit 50.4-Gb/s/wire Inverter-Based Short-Reach Simultaneous Bidirectional Transceiver for Die-to-Die Interface in 5nm CMOS](/publication/2022-06_0297-pjbit-504-gbswire-inverter-based-short-reach-simultaneous-bidirectional)

[Yoshinori Nishi](/person/yoshi-nishi), John W. Poulton, [Xi Chen](/person/xi-chen), [Sanquan Song](/person/sanquan-song), [Brian Zimmer](/person/brian-zimmer), [Walker Turner](/person/walker-turner), [Stephen Tell](/person/stephen-tell), [Nikola Nedovic](/person/nikola-nedovic), [John Wilson](/person/john-wilson), [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



[2022 IEEE SYMPOSIUM ON VLSI TECHNOLOGY &amp; CIRCUITS](https://archive.vlsisymposium.org/22web/about/)









[A 17–95.6 TOPS/W Deep Learning Inference Accelerator with Per-Vector Scaled 4-bit Quantization for Transformers in 5nm](/index.php/publication/2022-06_17-956-topsw-deep-learning-inference-accelerator-vector-scaled-4-bit)

[Ben Keller](/index.php/person/ben-keller), [Rangharajan Venkatesan](/index.php/person/rangharajan-venkatesan), [Steve Dai](/index.php/person/steve-dai), [Stephen Tell](/index.php/person/stephen-tell), [Brian Zimmer](/index.php/person/brian-zimmer), [William Dally](/index.php/person/william-dally), [Tom Gray](/index.php/person/tom-gray), [Brucek Khailany](/index.php/person/brucek-khailany)



[2022 Symposium on VLSI Technology &amp; Circuits Digest of Technical Papers](https://www.vlsisymposium.org)









### 2021 

[Simba: scaling deep-learning inference with chiplet-based architecture](/publication/2021-05_simba-scaling-deep-learning-inference-chiplet-based-architecture)

Yakun Sophia Shao, [Jason Clemons](/person/jason-clemons), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Brian Zimmer](/person/brian-zimmer), [Matt Fojtik](/person/matt-fojtik), [Ted Jiang](/person/ted-jiang), [Ben Keller](/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/person/nathaniel-pinckney), Priyanka Raina, [Stephen Tell](/person/stephen-tell), [Yanqing Zhang](/person/yanqing-zhang), [William Dally](/person/william-dally), [Joel Emer](/person/joel-emer), [Tom Gray](/person/tom-gray), [Brucek Khailany](/person/brucek-khailany), [Steve Keckler](/person/stephen-keckler)



[Communications of the ACM](https://dl.acm.org/doi/10.1145/3460227)



ACM Research Highlight





### 2020 

[Reference-Noise Compensation Scheme for Single- Ended Package-to-Package Links](/publication/2020-02_reference-noise-compensation-scheme-single-ended-package-package-links)

[Xi Chen](/person/xi-chen), [Nikola Nedovic](/person/nikola-nedovic), [Stephen Tell](/person/stephen-tell), [Sudhir Kudva](/person/sudhir-kudva), [Brian Zimmer](/person/brian-zimmer), [Trey Greer](/person/trey-greer), John Poulton, [Sanquan Song](/person/sanquan-song), [Walker Turner](/person/walker-turner), [John Wilson](/person/john-wilson), [Tom Gray](/person/tom-gray)



[2020 International Solid-State Circuits Conference](http://isscc.org/)









[A 0.32–128 TOPS, Scalable Multi-Chip-Module-Based Deep Neural Network Inference Accelerator With Ground-Referenced Signaling in 16 nm](/publication/2020-01_032-128-tops-scalable-multi-chip-module-based-deep-neural-network-inference)

[Brian Zimmer](/person/brian-zimmer), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), Yakun Sophia Shao, [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Ted Jiang](/person/ted-jiang), [Ben Keller](/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/person/nathaniel-pinckney), Priyanka Raina, [Stephen Tell](/person/stephen-tell), [Yanqing Zhang](/person/yanqing-zhang), [William Dally](/person/william-dally), [Joel Emer](/person/joel-emer), [Tom Gray](/person/tom-gray), [Steve Keckler](/person/stephen-keckler), [Brucek Khailany](/person/brucek-khailany)



[IEEE Journal of Solid-State Circuits (JSSC)](https://ieeexplore.ieee.org/document/8959403)



JSSC 2020 Best Paper award





### 2019 

[Simba: Scaling Deep-Learning Inference with Multi-Chip-Module-Based Architecture](/publication/2019-10_simba-scaling-deep-learning-inference-multi-chip-module-based-architecture)

Sophia Shao, [Jason Clemons](/person/jason-clemons), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Brian Zimmer](/person/brian-zimmer), [Matt Fojtik](/person/matt-fojtik), [Ted Jiang](/person/ted-jiang), [Ben Keller](/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/person/nathaniel-pinckney), Priyanka Raina, [Stephen Tell](/person/stephen-tell), [Yanqing Zhang](/person/yanqing-zhang), [William Dally](/person/william-dally), [Joel Emer](/person/joel-emer), [Tom Gray](/person/tom-gray), [Brucek Khailany](/person/brucek-khailany), [Steve Keckler](/person/stephen-keckler)



[International Symposium on Microarchitecture (MICRO)](https://dl.acm.org/doi/10.1145/3352460.3358302)



Best Paper award, IEEE Micro Top Picks in Computer Architecture (Honorable Mention)





[A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator Designed with a High-Productivity VLSI Methodology](/publication/2019-08_011-pjop-032-128-tops-scalable-multi-chip-module-based-deep-neural-network)

[Rangharajan Venkatesan](/person/rangharajan-venkatesan), Sophia Shao, [Brian Zimmer](/person/brian-zimmer), [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Ted Jiang](/person/ted-jiang), [Ben Keller](/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/person/nathaniel-pinckney), Priyanka Raina, [Stephen Tell](/person/stephen-tell), [Yanqing Zhang](/person/yanqing-zhang), [William Dally](/person/william-dally), [Joel Emer](/person/joel-emer), [Tom Gray](/person/tom-gray), [Steve Keckler](/person/stephen-keckler), [Brucek Khailany](/person/brucek-khailany)



[Hot Chips: A Symposium on High Performance Chips](http://www.hotchips.org/)









[A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator with Ground-Reference Signaling in 16nm](/publication/2019-06_011-pjop-032-128-tops-scalable-multi-chip-module-based-deep-neural-network)

[Brian Zimmer](/person/brian-zimmer), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), Sophia Shao, [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Ted Jiang](/person/ted-jiang), [Ben Keller](/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/person/nathaniel-pinckney), Priyanka Raina, [Stephen Tell](/person/stephen-tell), [Yanqing Zhang](/person/yanqing-zhang), [William Dally](/person/william-dally), [Joel Emer](/person/joel-emer), [Tom Gray](/person/tom-gray), [Steve Keckler](/person/stephen-keckler), [Brucek Khailany](/person/brucek-khailany)



[Symposium on VLSI Circuits](https://ieeexplore.ieee.org/document/8778056)









[A Fine-Grained GALS SoC with Pausible Adaptive Clocking in 16 nm FinFET](/index.php/publication/2019-05_fine-grained-gals-soc-pausible-adaptive-clocking-16-nm-finfet)

[Matt Fojtik](/index.php/person/matt-fojtik), [Ben Keller](/index.php/person/ben-keller), Alicia Klinefelter, [Nathaniel Pinckney](/index.php/person/nathaniel-pinckney), [Stephen Tell](/index.php/person/stephen-tell), [Brian Zimmer](/index.php/person/brian-zimmer), Tezaswi Raja, Kevin Zhou, [William Dally](/index.php/person/william-dally), [Brucek Khailany](/index.php/person/brucek-khailany)



[ASYNC 2019](http://www.async2019.jp/)



ASYNC 2019 Best Paper Award





[Voltage-Follower Coupling Quadrature Oscillator with Embedded Phase-Interpolator in 16nm FinFET](/publication/2019-04_voltage-follower-coupling-quadrature-oscillator-embedded-phase-interpolator)

[Xi Chen](/person/xi-chen), [Sanquan Song](/person/sanquan-song), John Poulton, [Nikola Nedovic](/person/nikola-nedovic), [Brian Zimmer](/person/brian-zimmer), [Stephen Tell](/person/stephen-tell), [Tom Gray](/person/tom-gray)



[IEEE Custom Integrated Circuits Conference 2019](http://ieee-cicc.org/)









[A 1.17-pJ/b, 25-Gb/s/pin Ground-Referenced Single-Ended Serial Link for Off- and On-Package Communication Using a Process- and Temperature-Adaptive Voltage Regulator](/publication/2019-01_117-pjb-25-gbspin-ground-referenced-single-ended-serial-link-and-package)

John Poulton, [John Wilson](/person/john-wilson), [Walker Turner](/person/walker-turner), [Brian Zimmer](/person/brian-zimmer), [Xi Chen](/person/xi-chen), [Sudhir Kudva](/person/sudhir-kudva), [Sanquan Song](/person/sanquan-song), [Stephen Tell](/person/stephen-tell), [Nikola Nedovic](/person/nikola-nedovic), Wenxu Zhao, Sunil Sudhakaran, [Tom Gray](/person/tom-gray), [William Dally](/person/william-dally)



IEEE JOURNAL OF SOLID-STATE CIRCUITS









### 2018 

[Ground-Referenced Signaling for Intra-Chip and Short-Reach Chip-to-Chip Interconnects](/publication/2018-04_ground-referenced-signaling-intra-chip-and-short-reach-chip-chip-interconnects)

[Walker Turner](/person/walker-turner), John Poulton, [John Wilson](/person/john-wilson), [Xi Chen](/person/xi-chen), [Stephen Tell](/person/stephen-tell), [Matt Fojtik](/person/matt-fojtik), [Trey Greer](/person/trey-greer), [Brian Zimmer](/person/brian-zimmer), [Sanquan Song](/person/sanquan-song), [Nikola Nedovic](/person/nikola-nedovic), [Sudhir Kudva](/person/sudhir-kudva), Sunil Sudhakaran, Rizwan Bashirullah, Wenxu Zhao, [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



Custom Integrated Circuits Conference









[A 1.17pJ/b 25Gb/s/pin Ground-Referenced Single Ended Serial Link for Off- and On-Package Communication in 16nm CMOS Using a Process- and Temperature-Adaptive Voltage Regulator](/index.php/publication/2018-02_117pjb-25gbspin-ground-referenced-single-ended-serial-link-and-package)

[John Wilson](/index.php/person/john-wilson), [Walker Turner](/index.php/person/walker-turner), John Poulton, [Brian Zimmer](/index.php/person/brian-zimmer), [Xi Chen](/index.php/person/xi-chen), [Sanquan Song](/index.php/person/sanquan-song), [Stephen Tell](/index.php/person/stephen-tell), [Nikola Nedovic](/index.php/person/nikola-nedovic), Wenxu Zhao, Sunil Sudhakaran, [Tom Gray](/index.php/person/tom-gray), [William Dally](/index.php/person/william-dally)



ISSCC









### 2016 

[A 28nm 2Mbit 6T SRAM with Highly Configurable Write Assist Implementation and Capacitor Based Sense Amplifier Input Offset Compen](/publication/2016-02_28nm-2mbit-6t-sram-highly-configurable-write-assist-implementation-and)

Mahmut Sinangil, John Poulton, [Matt Fojtik](/person/matt-fojtik), [Trey Greer](/person/trey-greer), [Stephen Tell](/person/stephen-tell), Andy Gotterba, Jesse Wang, Jason Golbus, [William Dally](/person/william-dally), [Tom Gray](/person/tom-gray)



Journal of Solid State Circuits









[A 6.5-to-23.3fJ/b/mm Balanced Charge-Recycling Bus in 16nm FinFET CMOS at 1.7-to-2.6Gb/s/wire with Clock Forwarding and Low-Crosstalk Contraflow Wiring](/publication/2016-02_65-233fjbmm-balanced-charge-recycling-bus-16nm-finfet-cmos-17-26gbswire-clock)

[John Wilson](/person/john-wilson), [Matt Fojtik](/person/matt-fojtik), John Poulton, [Xi Chen](/person/xi-chen), [Stephen Tell](/person/stephen-tell), [Trey Greer](/person/trey-greer), [Tom Gray](/person/tom-gray), [William Dally](/person/william-dally)



[International Solid-State Circuits Conference (ISSCC 2016)](http://ieeexplore.ieee.org/document/7417954/)









### 2013 

[A 0.54 pJ/b 20 Gb/s Ground-Referenced Single-Ended Short-Reach Serial Link in 28 nm CMOS for Advanced Packaging Applications](/publication/2013-12_054-pjb-20-gbs-ground-referenced-single-ended-short-reach-serial-link-28-nm)

John Poulton, [William Dally](/person/william-dally), [Xi Chen](/person/xi-chen), John Eyles, [Trey Greer](/person/trey-greer), [Stephen Tell](/person/stephen-tell), [John Wilson](/person/john-wilson), [Tom Gray](/person/tom-gray)



[Journal of Solid State Circuits](http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6601723)









[A 0.54pJ/b 20Gb/s Ground-Referenced Single-Ended Short-Haul Serial Link in 28nm CMOS for Advanced Packaging Applications](/index.php/publication/2013-02_054pjb-20gbs-ground-referenced-single-ended-short-haul-serial-link-28nm-cmos)

John Poulton, [William Dally](/index.php/person/william-dally), [Xi Chen](/index.php/person/xi-chen), John Eyles, [Trey Greer](/index.php/person/trey-greer), [Stephen Tell](/index.php/person/stephen-tell), [Tom Gray](/index.php/person/tom-gray)



[ISSCC](http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6487789)









### 2010 

[The Even/Odd Synchronizer: A Fast, All-Digital Periodic Synchronizer](/publication/2010-05_evenodd-synchronizer-fast-all-digital-periodic-synchronizer)

[William Dally](/person/william-dally), [Stephen Tell](/person/stephen-tell)



[16th International Symposium on Asynchronous Circuits and Systems](https://ieeexplore.ieee.org/document/5476986)