Frank Wang  

 
  ![Frank Wang](/sites/default/files/person/FrankWang.JPG)

  
 Research Director, Deep Learning and Computer Vision, NVIDIA

Professor, Department of Electrical Engineering, National Taiwan University


   Research Area(s)

[Artificial Intelligence and Machine Learning](/index.php/research-area/machine-learning-artificial-intelligence)

[Computer Vision](/index.php/research-area/computer-vision)

[Generative AI](/index.php/research-area/generative-ai)

 
 Main Field of Interest

[Artificial Intelligence and Machine Learning](/index.php/research-area/machine-learning-artificial-intelligence)

 
 Google Scholar

[https://scholar.google.com/citations?user=HSGvdtoAAAAJ&hl=en](https://scholar.google.com/citations?user=HSGvdtoAAAAJ&hl=en)

 
 ### Publications

 
### 2026 

[Test-Time Alignment for Large Language Models via Textual Model Predictive Control](/publication/2026-04_test-time-alignment-large-language-models-textual-model-predictive-control)

Kuang-Da Wang, Teng-Ruei Chen, Yu Heng Hung, Guo-Xun Ko, Shuoyang Ding, [Frank Wang](/person/frank-wang), [Huck Yang](/person/huck-yang), Wen-Chih Peng, Ping-Chun Hsieh


[ICLR](https://openreview.net/forum?id=DsS3xRPSs5)


### 2025 

[ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning](/publication/2025-12_thinkact-vision-language-action-reasoning-reinforced-visual-latent-planning)

[Chi-Pin Huang](/person/chi-pin-huang), Yueh-Hua Wu, [Min-Hung Chen](/person/min-hung-chen), [Frank Wang](/person/frank-wang), [Fred Yang](/person/fred-yang)


[Neural Information Processing Systems (NeurIPS) 2025](https://arxiv.org/pdf/2507.16815)


[VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations](/publication/2025-08_voicenong-robust-high-quality-speech-editing-model-without-hallucinations)

[Sung-Feng Huang](/person/sung-feng-huang), Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Pin-Jui Ku, Ante Jukić, [Huck Yang](/person/huck-yang), Yu Tsao, [Frank Wang](/person/frank-wang), Hung-yi Lee, [Szu-Wei Fu](/person/szu-wei-fu)


[Interspeech 2025](https://www.interspeech2025.org/home)


[UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation](/publication/2025-04_uniwav-towards-unified-pre-training-speech-representation-learning-and)

Alexander H. Liu, Sang-gil Lee, [Huck Yang](/person/huck-yang), Yuan Gong, [Frank Wang](/person/frank-wang), James R. Glass, Rafael Valle


[ICLR 2025](https://openreview.net/forum?id=yj9lLwMjnE)


[Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation](/publication/2025-02_semantic-prompt-learning-weakly-supervised-semantic-segmentation)

Ci-Siang Lin, Chien-Yi Wang, [Frank Wang](/person/frank-wang), [Min-Hung Chen](/person/min-hung-chen)


[Winter Conference on Applications of Computer Vision (WACV)](https://wacv2025.thecvf.com/)


### 2024 

[Diffusion-Reward Adversarial Imitation Learning](/publication/2024-12_diffusion-reward-adversarial-imitation-learning)

Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, [Frank Wang](/person/frank-wang), [Min-Hung Chen](/person/min-hung-chen), Shao-Hua Sun


[Neural Information Processing Systems (NeurIPS)](https://neurips.cc/Conferences/2024)


[Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits](/publication/2024-12_detecting-undetectable-assessing-efficacy-current-spoof-detection-methods)

[Sung-Feng Huang](/person/sung-feng-huang), Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, [Huck Yang](/person/huck-yang), Yu Tsao, [Frank Wang](/person/frank-wang), Hung-yi Lee, [Szu-Wei Fu](/person/szu-wei-fu)


[IEEE SLT 2024](https://2024.ieeeslt.org/)


[DoRA: Weight-Decomposed Low-Rank Adaptation](/publication/2024-07_dora-weight-decomposed-low-rank-adaptation)

Shih-Yang Liu, Chien-Yi Wang, [Hongxu Danny Yin](/person/danny-yin), [Pavlo Molchanov](/person/pavlo-molchanov), [Frank Wang](/person/frank-wang), Kwang-Ting Cheng, [Min-Hung Chen](/person/min-hung-chen)


[International Conference on Machine Learning (ICML) 2024](https://icml.cc/Conferences/2024)


### 2023 

[Target-free Text-guided Image Manipulation](/publication/2023-02_target-free-text-guided-image-manipulation)

Wan-Cyuan Fan, Cheng-Fu Yang, Chiao-An Yang, [Frank Wang](/person/frank-wang)


[AAAI 2023](https://aaai.org/Conferences/AAAI-23/)


[Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis](/publication/2023-02_frido-feature-pyramid-diffusion-complex-scene-image-synthesis)

Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, [Frank Wang](/person/frank-wang)


[AAAI 2023](https://aaai.org/Conferences/AAAI-23/)


[Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond](/publication/2023-01_self-supervised-pyramid-representation-learning-multi-label-visual-analysis-and)

Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang, [Frank Wang](/person/frank-wang)


[WACV 2023](https://wacv2023.thecvf.com/home)


### 2022 

[Paraphrasing Is All You Need for Novel Object Captioning](/publication/2022-11_paraphrasing-all-you-need-novel-object-captioning)

Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Ruslan Salakhutdinov, Louis-Philippe Morency, [Frank Wang](/person/frank-wang)


[NeurIPS 2022](https://nips.cc/)


[SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion](/publication/2022-11_spovt-semantic-prototype-variational-transformer-dense-point-cloud-semantic)

Sheng-Yu Huang, Hao-Yu Hsu, [Yu-Chiang Frank Wang](/person/frank-wang)


[NeurIPS 2022](https://nips.cc/)