  Siddharth Gururani  

 



  ![](/sites/default/files/person/sidd_ismir_website_cropped.jpeg)

  

 Siddharth Gururani is a Research Scientist at NVIDIA. Prior to joining NVIDIA, he was an AI Scientist at EA, where he worked on expressive speech synthesis, focusing on low-resource regimes and on approaches that use interpretable features to encode prosody. He received his Ph.D. in Music Technology from Georgia Tech, where he worked on weakly supervised methods for identifying musical instruments in audio. He obtained his B.Tech and M.Tech in Computer Science & Engineering from IIT Kharagpur. His research interests include text-to-speech, music information retrieval, and music generation. During his studies, he interned at Gracenote, Samsung Research America, EA, and HKUST.

He is an active member of the ISMIR community and served on the program committee of ISMIR 2021.



   Research Area(s)

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

 

 

  

 Main Field of Interest

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

 

  

 Google Scholar

[https://scholar.google.com/citations?user=_C-H8_MAAAAJ&hl=en](https://scholar.google.com/citations?user=_C-H8_MAAAAJ&hl=en)

 

  

 

 

 



 ### Publications

 

### 2025 

[Fugatto 1 - Foundational Generative Audio Transformer Opus 1](/publication/2025-04_fugatto-1-foundational-generative-audio-transformer-opus-1)

Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai, [Siddharth Gururani](/person/siddharth-gururani), Aya AlJa'fari, Alex Liu, Kevin Shih, Wei Ping, [Huck Yang](/person/huck-yang), Bryan Catanzaro



[ICLR 2025](https://openreview.net/forum?id=B2Fqu7Y2cd)