Sameer Dharur  

 
  ![](/sites/default/files/person/Sameer.Dharur.png)

  
 Sameer Dharur is a research scientist on the [Cosmos](https://www.nvidia.com/en-us/ai/cosmos/) team at NVIDIA, helping to build vision-language models (VLMs) that reason better about the world. Before that, he spent about 4.5 years as a researcher and engineer at Apple, specializing in computer vision and natural language processing to solve problems in image and video understanding, question answering, and robotics. His long-term research goal is to build AI systems that can visualize the world, communicate, and take actions in reasonable and interpretable ways under resource-constrained settings. His 7+ years of industry experience include key contributions to making Apple's Siri more robust, enhancing Salesforce's Einstein Reply Recommendations, and developing Qualcomm's Snapdragon Neural Processing Engine.


   Research Area(s)

[Artificial Intelligence and Machine Learning](/research-area/machine-learning-artificial-intelligence)

[Computer Vision](/research-area/computer-vision)

[Generative AI](/research-area/generative-ai)

[Natural Language Processing](/research-area/natural-language-processing)

[Physical AI](/research-area/physical-ai)

 
 Main Field of Interest

[Artificial Intelligence and Machine Learning](/research-area/machine-learning-artificial-intelligence)

 
 Google Scholar

[https://scholar.google.com/citations?user=IVs8P7MAAAAJ&hl=en](https://scholar.google.com/citations?user=IVs8P7MAAAAJ&hl=en)