Sameer Dharur is a research scientist on the Cosmos team at NVIDIA, helping to build vision-language-models (VLMs) that reason better about the world. Prior to that, he spent ~4.5 years as a researcher and engineer at Apple specializing in computer vision and natural language processing to solve problems in image and video understanding, question answering, and robotics. His long term research goals are around building AI systems that can visualize the world, communicate, and take actions in reasonable and interpretable ways under resource-constrained settings. His industry experience over 7+ years includes key contributions to making Apple's Siri more robust, enhancing Salesforce's Einstein Reply Recommendations and contributing to the development of Qualcomm's Snapdragon Neural Processing Engine.