Karan Sikka

Dr. Karan Sikka is a ~~Senior Computer Vision Scientist at [SRI International](https://www.sri.com/computer-vision/), Princeton, USA~~ Research Scientist at Meta(AI). He completed his PhD from University of California San Diego in 2016 (advised by Dr. Marian Bartlett). He completed his bachelors in ECE from IIT Guwahati, India in 2010.

Dr. Sikka’s doctoral thesis centered on developing machine learning models for action classification in videos, specifically under conditions of weak supervision. Upon joining SRI, his research trajectory shifted towards multimodal learning, emphasizing approaches for learning under few/zero-shot settings. He explored the use of diverse modalities to enhance various tasks, spanning from visual grounding to social media-analysis and geo-localization. His present research focuses on leveraging large language models (Generative AI), across a spectrum of applications including robotics, visual understanding, personalized content generation. Furthermore, he is also interested in enhancing the consistency of these models and mitigating issues such as hallucination. His work has been published at high quality venues such as CVPR, ICCV, ECCV, and also won multiple awards.

Please check the following links for more details.