Publications

2024

  1. Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models
    Yangyi Chen , Karan Sikka , Michael Cogswell , and 2 more authors
    In NAACL , 2024
  2. SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
    Abhinav Rajvanshi , Karan Sikka , Xiao Lin , and 3 more authors
    In PNAS , 2024
  3. Dress: Instructing large vision-language models to align and interact with humans via natural language feedback
    Yangyi Chen , Karan Sikka , Michael Cogswell , and 2 more authors
    2024

2023

  1. TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models
    Indranil Sur , Karan Sikka , Matthew Walmer , and 5 more authors
    In ICCV , 2023
  2. Multilingual Content Moderation: A Case Study on Reddit
    Meng Ye , Karan Sikka , Katherine Atwell , and 3 more authors
    In EACL , 2023
  3. Predicting Information Pathways Across Online Communities
    Yiqiao Jin , Yeon-Chang Lee , Kartik Sharma , and 4 more authors
    In KDD , 2023
  4. Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning
    Anirudh Som , Karan Sikka , Helen Gent , and 3 more authors
    arXiv preprint arXiv:2310.10707, 2023
  5. A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
    Matthew Gwilliam , Michael Cogswell , Meng Ye , and 3 more authors
    arXiv preprint arXiv:2312.00115, 2023

2022

  1. Dual-Key Multimodal Backdoors for Visual Question Answering
    Matthew Walmer , Karan Sikka , Indranil Sur , and 2 more authors
    In CVPR , 2022
  2. Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark
    Pritish Sahu , Karan Sikka , and Ajay Divakaran
    In WACV , 2022

2021

  1. Towards solving multimodal comprehension
    Pritish Sahu , Karan Sikka , and Ajay Divakaran
    arXiv, 2021
  2. MISA: Online Defense of Trojaned Models using Misattributions
    Panagiota Kiourti , Wenchao Li , Anirban Roy , and 2 more authors
    In Annual Computer Security Applications Conference , 2021
  3. Resilient Data Augmentation Approaches to Multimodal Verification in the News Domain
    John Cadigan , Karan Sikka , Meng Ye , and 1 more author
    In ICCV Workshops , 2021

2020

  1. Deep adaptive semantic logic (dasl): Compiling declarative knowledge into deep neural networks
    Karan Sikka , Andrew Silberfarb , John Byrnes , and 4 more authors
    arXiv, 2020
  2. Rgb2lidar: Towards solving large-scale cross-modal visual localization
    Niluthpol Chowdhury Mithun , Karan Sikka , Han-Pang Chiu , and 2 more authors
    In ACMM , 2020
  3. Zero-shot learning with knowledge enhanced visual semantic embeddings
    Karan Sikka , Jihua Huang , Andrew Silberfarb , and 6 more authors
    arXiv, 2020
  4. Detecting trojaned dnns using counterfactual attributions
    Karan Sikka , Indranil Sur , Susmit Jha , and 2 more authors
    arXiv, 2020

2019

  1. Align2ground: Weakly supervised phrase grounding guided by image-caption alignment
    Samyak Datta , Karan Sikka , Anirban Roy , and 3 more authors
    In ICCV , 2019
  2. Integrating text and image: Determining multimodal document intent in instagram posts
    Julia Kruk , Jonah Lubin , Karan Sikka , and 3 more authors
    EMNLP, 2019
  3. Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks
    Karan Sikka , Lucas Van Bramer , and Ajay Divakaran
    arXiv, 2019
  4. Foodx-251: a dataset for fine-grained food classification
    Parneet Kaur , Karan Sikka , Weijun Wang , and 2 more authors
    CVPR Workshops, 2019
  5. Sunny and dark outside?! improving answer consistency in vqa through entailed question generation
    Arijit Ray , Karan Sikka , Ajay Divakaran , and 2 more authors
    EMNLP, 2019
  6. Semantically-Aware Attentive Neural Embeddings for Long-Term 2D Visual Localization
    Zachary Seymour , Karan Sikka , Han-Pang Chiu , and 2 more authors
    In BMVC , 2019

2018

  1. Zero-shot object detection
    Ankan Bansal , Karan Sikka , Gaurav Sharma , and 2 more authors
    In ECCV , 2018
  2. Understanding visual ads by aligning symbols and objects using co-attention
    Karuna Ahuja , Karan Sikka , Anirban Roy , and 1 more author
    In CVPR Workshops , 2018
  3. Make up your mind: Towards consistent answer predictions in vqa models
    Arijit Ray , Giedrius T Burachas , Karan Sikka , and 4 more authors
    In ECCV Workshops , 2018

2017

  1. Deep active object recognition by joint label and action prediction
    Mohsen Malmir , Karan Sikka , Deborah Forster , and 3 more authors
    CVIU, 2017
  2. Discriminatively trained latent ordinal model for video classification
    Karan Sikka , and Gaurav Sharma
    PAMI, 2017
  3. Adascan: Adaptive scan pooling in deep convolutional neural networks for human action recognition in videos
    Amlan Kar , Nishant Rai , Karan Sikka , and 1 more author
    In CVPR , 2017

2016

  1. Lomo: Latent ordinal model for facial analysis in videos
    Karan Sikka , Gaurav Sharma , and Marian Bartlett
    In CVPR , 2016

2015

  1. The more the merrier: Analysing the affect of a group of people in images
    Abhinav Dhall , Jyoti Joshi , Karan Sikka , and 2 more authors
    In AFGR , 2015
  2. Exemplar hidden markov models for classification of facial expressions in videos
    Karan Sikka , Abhinav Dhall , and Marian Bartlett
    In CVPR Workshops , 2015
  3. Automated assessment of children’s postoperative pain using computer vision
    Karan Sikka , Alex A Ahmed , Damaris Diaz , and 4 more authors
    Pediatrics, 2015
  4. Joint Clustering and Classification for Multiple Instance Learning
    Karan Sikka , Ritwik Giri , and Marian Bartlett
    In BMVC , 2015
  5. Deep Q-learning for Active Recognition of GERMS: Baseline performance on a standardized dataset for active learning.
    Mohsen Malmir , Karan Sikka , Deborah Forster , and 2 more authors
    In BMVC , 2015

2014

  1. A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild
    Abhinav Dhall , Karan Sikka , Gwen Littlewort , and 2 more authors
    In WACV , 2014
  2. Classification and weakly supervised pain localization using multiple segment representation
    Karan Sikka , Abhinav Dhall , and Marian Stewart Bartlett
    IVC, 2014
  3. Emotion recognition in the wild challenge 2014: Baseline, data and protocol
    Abhinav Dhall , Roland Goecke , Jyoti Joshi , and 2 more authors
    In ICMI , 2014
  4. Facial expression analysis for estimating pain in clinical settings
    Karan Sikka
    In ICMI , 2014

2013

  1. Weakly supervised pain localization using multiple instance learning
    Karan Sikka , Abhinav Dhall , and Marian Bartlett
    In AFGR , 2013
  2. Multiple kernel learning for emotion recognition in the wild
    Karan Sikka , Karmen Dykstra , Suchitra Sathyanarayana , and 2 more authors
    In ICMI , 2013
  3. Pseudo vs. true defect classification in printed circuits boards using wavelet features
    Sahil Sikka , Karan Sikka , Manas Kamal Bhuyan , and 1 more author
    arXiv preprint arXiv:1310.6654, 2013

2012

  1. Exploring bag of words architectures in the facial expression domain
    Karan Sikka , Tingfan Wu , Josh Susskind , and 1 more author
    In ECCV , 2012

2011

  1. Texture information-based hybrid methodology for the segmentation of SAR images
    Pankaj K Singh , Nitesh Sinha , Karan Sikka , and 1 more author
    International journal of remote sensing, 2011

2010

  1. Comparison of algorithms for ultrasound image segmentation without ground truth
    Karan Sikka , and Thomas M Deserno
    In Medical Imaging 2010: Image Perception, Observer Performance, and Technology Assessment , 2010

2009

  1. A fully automated algorithm under modified FCM framework for improved brain MR image segmentation
    Karan Sikka , Nitesh Sinha , Pankaj K Singh , and 1 more author
    Magnetic Resonance Imaging, 2009