Publications


My View is the Best View: Procedure Learning from Egocentric Videos.

Authors: Siddhant Bansal, Chetan Arora 0001, C. V. Jawahar Pages: 657-675 Year: 2022 


Enhancing Indic Handwritten Text Recognition Using Global Semantic Information.

Authors: Ajoy Mondal, C. V. Jawahar Pages: 360-374 Year: 2022 


Towards Robust Handwritten Text Recognition with On-the-fly User Participation.

Authors: Ajoy Mondal, Rohit Saluja, C. V. Jawahar Pages: 12:1 Year: 2022 


Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation.

Authors: Avijit Dasgupta, C. V. Jawahar, Karteek Alahari Pages: 20:1-20:9 Year: 2022 


A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roadsāœ±.

Authors: Prafful Kumar Khoba, Chirag Parikh, C. V. Jawahar, Ravi Kiran Sarvadevabhatla, Rohit Saluja Pages: 25:1-25:9 Year: 2022 


Automatic Annotation of Handwritten Document Images at Word Level.

Authors: Ajoy Mondal, Krishna Tulsyan, C. V. Jawahar Pages: 44:1-44:9 Year: 2022 


Generalized Keyword Spotting using ASR embeddings.

Authors: Kirandevraj R, Vinod Kumar Kurmi, Vinay P. Namboodiri, C. V. Jawahar Pages: 126-130 Year: 2022 


New Objects on the Road? No Problem, We’ll Learn Them Too.

Authors: Deepak Kumar Singh, Shyam Nandan Rai, K. J. Joseph, Rohit Saluja, Vineeth N. Balasubramanian, Chetan Arora 0001, Anbumani Subramanian, C. V. Jawahar Pages: 1972-1978 Year: 2022 


Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild.

Authors: Sindhu B. Hegde, K. R. Prajwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 6250-6258 Year: 2022 


Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors.

Authors: Sindhu B. Hegde, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 6511-6520 Year: 2022 


Grounded Video Situation Recognition.

Authors: Zeeshan Khan, C. V. Jawahar, Makarand Tapaswi Year: 2022 


ETL: Efficient Transfer Learning for Face Tasks.

Authors: Thrupthi Ann John, Isha Dua, Vineeth N. Balasubramanian, C. V. Jawahar Pages: 248-257 Year: 2022 


To miss-attend is to misalign! Residual Self-Attentive Feature Alignment for Adapting Object Detectors.

Authors: Vaishnavi Khindkar, Chetan Arora 0001, Vineeth N. Balasubramanian, Anbumani Subramanian, Rohit Saluja, C. V. Jawahar Pages: 376-386 Year: 2022 


FLUID: Few-Shot Self-Supervised Image Deraining.

Authors: Shyam Nandan Rai, Rohit Saluja, Chetan Arora 0001, Vineeth N. Balasubramanian, Anbumani Subramanian, C. V. Jawahar Pages: 418-427 Year: 2022 


Multi-Domain Incremental Learning for Semantic Segmentation.

Authors: Prachi Garg, Rohit Saluja, Vineeth N. Balasubramanian, Chetan Arora 0001, Anbumani Subramanian, C. V. Jawahar Pages: 2080-2090 Year: 2022 

Scroll to Top