Publications


ScribbleNet: Efficient interactive annotation of urban city scenes for semantic segmentation.

Authors: Bhavani Sambaturu, Ashutosh Gupta, C. V. Jawahar, Chetan Arora Pages: 109011 Year: 2023


Dataset agnostic document object detection.

Authors: Ajoy Mondal, Madhav Agarwal, C. V. Jawahar Pages: 109698 Year: 2023 


Document Image Analysis Using Deep Multi-modular Features.

Authors: K. V. Jobin, Ajoy Mondal, C. V. Jawahar Pages: 5 Year: 2023 


Understanding Video Scenes through Text: Insights from Text-based Video Question Answering.

Authors: Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar Pages: 4648-4652 Year: 2023 


Reading Between the Lanes: Text VideoQA on the Road.

Authors: George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar Pages: 137-154 Year: 2023 


IndicSTR12: A Dataset for Indic Scene Text Recognition.

Authors: Harsh Lunia, Ajoy Mondal, C. V. Jawahar Pages: 233-250 Year: 2023 


ICDAR 2023 Competition on Indic Handwriting Text Recognition.

Authors: Ajoy Mondal, C. V. Jawahar Pages: 435-453 Year: 2023 


ICDAR 2023 Competition on Visual Question Answering on Business Document Images.

Authors: Sachin Raja, Ajoy Mondal, C. V. Jawahar Pages: 454-470 Year: 2023 


ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition.

Authors: George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar Pages: 577-586 Year: 2023 


CueCAn: Cue-driven Contextual Attention for Identifying Missing Traffic Signs on Unconstrained Roads.

Authors: Varun Gupta, Anbumani Subramanian, C. V. Jawahar, Rohit Saluja Pages: 1486-1492 Year: 2023 


Towards Accurate Lip-to-Speech Synthesis in-the-Wild.

Authors: Sindhu B. Hegde, Rudrabha Mukhopadhyay, C. V. Jawahar, Vinay P. Namboodiri Pages: 5523-5531 Year: 2023 


Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.

Authors: Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 2216-2225 Year: 2023 


FaceOff: A Video-to-Video Face Swapping System.

Authors: Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 3484-3493 Year: 2023 


Watching the News: Towards VideoQA Models that can Read.

Authors: Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar Pages: 4430-4439 Year: 2023 


IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes.

Authors: Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar Pages: 4471-4480 Year: 2023 
