ScribbleNet: Efficient interactive annotation of urban city scenes for semantic segmentation.
Authors: Bhavani Sambaturu, Ashutosh Gupta, C. V. Jawahar, Chetan Arora Pages: 109011 Year: 2023
Dataset agnostic document object detection.
Authors: Ajoy Mondal, Madhav Agarwal, C. V. Jawahar Pages: 109698 Year: 2023
Document Image Analysis Using Deep Multi-modular Features.
Authors: K. V. Jobin, Ajoy Mondal, C. V. Jawahar Pages: 5 Year: 2023
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering.
Authors: Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar Pages: 4648-4652 Year: 2023
Reading Between the Lanes: Text VideoQA on the Road.
Authors: George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar Pages: 137-154 Year: 2023
IndicSTR12: A Dataset for Indic Scene Text Recognition.
Authors: Harsh Lunia, Ajoy Mondal, C. V. Jawahar Pages: 233-250 Year: 2023
ICDAR 2023 Competition on Indic Handwriting Text Recognition.
Authors: Ajoy Mondal, C. V. Jawahar Pages: 435-453 Year: 2023
ICDAR 2023 Competition on Visual Question Answering on Business Document Images.
Authors: Sachin Raja, Ajoy Mondal, C. V. Jawahar Pages: 454-470 Year: 2023
ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition.
Authors: George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar Pages: 577-586 Year: 2023
CueCAn: Cue-driven Contextual Attention for Identifying Missing Traffic Signs on Unconstrained Roads.
Authors: Varun Gupta, Anbumani Subramanian, C. V. Jawahar, Rohit Saluja Pages: 1486-1492 Year: 2023
Towards Accurate Lip-to-Speech Synthesis in-the-Wild.
Authors: Sindhu B. Hegde, Rudrabha Mukhopadhyay, C. V. Jawahar, Vinay P. Namboodiri Pages: 5523-5531 Year: 2023
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.
Authors: Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 2216-2225 Year: 2023
FaceOff: A Video-to-Video Face Swapping System.
Authors: Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Pages: 3484-3493 Year: 2023
Watching the News: Towards VideoQA Models that can Read.
Authors: Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar Pages: 4430-4439 Year: 2023
IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes.
Authors: Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar Pages: 4471-4480 Year: 2023