Chitrarth: Bridging Vision and Language for a Billion People
Shaharukh Khan, Ayush Tarun, Abhinav Ravi, Ali Faraz, Praveen Kumar Pokala , Anagha Bhangare, Raja Kolla, Chandra Khatri and Shubham Agarwal
Vision Model
ICASSP
NeurIPS
IndicST: Indian Multilingual Translation Corpus for Evaluating Speech Large Language Models
Sanket Shah, Kavya Ranjan Saxena, Kancharana Manideep Bharadwaj, Sharath Adavanne, Nagaraj Adiga
Speech and Language
ICASSP
Chitranuvad: Adapting Multi-lingual LLMs for Multimodal Translation
Shaharukh Khan, Ayush Tarun, Ali Faraz, Palash Kamble, Vivek Dahiya, Praveen Pokala, Ashish Kulkarni, Chandra Khatri, Abhinav Ravi, Shubham Agarwal
Multimodal Translation
ACL
ADAS: Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images
Sarthak Sharma , Unnikrishnan R. Nair , Udit Singh Parihar , Midhun Menon S, Srikanth Vidapanakal
ADAS: Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving
Unnikrishnan R Nair, Sarthak Sharma, Udit Singh Parihar Midhun S Menon, Srikanth Vidapanakal
Autonomous Driving
NeurIPS