Research Interests :
Cyber Physical Systems, Machine Learning.Publications : (Last Five, while at IITM)DBLP | View All
- Towards Bringing Parity in Pretraining Datasets for Low-resource Indian Languages.

Authors :
Kaushal Santosh Bhogale,
Deovrat Mehendale,
Tahir Javed,
Devbrat Anuragi,
Sakshi Joshi,
Sai Sundaresan,
Aparna Ananthanarayanan,
Sharmistha Dey,
Sathish Kumar Reddy G,
Anusha Srinivasan,
Abhigyan Raman,
Pratyush Kumar,
Mitesh KhapraAppeared in
2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025, Hyderabad, India, April 6-11, 2025 (ICASSP 2025) ,Vol , No., pp.1-5, Apr 2025
- A Primer on Pretrained Multilingual Language Models.

Authors :
Sumanth Doddapaneni,
Gowtham Ramesh,
Mitesh Khapra,
Anoop Kunchukuttan,
Pratyush KumarAppeared in
ACM Comput. Surv., Vol 57, No., pp.232:1-232:39, Jan 2025
- Empowering Low-Resource Language ASR via Large-Scale Pseudo Labeling.

Authors :
Kaushal Santosh Bhogale,
Deovrat Mehendale,
Niharika Parasa,
Sathish Kumar Reddy G,
Tahir Javed,
Pratyush Kumar,
Mitesh KhapraAppeared in
25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024., Vol , No., Sep 2024
- IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages.

Authors :
Tahir Javed,
Janki Nawale,
Eldho Ittan George,
Sakshi Joshi,
Kaushal Santosh Bhogale,
Deovrat Mehendale,
Ishvinder Virender Sethi,
Aparna Ananthanarayanan,
Hafsah Faquih,
Pratiti Palit,
Sneha Ravishankar,
Saranya Sukumaran,
Tripura Panchagnula,
Sunjay Murali,
Kunal Sharad Gandhi,
Ambujavalli R,
Manickam K. M,
C. Venkata Vaijayanthi,
Krishnan Srinivasa Raghavan Karunganni,
Pratyush Kumar,
Mitesh KhapraAppeared in
Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024 (ACL 2024) ,Vol , No., pp.10740-10782, Aug 2024
- IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages.

Authors :
Mohammed Safi Ur Rahman Khan,
Priyam Mehta,
Ananth Sankar,
Umashankar Kumaravelan,
Sumanth Doddapaneni,
Suriyaprasaad B,
Varun Balan G,
Sparsh Jain,
Anoop Kunchukuttan,
Pratyush Kumar,
Raj Dabre,
Mitesh KhapraAppeared in
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024 (ACL 2024) ,Vol , No., pp.15831-15879, Aug 2024
Feb 2021 - May 2021 | : | - Foundations of Deep Learning (CS6910) |
Feb 2021 - May 2021 | : | - Algorithmic Foundations of Data Science (CS6741) |
Aug 2020 - Dec 2020 | : | - Systems Engineering for Deep Learning (CS6886) |
Jan 2020 - May 2020 | : | - Systems Engineering for Deep Learning (CS6886) |
Jul 2019 - Nov 2019 | : | - Operating Systems (CS3500) |