Research Interests :
Statistical Machine Translation, Text Analytics, Deep Learning and Crowd-sourcingPublications : (Last Five, while at IITM)DBLP | View All
- IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages.
Authors :
Tahir Javed,
Janki Nawale,
Eldho Ittan George,
Sakshi Joshi,
Kaushal Santosh Bhogale,
Deovrat Mehendale,
Ishvinder Virender Sethi,
Aparna Ananthanarayanan,
Hafsah Faquih,
Pratiti Palit,
Sneha Ravishankar,
Saranya Sukumaran,
Tripura Panchagnula,
Sunjay Murali,
Kunal Sharad Gandhi,
Ambujavalli R,
Manickam K. M,
C. Venkata Vaijayanthi,
Krishnan Srinivasa Raghavan Karunganni,
Pratyush Kumar,
Mitesh KhapraAppeared in
Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024 (ACL 2024) ,Vol , No., pp.10740-10782, Aug 2024
- How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
Authors :
Anushka Singh,
Ananya Sai,
Raj Dabre,
Ratish Puduppully,
Anoop Kunchukuttan,
Mitesh KhapraAppeared in
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Student Research Workshop, Bangkok, Thailand, August 11-16, 2024 (ACL 2024) ,Vol , No., pp.640-649, Aug 2024
- IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages.
Authors :
Mohammed Safi Ur Rahman Khan,
Priyam Mehta,
Ananth Sankar,
Umashankar Kumaravelan,
Sumanth Doddapaneni,
Suriyaprasaad B,
Varun Balan G,
Sparsh Jain,
Anoop Kunchukuttan,
Pratyush Kumar,
Raj Dabre,
Mitesh KhapraAppeared in
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024 (ACL 2024) ,Vol , No., pp.15831-15879, Aug 2024
- A Comprehensive Analysis of Adapter Efficiency.
Authors :
Nandini Mundra,
Sumanth Doddapaneni,
Raj Dabre,
Anoop Kunchukuttan,
Ratish Puduppully,
Mitesh KhapraAppeared in
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), Bangalore, India, January 4-7, 2024, pp.136-154, Jan 2024
- Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users.
Authors :
Yash Madhani,
Sushane Parthan,
Priyanka Bedekar,
Gokul NC,
Ruchi Khapra,
Anoop Kunchukuttan,
Pratyush Kumar,
Mitesh KhapraAppeared in
Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. (EMNLP 2023) ,pp.40-57, Dec 2023
Jul 2023 - Nov 2023 | : | - Introduction to Programming (CS1100) |
Jan 2023 - May 2023 | : | - Foundations of Deep Learning (CS6910) |
Jan 2022 - Apr 2022 | : | - Dual Degree Project - III (CS5815) |
Jan 2022 - Apr 2022 | : | - Foundations of Deep Learning (CS6910) |
Aug 2021 - Dec 2021 | : | - Linear Algebra and Random Processes (CS6015) |