Ashwin Sankar

AI Resident @ AI4Bharat, IIT Madras.

profile_pic.jpg

434, SSB

Indian Institute of Technology Madras

Chennai, TN 600 042

I’m an AI Resident at AI4Bharat, Indian Institute of Technology Madras. I work with Prof. Mitesh Khapra on developing natural and expressive TTS systems, indistinguishable from human speech, for the Indian languages.

My research interests broady cover TTS and Speech/Audio X Multimodal models. One of my primary goals is to develop high-quality, multilingual, and multispeaker TTS systems that can be deployed in real-world applications. Released Rasa*, IndicOOV, and IndicVoices-R* as a step towards this goal.

Currently, I’m working on robust and large-scale evaluation of speech data and TTS systems. I welcome ideas and collaborations on this front.

I’m on the lookout for a PhD position starting Fall 2025. I’m interested in fluent, multilingual and interruptible conversational systems. Feel free to check out my resume or drop me an email to chat with me.


Research Collaborators: Prof. Mitesh Khapra, IIT Madras; Suvrat Bhooshan, Gan.ai

news

Jun 04, 2024 :books: 2 papers accepted at InterSpeech 2024. See you in Greece.
Dec 28, 2022 :books: Joined TTSTeam @ AI4Bharat as AI Resident.
May 25, 2022 :mortar_board: Graduated with distinction from my Bachelors degree (B.E. Computer Science and Engineering).

selected publications

  1. Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings
    Praveen Srinivasa Varadhan ,  Ashwin Sankar ,  Giri Raju , and 1 more author
    In Proc. INTERSPEECH 2024 , 2024
  2. Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies
    Srija Anand ,  Praveen Srinivasa Varadhan ,  Ashwin Sankar , and 2 more authors
    In Proc. INTERSPEECH 2024 , 2024