Skip to main content | Skip to Navigation | Text Size : | Language :

logo of Linguistic Data Consortium for Indian Languages (LDC-IL)
Speech Data Creation | Official Website of Linguistic Data Consortium for Indian Languages

Speech Data Creation

LDC-IL collects two kinds of speech data: Read speech data and Spontaneous speech data. Collecting speech data involves capturing various linguistic elements to enhance speech-driven machine-learning programs used in natural language processing. Achieving the correct sensitivity, accuracy, and contextual comprehension of language nuances is crucial to avoid misinterpreting fundamental speech data. The project is focused on creating extensive speech datasets for all Indian languages. It aims to promote and support speech recognition technology across diverse linguistic communities. LDC-IL supports advancements in language technology and improves the digital communication experience in Indian languages.

Team members

Kashmiri Dr. Zargar Adil Ahmad