Skip to main content | Skip to Navigation | Text Size : | Language :

logo of Linguistic Data Consortium for Indian Languages (LDC-IL)
Current status | Official Website of Linguistic Data Consortium for Indian Languages

Current status

Linguistic Data Consortium for Indian Languages develops and distributes qualitative linguistic resources and software technologies to support and enhance Indian languages. LDC-IL created 56 datasets in Indian language both in text and speech. Moreover it developed many AI tools, Non AI tools, Desktop tools and Mobile Applications.

Current Projects:
Text Corpora:
Speech corpora:
Parallel Corpus:

270 languages

TTS Data:

Creating speech corpora for Indian languages

Datasets:
AI Application:
  1. Bhasha Setu, Translation Engine
  2. Lipyantara, Transliteration application
  3. Lipidha, OCR tool
  4. Shabd Sandhan, Corpus search tool
  5. Dhvani Parivartak, Audio converter tool
  6. AnuLekhika, Transcription tool
  7. AnuVachika, Text to speech tool.
Desktop Applications:
  1. Chanu, Meitei Mayek In-Script Keyboard
  2. Leimaren, Meitei Mayek Phonetic Keyboard
  3. Darpana, Grapheme to Phoneme
  4. Seema, Iterative type-token Analyser
  5. Sandra, WAV to MP3/AAC/WMA Converter
  6. Meya, Edit distance calculator
  7. Mili, Transliterator
  8. Padavriti, Frequency counter
  9. Padanveshi, Keyword finder
  10. Taranga, WAV Metadata Extractor
  11. Unicode finder
  12. Quick File Renamer