Mr. Shantanu Kumar | Official Website of Linguistic Data Consortium for Indian Languages
Academic Qualification
  • B.A. (English Hons., Mathematics, French, Sanskrit & Telugu), BHU, Varanasi
  • M.A. (Linguistics), BHU, Varanasi. Dissertation: "Named Entity Recognition in Maithili"
  • M.A. (English Literature), IGNOU, New Delhi
  • NET (Lectureship), UGC, New Delhi
  • GATE (Research Fellowship), MHRD, New Delhi
  • PhD (Computational Linguistics), Mysore University, Mysore. Research Centre: CIIL, Mysore. Topic: "Automatic Speech Recognition in Maithili: Issues & Challenges"
Trained in
  • POS Tagging
  • Annotation
  • Translation
  • Corpus Generation
  • Text Processing
  • Speech Processing
  • Pronunciation Lexicon
Position held: Junior Resource Person - II
Experience in research, training and documentation
  • Coordinator: National Seminar on Data Sampling in Angika, Bhagalpur, Bihar. Nov 2022.
  • Coordinator: Field Work on Data Collection in Angika, Bhagalpur, Bihar. Nov 5-13, 2022.
  • Maithili Resource Person: LDC-IL, CIIL. Feb 2021 - Aug 2022.
  • Coordinator: International Symposium on Maithili, Darbhanga, Bihar. Dec 2021.
  • Internship: Named Entity Recognition on Maithili, IIT-BHU, Varanasi. Jan 2020 - June 2020.
  • Internship: Hindi-Maithili Machine Translation Evaluation, IIT-BHU, Varanasi. July 2019 - Dec 2019.
Publications
  • Sathisha, Muktha & Kumar, Shantanu & Vinay, A. (2022). An Overview of Idioms of Mundari Language Spoken in Jharkhand. VI. 13.
  • Kumar, Shantanu. (2020). "Named Entity Recognition in Maithili." Master's dissertation, Banaras Hindu University, Varanasi.
  • Mundotiya, Rajesh Kumar, Shantanu Kumar, Umesh Chandra Chaudhary, Supriya Chauhan, Swasti Mishra, Praveen Gatla, and Anil Kumar Singh. "Development of a Dataset and a Deep Learning Baseline Named Entity Recognizer for Three Low Resource Languages: Bhojpuri, Maithili and Magahi." arXiv preprint arXiv:2009.06451 (2020).
Mother tongue: Maithili
Other languages known: Hindi, Sanskrit, Telugu, English, French