Central Institute of Indian Languages [CIIL] MISSION STATEMENT: Annotated, quality language data (both-text & speech) and tools in Indian Languages to Individuals, Institutions and Industry for Research & Development - Created in-house, through outsourcing and acquisition..  Our Other Sites  Related Sites 
You are here: BACK
Faculty >Shantanu Kumar
Shantanu Kumar

Position Held:

  •      JRP - II Maithili

Academic Record Button Trained In Button Address
Button Awards & Distinctions Button Computing Skills Certification
Professional Experience Publication Button Workshop & Conferences
Button Languages Known

Academic Record:






English Hons., Mathematics, French, Sanskrit & Telugu

BHU, Varanasi



Dissertation: Named Entity Recognition in Maithili

BHU, Varanasi



English Literature

IGNOU, New Delhi




UGC, New Delhi

Dec 2019, Jun 2020


Research Fellowship

MHRD, New Delhi

Jan 2021


Computational Linguistics
Topic: "Automatic Speech Recognition in Maithili: Issues & Challenges"

Mysore University, Mysore
Research Centre: CIIL, Mysore


Awards & Distinctions:

  • Graduate Aptitude Test in Engineering in Linguistics (2021)

  • UGC-NET Lectureship award in Linguistics in Dec 2019 and June 2020

  • Awarded by the District Megistrate for position in 12th examination

  • Awarded by the Superintendent of Police for position in 10th examination

Trained In:

  • PoS Tagging
  • Annotation
  • Translation
  • Corpus Generation
  • Text Processing
  • Speech Processing
  • Pronunciation Lexicon

Professional Experience:

  •      Coordinator : National Seminar on Data Sampling in Angika, Bhagalpur, Bihar. Nov, 2022.
  •      Coordinator : Field Work on Data Collection in Angika, Bhagalpur, Bihar. Nov 5th-13th, 2022.
  •      Maithili Resource Person: LDC-IL, CIIL. Feb, 2021 - Aug, 2022.
  •      Coordinator : International Symposium on Maithili, Darbhanga, Bihar. Dec, 2021.
  •      Internship: Named Entity Recognition on Maithili, IIT-BHU, Varanasi. Jan, 2020 - June, 2020.
  •      Internship: Hindi-Maithili Machine Translation Evaluation, IIT-BHU, Varanasi, July, 2019 - Dec, 2019.


  • National Service Scheme, BHU. 2017

  • Diploma in Computer Management, 2016.

Computing Skills:

  • Platforms: Well versed with Windows and Linux (Ubuntu)

  • Development Environment: Python, R, MySQL 5, CSS, HTML

  • Training in : NLP, Data Science, Deep Learning, Language Technology



  • Sathisha, Muktha & Kumar, Shantanu & Vinay, A.. (2022). An Overview of Idioms of Mundari Language Spoken in Jharkhand. VI. 13.

  • Kumar, Shantanu.(2020) "Named Entity Recognition in Maithili." Masters diss., Banaras Hindu University, Varanasi.

  • Mundotiya, Rajesh Kumar, Shantanu Kumar, Umesh Chandra Chaudhary, Supriya Chauhan, Swasti Mishra, Praveen Gatla, and Anil Kumar Singh. "Development of a Dataset and a Deep Learning Baseline Named Entity Recognizer for Three Low Resource Languages: Bhojpuri, Maithili and Magahi." arXiv preprint arXiv:2009.06451 (2020).

Workshop & Conferences:

  • Summer School on Language Documentation organised by SPPEL, CIIL, Myosre (May 17-31, 2022)

  • Training Programme on Language Documentation organized by SPPEL, CIIL, Mysore (March 21-29, 2022)

  • FDP on "Python Programming for Beginners using Artificial Intelligence and Machine Learning" at NIT-Warangal(Oct 2021-Nov 2021)

  • FDP on "Introduction to Speech Processing and its Applications using AI-ML (ISPA)" at CDAC-Kolkata (October 25th - October 29th 2021)

  • FDP on "Recent Advancements in Automatic Speech Recognition and Speaker Verification" at NIT-Sikkim (September 27th -October 1st 2021)

  • Summer School on "Automatic Speech Recognition" organised at IIT-Dharwar with IIIT-Dharwar, Karnataka (July 19th - July 30th 2021).

  • Workshop on Indian Language Data: Resources and Evaluation workshop on May 24, 2020, CIIL, Mysore.

  • Workshop on Indian Language Data: Resources and Evaluation workshop on May 24, 2020 International Webinar Series on Linguistics, Team Bhasha-Chintan, Banaras Hindu University (06/06-25/07/2020)

  • National Workshop on Digitization and Development of E-Resources for Sanskrit (27-31 May) (Jointly organized by Jawaharlal Nehru University and Delhi University)

Languages Known:

  • Mother Tongue: Maithili

  • Other Languages: Hindi, Sanskrit, Telugu, English, French


  • Current : W201, SRLC, CIIL Campus, Manasagangothri, Hunsur Road, Mysore-570006
    Email: shantanuk.ciil@gmail.com

Visitor Counter


Developed & Maintained by:
Copyright © LDC-IL,
Central Institute of Indian Languages
Central Institute of Indian Languages
Department of Higher Education
Ministry of Education
Government of India
Manasagangothri, Hunsur Road, Mysore-570006, Karnataka, India.
Tel: (0821) 2515820 (Director)
Reception/PABX : (0821) 2345000
Fax: (0821) 2515032 (Off)
        Home | Announcements | News | CIIL | Contact Us