IITKGP

Krothapalli Sreenivasa Rao

Professor

Computer Science and Engineering

+91-3222-282336

ksrao@cse.iitkgp.ac.in

Research Areas

For the last 16 years I have been working on signal processing and machine learning, targeted mainly at speech applications. In collaboration with the Govt. of India (DIT, MCIT, DST) and other premier technological institutes of India, we have developed various speech systems in Indian languages. During the initial period of my career, my focus was on the acquisition and incorporation of prosody for developing various speech systems. Later, my focus shifted to (i) expressive speech analysis/synthesis, (ii) development of robust speech systems, (iii) vocal fold activity analysis and synthesis for speech and biomedical applications, (iv) development of appropriate signal processing methods to extract characteristic features from Hindustani music and (v) big-data analysis frameworks and audio and multimedia analytics.

My current focus is on (i) developing robust speech interfaces for Indian languages, targeted at objectives such as E-Governance, Digital India and smartphones, (ii) exploring signal processing and machine learning paradigms for automatic processing of Hindustani music and (iii) exploring big-data analytics for speech, music, audio and video document representation, indexing and retrieval tasks.

 
  • Prosody modification using instants of significant excitation Sreenivasa Rao K., Yegnanarayana B. By IEEE Transactions on Audio, Speech and Language Processing Vol. 14, IEEE 972 - 980 (2006)
  • Voice/Non-voice Detection Using Phase of Zero Frequency Filtered Speech Signal S. B. Sunil Kumar and K. Sreenivasa Rao By Speech Communication Vol. 81, Elsevier 90 - 103 (2016)
  • Determination of instants of significant excitation in speech using Hilbert envelope and group delay function K. Sreenivasa Rao, S. R. M. Prasanna and B. Yegnanarayana By IEEE Signal Processing Letters Vol. 14, IEEE 762 - 765 (2007)
  • Duration modification using Glottal Closure Instants and Vowel Onset Points Sreenivasa Rao K., Yegnanarayana B. By Speech Communication Vol. 51, 1263 - 1269 (2009)
  • Voice Conversion by Mapping the Speaker-specific features using Pitch Synchronous Approach K. Sreenivasa Rao By Computer Speech and Language Vol. 24, Elsevier 474 - 494 (2010)
  • Vowel Onset Point Detection for Low Bit Rate Coded Speech Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti and K. Sreenivasa Rao By IEEE Transactions on Audio, Speech and Language Processing Vol. 20, No. 6, IEEE 1894 - 1903 (2012)
  • Non-Uniform Time Scale Modification Using Instants of Significant Excitation and Vowel Onset Points K. Sreenivasa Rao and Anil Kumar Vuppala By Speech Communication Vol. 55, No. 6, Elsevier 745 - 756 (2013)
  • Two-Stage Intonation Modeling using Feedforward Neural Networks for syllable based Text-to-Speech Synthesis V. Ramu Reddy and K. Sreenivasa Rao By Computer Speech and Language Vol. 27, Elsevier 1105 - 1126 (2013)
  • Detection of Vowel Offset Point from Speech Signal Jainath Yadav and K. Sreenivasa Rao By Signal Processing Letters Vol. 20, No. 4, IEEE 299 - 302 (2013)
  • Time-Domain Deterministic Plus Noise Model based Hybrid Source Modeling for HMM-Based Speech Synthesis N. P. Narendra and K. Sreenivasa Rao By Speech Communication Vol. 77, Elsevier 65 - 83 (2016)

Principal Investigator

  • National Language Translation Mission (NLTM): BHASHINI

Ph. D. Students

Abhijit Debnath

Area of Research: Multimedia Data Analytics

Annepu. Sai Sriharsha

Area of Research: Speech and Natural Language Processing

Aravinda Reddy P N

Area of Research: Speech Processing

Arup Kumar Dutta

Area of Research: Speech and Audio Processing

Haque Arijul

Area of Research: Speech Processing

Priya Dharshini G

Area of Research: Speech Processing

Saikat Biswas

Area of Research: Audio Data Analytics

Soumen Paul

Area of Research: Human Computer Interactions - Computer Vision

Soumya Majumdar

Area of Research: Speech Processing

Sudhakar P

Area of Research: Speech Processing

Y Madhu Keerthana

Area of Research: Speech Processing