Indian Institute of Technology Kharagpur

Journal

A Novel Zero-Resource Spoken Term Detection Using Affinity Kernel Propagation with Acoustic Feature Map Sudhakar P., Rao K. S., Mitra P. By SN Computer Science - (Accepted/In-Press)
Detection of Neurogenic Voice Disorders Using the Fisher Vector Representation of Cepstral Features Keerthana Y. M., Alku P. , Rao K. S., Mitra P. By Journal of Voice - (Accepted/In-Press)
Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals Ravi K.K., Krothapalli S.R. By Circuits, Systems, and Signal Processing 41 2088-2117 (2022)
CycleGAN-Based Speech Mode Transformation Model for Robust Multilingual ASR Tripathi K., Rao K.S. By Circuits, Systems, and Signal Processing 41 5283-5305 (2022)
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network Kishore Kumar R., Sreenivasa Rao K. By Computer Speech and Language 71 - (2022)
Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features Keerthana Y. M., Rao K. S., Mitra P. By International Journal of Speech Technology 25 967-973 (2022)
SongF0: A Spectrum-Based Fundamental Frequency Estimation for Monophonic Songs Rengaswamy P., Rao K. S., Dasgupta P. By Circuits, Systems, and Signal Processing 40 772-797 (2021)
Relation Prediction of Co-morbid Diseases Using Knowledge Graph Completion Biswas S., Mitra P. , Sreenivasa Rao K. By IEEE/ACM transactions on computational biology and bioinformatics 18 708-717 (2021)
Approaches for Multilingual Phone Recognition in Code-switched and Non-code-switched Scenarios Using Indian Languages Manjunath K.E., Srinivasa Raghavan S.R.K., Rao K.S., Jayagopi D.B., Ramasubramanian V. By ACM Transactions on Asian and Low-Resource Language Information Processing 20 - (2021)
hf0: A Hybrid Pitch Extraction Method for Multimodal Voice Rengaswamy P., M G. R., Rao K. S., Dasgupta P. By Circuits, Systems, and Signal Processing 40 262-275 (2021)
Multilingual Audio-Visual Smartphone Dataset and Evaluation Mandalapu H., Reddy P.N.A., Ramachandra R., Rao K.S., Mitra P., Prasanna S.R.M., Busch C. By IEEE Access 9 153240-153257 (2021)
Robust vowel region detection method for multimode speech Tripathi K., Rao K.S. By Multimedia Tools and Applications 80 13615-13637 (2021)
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework Sen N., Sahidullah M., Patil H.A., Das Mandal S.K., Rao K.S., Basu T.K. By International Journal of Speech Technology 24 1067-1088 (2021)
VOP detection for read and conversation speech using CWT coefficients and phone boundaries Tripathi K., Rao K.S. By Journal of Ambient Intelligence and Humanized Computing - (2021)
Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey Mandalapu H., Reddy A. P., Ramachandra R. , Rao K. S., Mitra P. , S R M P. , Busch C. By IEEE Access 9 37431-37455 (2021)
Multilingual and multimode phone recognition system for Indian languages Tripathi K., Reddy M.K., Rao K.S. By Speech Communication 119 12-23 (2020)
Articulatory feature based methods for performance improvement of multilingual phone recognition systems using Indian languages E M. K., Jayagopi D. , Rao K. S., V R. By SADHANA 45 - (2020)
hf0: A Hybrid Pitch Extraction Method for Multimodal Voice Rengaswamy P., Reddy M.G., Rao K.S., Dasgupta P. By Circuits, Systems, and Signal Processing - (2020)
DNN-based cross-lingual voice conversion using Bottleneck Features Kiran Reddy M., Rao K. S. By Neural Processing Letters 51 2029-2042 (2020)
Excitation modelling using epoch features for statistical parametric speech synthesis Reddy M.K., Rao K.S. By Computer Speech and Language 60 - (2020)
Identification of glottal instants using electroglottographic signal for vulnerable cases of voicing Mandal T., Rao K.S., Gupta S.K. By Healthcare Technology Letters 7 132-138 (2020)
Robust f0 extraction from monophonic signals using adaptive sub-band filtering Pradeep R., Kiran Reddy M. , Rao K. S., Dasgupta P. By Speech Communication 116 77-85 (2020)
VEP Detection for Read, Extempore and Conversation Speech Tripathi K., Rao K.S. By IETE Journal of Research - (2020)
BOXREC: Recommending a Box of Preferred Outfits in Online Shopping Banerjee D., Rao K.S., Sural S., Ganguly N. By ACM Transactions on Intelligent Systems and Technology 11 1-28 (2020)
Detection of Specific Language Impairment in Children Using Glottal Source Features Reddy M.K., Alku P., Rao K.S. By IEEE Access 8 15273-15279 (2020)
CWT-Based Approach for Epoch Extraction From Telephone Quality Speech Keerthana Y., Kiran Reddy M., Rao K. S. By IEEE Signal Processing Letters 26 1107-1111 (2019)
LSTM-Based Robust Voicing Decision Applied to DNN-Based Speech Synthesis Pradeep R., Reddy M.K., Rao K.S. By Automatic Control and Computer Sciences 53 328-332 (2019)
Development and Analysis of Multilingual Phone Recognition Systems using Indian Languages K E M., Jayagopi D. B., Rao K. S., V R. By International Journal of Speech Technology 22 157-168 (2019)
Incorporation of Manner of Articulation Constraint in LSTM for Speech Recognition Pradeep R., Rao K.S. By Circuits, Systems, and Signal Processing 38 3482-3500 (2019)
Children Story Classification in Indian Languages using Linguistic and Keyword based Features Hari Krishna D., Rao K. S. By ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 19 - (2019)
A Robust Unsupervised Pattern Discovery and Clustering of Speech Signals Ravi K. K., Birla L. , Rao K. S. By Pattern Recognition Letters 116 254-261 (2018)
Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time Domain Adaptive Filtering Based Method Reddy M., Rao K. S. By Circuits, Systems and Signal Processing - (2018)
Language identification using phase information Datta A., Rao K. S. By International Journal of Speech Technology - (2018)
Neural Network and GMM based Feature Mappings for Consonant-Vowel Recognition in Emotional Environment Yadav J., Rao K. S. By International Journal of Speech Technology 21 421-433 (2018)
Automatic note transcription system for Hindustani classical music Dhara P., Rao K.S. By International Journal of Speech Technology 21 987-1003 (2018)
Improvement of phone recognition accuracy using speech mode classification Tripathi K., Rao K. S. By International Journal of Speech Technology - (2018)
Inverse filter based excitation model for HMM-based speech synthesis system Reddy M., Rao K. S. By IET Signal Processing - (2018)
Epoch detection from emotional speech signal using zero time windowing Yadav J., Fahad M.S., Rao K.S. By Speech Communication 96 142-149 (2018)
Raga and Tonic Identification in Carnatic Music Manjabhat S., Koolagudi S. G., Rao K. S., Ramteke P. B. By Journal of New Music Research 46 229-245 (2017)
Generation of creaky voice for improving the quality of HMM-based speech synthesis N., Rao K. S. By Computer Speech and Language 42 38-58 (2017)
Supervector-based approaches in discriminative framework for speaker verification in noisy environments Sarkar S., Rao K. S. By International Journal of Speech Technology 20 387-416 (2017)
Robust Glottal Activity Detection using the Phase of an Electroglottographic Signal S., Mandal T. , Rao K. S. By Biomedical Signal Processing and Control - (2017)
Robust pitch extraction method for HMM-based speech synthesis system Reddy M., Rao K. S. By IEEE Signal Processing Letters 24 1133-1137 (2017)
Parameterization of excitation signal for improving the quality of HMM-based speech synthesis system Narendra N. P., Sreenivasa Rao K. By Circuits, Systems and Signal Processing - (2017)
Parametric representation of excitation source information for language identification Nandi D., Pati D. , Rao K. S. By Computer Speech and Language 41 88-115 (2017)
Implicit processing of LP residual for language identification Nandi D., Pati D. , Rao K. S. By Computer Speech and Language 41 68-87 (2017)
Improvement of Phone Recognition Accuracy using Articulatory Features M., Rao K. S. By Circuits, Systems and Signal Processing - (2017)
Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech Haque A., Rao K. S. By International Journal of Speech Technology 20 15-25 (2017)
Time-Domain Deterministic Plus Noise Model based Hybrid Source Modeling for HMM-Based Speech Synthesis N. P. Narendra and K. Sreenivasa Rao By Speech Communication Vol. 77, Elsevier 65 - 83 (2016)
Articulatory and Excitation Source Features for Speech Recognition in Read, Extempore and Conversation Modes Manjunath K. E., Sreenivasa Rao K. By International Journal of Speech Technology 19 121-134 (2016)
Prosodic Mapping using Neural Networks for Emotion Conversion in Hindi Language Jainath Y., Sreenivasa Rao K. By Circuits, Systems and Signal Processing 35 139-162 (2016)
Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks Ramu Reddy V., Sreenivasa Rao K. By Neurocomputing 171 1323-1334 (2016)
Voice/Non-voice Detection Using Phase of Zero Frequency Filtered Speech Signal S. B. Sunil Kumar and K. Sreenivasa Rao By Speech Communication Vol. 81, Elsevier 90 - 103 (2016)
Source and System Features for Phone Recognition Manjunath K. E., Sreenivasa Rao K. By International Journal of Speech Technology 18 257-270 (2015)
Recognition of emotions from video using acoustic and facial features Sreenivasa Rao K., Shashidhar K. G. By Signal Image and Video Processing 9 1029-1045 (2015)
Implicit excitation source features for robust language identification Nandi D., Pati D. , Sreenivasa Rao K. By International Journal of Speech Technology 18 459-477 (2015)
Robust Voicing Detection and F0 Estimation for HMM-Based Speech Synthesis N. P. Narendra and K. Sreenivasa Rao By Circuits, Systems and Signal Processing Vol. 34, Springer 2597 - 2619 (2015)
Segmentation, indexing and retrieval of TV broadcast news bulletins using Gaussian mixture models and vector quantization codebooks Sreenivasa Rao K., Pachpande K. By International Journal of Speech Technology 17 259-269 (2014)
Film segmentation and indexing using autoassociative neural networks Sreenivasa Rao K., Nandi D. , Shashidhar K. G. By International Journal of Speech Technology 17 65-74 (2014)
Stochastic feature compensation methods for speaker verification in noisy environments Sourjya Sarkar and K. Sreenivasa Rao By Applied Soft Computing Vol. 19, Elsevier 198 - 214 (2014)
Speaker Identification under Background Noise using Features Extracted from Steady Vowel Regions Anil Kumar V., Sreenivasa Rao K. By International Journal of Adaptive control and Signal processing 27 781-792 (2013)
Detection of Vowel Offset Point from Speech Signal Jainath Yadav and K. Sreenivasa Rao By Signal Processing Letters Vol. 20, No. 4, IEEE 299 - 302 (2013)
Non-Uniform Time Scale Modification Using Instants of Significant Excitation and Vowel Onset Points K. Sreenivasa Rao and Anil Kumar Vuppala By Speech Communication Vol. 55, No. 6, Elsevier 745 - 756 (2013)
Optimal weight tuning method for unit selection cost functions in syllable based text-to-speech synthesis N. P. Narendra and K. Sreenivasa Rao By Applied Soft Computing Vol. 13, Elsevier 773 - 781 (2013)
Improved Speaker Identification in Wireless Environment Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By International Journal of Signal and Imaging Systems Engineering 130-137 (2013)
Pitch Synchronous and Glottal Closure based Speech Analysis for Language Recognition Sreenivasa Rao K., Maity S. , Ramu Reddy V. By International Journal of Speech Technology 413-430 (2013)
Two-Stage Intonation Modeling using Feedforward Neural Networks for syllable based Text-to-Speech Synthesis V. Ramu Reddy and K. Sreenivasa Rao By Computer Speech and Language Vol. 27, Elsevier 1105 - 1126 (2013)
Emotion Recognition from Speech using global and local prosodic features Sreenivasa Rao K., Shashidhar K. G., Ramu Reddy V. By International Journal of Speech Technology 143-160 (2013)
Recognition of Indian languages using multi-level spectral and prosodic features Ramu Reddy V., Maity S. , Sreenivasa Rao K. By International Journal of Speech Technology 489-510 (2013)
Vowel Onset Point Detection for Noisy Speech using Spectral Energy at Formant Frequencies Anil Kumar V., Sreenivasa Rao K. By International Journal of Speech Technology 229-235 (2013)
Characterization and recognition of emotions from speech using excitation source information Sreenivasa Rao K., Shashidhar K. G. By International Journal of Speech Technology 181-201 (2013)
Classification of Infant Cries Using Dynamics of Epoch Features Singh A. K., Mukhopadhyay J. , Rao K. S., Viswanath K. By Journal of Intelligent Systems 22 253-267 (2013)
Vowel Onset Point Detection for Low Bit Rate Coded Speech Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti and K. Sreenivasa Rao By IEEE Transactions on Audio, Speech and Language Processing Vol. 20, No. 6, IEEE 1894 - 1903 (2012)
Emotion Recognition from Speech : A Review Shashidhar K. G., Sreenivasa Rao K. By International Journal of Speech Technology 99-117 (2012)
A pitch synchronous approach to design voice conversion system using source-filter correlation Laskar R. H., Banerjee K. , Talukdar F. A., Sreenivasa Rao K. By International Journal of Speech Technology 419-431 (2012)
Emotion Recognition from Speech using Source, System and Prosodic features Shashidhar K. G., Sreenivasa Rao K. By International Journal of Speech Technology 265-289 (2012)
Emotion Recognition from Speech using subsyllabic and pitch synchronous spectral features Shashidhar K. G., Sreenivasa Rao K. By International Journal of Speech Technology 495-511 (2012)
Spotting and Recognition of Consonant-Vowel Units from Continuous Speech using Accurate Vowel Onset Point Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By Circuits, Systems and Signal Processing 31 1459-1474 (2012)
Neural network based feature transformation for emotion independent speaker identification Sreenivasa Rao K., Yadav J. , Sarkar S. , Shashidhar K. G., Anil Kumar V. By International Journal of Speech Technology 335-349 (2012)
Syllable Specific Unit Selection Cost Functions for Text-to-Speech Synthesis N. P. Narendra and K. Sreenivasa Rao By ACM Transactions on Speech and Language Processing Vol. 9, ACM - (2012)
Comparing ANN and GMM in a voice conversion framework Laskar R. H., Chakraborty D. , Talukdar F. A., Sreenivasa Rao K. , Banerjee K. By Applied Soft Computing 12 3332-3342 (2012)
Unconstrained pitch contour modification using instants of significant excitation Sreenivasa Rao K. By Circuits, Systems and Signal Processing 31 2133-2152 (2012)
Improved consonant vowel recognition for low bit-rate coded speech Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By International Journal of Speech Technology 333-349 (2012)
Improved Vowel Onset Point Detection using Epoch Intervals Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By International Journal of Electronics and Communications 66 697-700 (2012)
Development of Syllable-based Text-to-Speech Synthesis System in Bengali Narendra N. P., Sreenivasa Rao K. , Ghosh K. , Ramu Reddy V. , Maity S. By International Journal of Speech Technology 167-181 (2011)
Two Stage Emotion Recognition Based on Speaking Rate Shashidhar K. G., Sreenivasa Rao K. By International Journal of Speech Technology 35-48 (2011)
Application of Prosody models for Developing Speech systems in Indian languages Sreenivasa Rao K. By International Journal of Speech Technology 19-33 (2011)
Recognition of Consonant-Vowel (CV) Units under Background Noise using Combined Temporal and Spectral Preprocessing Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. , Krishnamoorthy P. , Prasanna S. R. By International Journal of Speech Technology 259-272 (2011)
Recognition of emotions from video using neural network models Sreenivasa Rao K., Saroj V. K., Maity S. , Shashidhar K. G. By Expert systems and applications 38 13181-13185 (2011)
Identification of Hindi Dialects and Emotions using Spectral and Prosodic features of Speech Sreenivasa Rao K., Shashidhar K. G. By Journal of Systems, Cybernetics and Informatics 24-33 (2011)
Role of Neural network models for developing speech systems Sreenivasa Rao K. By SADHANA 36 783-836 (2011)
Real time prosody modification Sreenivasa Rao K. By Journal of Signal and Information Processing 50-62 (2010)
Selection of suitable features for modeling the durations of syllables Sreenivasa Rao K., Shashidhar K. G. By Journal of Software Engineering and Applications 1107-1117 (2010)
Voice Conversion by Mapping the Speaker-specific features using Pitch Synchronous Approach K. Sreenivasa Rao By Computer Speech and Language Vol. 24, Elsevier 474 - 494 (2010)
Intonation modeling for Indian languages Sreenivasa Rao K., Yegnanarayana B. By Computer Speech and Language 23 240-256 (2009)
Duration modification using Glottal Closure Instants and Vowel Onset Points Sreenivasa Rao K., Yegnanarayana B. By Speech communication 51 1263-1269 (2009)
Determination of instants of significant excitation in speech using Hilbert envelope and group delay function K. Sreenivasa Rao, S. R. M. Prasanna and B. Yegnanarayana By IEEE Signal Processing Letters Vol. 14, IEEE 762 - 765 (2007)
Modeling durations of syllables using neural networks Sreenivasa Rao K., Yegnanarayana B. By Computer Speech and Language 21 282-295 (2007)
Prosody modification using instants of significant excitation Sreenivasa Rao K., Yegnanarayana B. By IEEE Transactions on Audio, Speech, and Language Processing 14 972-980 (2006)

Conferences

Relation Predictions in Comorbid Disease Centric Knowledge Graph Using Heterogeneous GNN Models Biswas S., Chaudhuri K. D., Mitra P., Rao K. S. By 10th International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO 2023) - (2023)
NrityaManch: An Annotation and Retrieval System for Bharatanatyam Dance Paul S., Saha R. , Padhi S. , Majumdar S. , Das P. P., Rao K. S. By Forum for Information Retrieval Evaluation (FIRE 22) - (2022)
Knowledge distillation for singing voice detection Paul S., Gurunath Reddy M., Sreenivasa Rao K., Das P.P. By Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 5 3671-3675 (2021)
Multilingual Phone Recognition: Comparison of Traditional versus Common Multilingual Phone-Set Approaches and Applications in Code-Switching Manjunath K.E., Raghavan K.M.S., Rao K.S., Jayagopi D.B., Ramasubramanian V. By Communications in Computer and Information Science 1209 CCIS 75-86 (2020)
Multilingual speech mode classification model for indian languages Tripathi K., Sreenivasa Rao K. By 26th National Conference on Communications, NCC 2020 - (2020)
Glottal closure instants detection from EGG signal by classification approach Gurunath Reddy M., Sreenivasa Rao K., Das P.P. By Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2020-October 4891-4895 (2020)
Improved HMM-Based Mixed-Language (Telugu Hindi) Polyglot Speech Synthesis Reddy M.K., Rao K.S. By Lecture Notes in Electrical Engineering 614 279-287 (2020)
Mel-scaled Wavelet-based Features for Spoofing Speech Detection Kiran Reddy M., Rao K. S. By International Conference on Electrical, Control and Computer Engineering - (2019)
Modifying LSTM Posteriors with Manner of Articulation Knowledge to Improve Speech Recognition Performance Pradeep R., Rao K.S. By Proceedings - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018 769-772 (2019)
Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual Gurunath Reddy M., Rao K. S., Das P. P. By INTERSPEECH-2019 - (2019)
Comparison of Common Multilingual Phone-set Based and LID-switched Monolingual Approaches for Multilingual Phone Recognition using Indian Languages Manjunath K., Srinivasa Raghavan K. , Rao K. S., Dinesh Babu J. , Ramasubramanyam V. By International Conference on Electronics, Computing and Communication Technologies (IEEE CONECCT) - (2019)
Manner of Articulation Based Split Lattices for Phoneme Recognition R., Rao K. S. By National Conference on Communications (NCC-2018) - (2018)
Audio Mining: Unsupervised Spoken Term Detection over an Audio Database Kishore Kumar R., Sarkar S. , Rangaswamy P. , Sreenivasa Rao K. By 7th International Conference on Advances in Computing, Communications and Informatics (ICACCI-2018) 514-518 (2018)
Robust Detection of Glottal Activity using Unwrapped Phase Electroglottographic Signal Mandal T., Sreenivasa Rao K. By International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2018) 5584-5588 (2018)
Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events Reddy M., Rao K. S., Das P. P. By INTERSPEECH - 2018 - (2018)
Discriminative sparse representation for speech mode classification Tripathi K., Sreenivasa Rao K. By 7th International Conference on Advances in Computing, Communications and Informatics (ICACCI-2018) 655-659 (2018)
DNN-based Bilingual (Telugu-Hindi) Polyglot Speech Synthesis Kiran Reddy M., Sreenivasa Rao K. By 7th International Conference on Advances in Computing, Communications and Informatics (ICACCI-2018) 1808-1811 (2018)
Note Transcription from Carnatic Music Suma S. M., Koolagudi S. G., Ramteke P. B., Sreenivasa Rao K. By International Conference on Advanced Computing Networking and Informatics (ICACNI-2018) - (2018)
One for the Road: Recommending Male Street Attire Banerjee D., Ganguly N., Sural S., Sreenivasa Rao K. By 26th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018) 571-582 (2018)
Classification of disorders in vocal folds using Electroglottographic Signal Mandal T., Sreenivasa Rao K. , Gupta S. K. By Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH - 2018 3002-3004 (2018)
Analysis of sparse representation based feature on speech mode classification Tripathi K., Rao K. S. By INTERSPEECH - 2018 - (2018)
Indian languages ASR: A multilingual phone recognition framework with IPA based common phone-set, predicted articulatory features and feature fusion K E M., Rao K. S., Jayagopi D. B., V R. By INTERSPEECH - 2018 - (2018)
Split Acoustic Modeling in Decoder for Phoneme Recognition Pradeep R., Sreenivasa Rao K. By 14th IEEE India Council International Conference (INDICON) - (2017)
Automatic Evaluation of Hindustani Learner's SARGAM Practice Reddy G., Rao K. S. By 25th European Signal Processing Conference (EUSIPCO-2017) - (2017)
Neutral to Joyous Happy Emotion Conversion Gurunath Reddy M., Sreenivasa Rao K. By 14th IEEE India Council International Conference (INDICON) - (2017)
Accurate Synchronization of Speech and EGG signal using Phase Information S., Rao K. S., Mandal T. By INTERSPEECH-2017 - (2017)
Predominant vocal melody extraction from enhanced partial harmonic content Reddy G., Rao K. S. By 25th European Signal Processing Conference (EUSIPCO-2017) - (2017)
Emotion-specific features for classifying emotions in story text Hari Krishna D. M., Sreenivasa Rao K. By National Conference on Communications (NCC-2016) (IEEE Explore) - (2016)
Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals M G., Rao K. S. By INTERSPEECH - (2016)
Speaker Identification in Emotional Environment using Trajectory-based Stochastic Feature Mapping Yadav J., Fahad M. , Kumar R. , Rao K. S. By International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2016) - (2016)
A Robust Non-Parametric and Filtering Based Approach for Glottal Closure Instant Detection Rangaswamy P., Reddy M. , Rao K. S., Dasgupta P. By INTERSPEECH 1795-1799 (2016)
Excitation modeling for HMM-based speech synthesis based on principal component analysis Narendra N. P., Kiran Reddy M. , Sreenivasa Rao K. By National Conference on Communications (NCC-2016) (IEEE Explore) - (2016)
Predominant melody extraction from vocal polyphonic music signal Gurunath Reddy M., Sreenivasa Rao K. By International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2016) (IEEE Explore) - (2016)
Deep Neural Networks for Kannada Phoneme recognition R P., Rao K. S. By IEEE 9th International Conference on Contemporary Computing (IC3) 67-72 (2016)
Designing Automatic Note Transcription System for Hindustani Classical Music Dhara P., Rengaswamy P. , Rao K. S. By IEEE 5th International Conference on Advances in Computing, Communications and Informatics (ICACCI) - (2016)
A deterministic plus noise model of excitation signal using principal component analysis for parametric speech synthesis Narendra N. P., Sreenivasa Rao K. By International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2016) (IEEE Explore) - (2016)
Language Identification Using PLDA Based on I-Vector in Noisy Environment Rai M., Yadav J. , Rao K. S., Kumar N. , Fahad M. By IEEE 5th International Conference on Advances in Computing, Communications and Informatics (ICACCI) - (2016)
Sentence Based Discourse Classification for Hindi Story Text-to-Speech (TTS) System Tripathi K., Sarkar P. , Rao K. S. By 13th International Conference on Natural Language Processing (ICON) - (2016)
Children story classification based on structure of the story Harikrishna D., Sreenivasa Rao K. By International Conference on Advances in Computing, Communications and Informatics (ICACCI-2015) - (2015)
Classification of children stories in Hindi using keywords and POS density Hari Krishna D., Sreenivasa Rao K. By International Conference on Computer, Communication and Control (IC4-2015) (IEEE Explore) - (2015)
Data-driven pause prediction for synthesis of storytelling style speech based on discourse modes Sarkar P., Sreenivasa Rao K. By International Conference on Electronics, Computing and Communication Technologies (CONECCT-2015) - (2015)
Improved recognition rate of language identification system in noisy environment Randheer B., Yadav J., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) (IEEE Explore) - (2015)
Modeling pauses for synthesis of storytelling style speech using unsupervised word features Sarkar P., Sreenivasa Rao K. By International Conference on Advances in Computing, Communications and Informatics (ICACCI-2015) - (2015)
Modification and incorporation of excitation source features for emotion conversion Arijul H., Sreenivasa Rao K. By International Conference on Computer, Communication and Control (IC4-2015) (IEEE Explore) - (2015)
Multi-stage children story speech synthesis for Hindi Hari Krishna D., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) (IEEE Explore) - (2015)
Neutral to happy emotion conversion by blending prosody and laughter Gurunath Reddy M., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) (IEEE Explore) - (2015)
Analysis and modeling pauses for synthesis of storytelling speech based on discourse modes Sarkar P., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) (IEEE Explore) - (2015)
Analysis and modification of spectral energy for neutral to sad emotion conversion Arijul H., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) - (2015)
Analysis of linear prediction residual signal, its magnitude and phase for language identification on NIST LRE (2003) database Dutta A. K., Sreenivasa Rao K. By International Conference on Computer, Communication and Control (IC4-2015) (IEEE Explore) - (2015)
Analysis of perturbation in pitch period and contact quotient for classifying age groups Mandal T., Sreenivasa Rao K. By International Conference on Computer, Communication and Control (IC4-2015) (IEEE Explore) - (2015)
Raga identification based on normalized note histogram features Pradeep R., Dhara P. , Sreenivasa Rao K. , Dasgupta P. By International Conference on Advances in Computing, Communications and Informatics (ICACCI-2015) - (2015)
Robust language identification using power normalized cepstral coefficients Dutta A. K., Sreenivasa Rao K. By International Conference on Contemporary Computing (IC3-2015) (IEEE Explore) - (2015)
Automatic detection of creaky voice using epoch parameters Narendra N., Sreenivasa Rao K. By Interspeech - 2015 - (2015)
Automatic pitch accent contour transcription for Indian languages Gurunath Reddy M., Procheta S. , Manjunath K. , Dutta A. K., Arijul H. , Sarkar P. , Sreenivasa Rao K. By International Conference on Computer, Communication and Control (IC4-2015) (IEEE Explore) - (2015)
Designing prosody rule-set for converting neutral TTS speech to storytelling style speech for Indian languages Bengali, Hindi and Telugu Sarkar P., Arijul H. , Dutta A. K., Gurunath Reddy M. , Hari Krishna D. M., Dhara P. , Verma R. , Narendra N. P., Sunil Kumar S. B., Yadav J. , Sreenivasa Rao K. By 7th International Conference on Contemporary Computing (IC3), (IEEE Explore) - (2014)
Duration Modeling by MultiModels based on Vowel Production characteristics Ramu Reddy V., Sreenivasa Rao K. , Sarkar P. By 11th International Conference on Natural Language Processing, (ICON 2014) - (2014)

Book Chapter/Section

Multilingual Phone Recognition: Comparison of Traditional versus Common Multilingual Phone-Set Approaches and Applications in Code-Switching Manjunath K., Srinivasa Raghavan M. , Sreenivasa Rao K. , Dinesh Babu J. , Ramasubramanyam V. By Advances in Signal Processing and Intelligent Recognition Systems - (2020)
Infant Cry Recognition using Source, System, Prosody and Epoch features Sreenivasa Rao K., Singh A. K., Mukhopadhyay J., Kumar S. A., Kumar S. S., Reddy R. V. By Acoustic analysis of Infant Cries, Toddler Vocalizations, and Young Adult Dysarthria, Speech Technology in Medicine and Health Care - (2019)
One for the road: Recommending male street attire Banerjee D., Ganguly N., Sural S., Rao K.S. By Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10939 LNAI 571-582 (2018)
Excitation modeling method based on inverse filtering for HMM-based speech synthesis Kiran Reddy M., Sreenivasa Rao K. By Machine Intelligence and Signal Processing - (2017)
Hybrid source modeling method utilizing optimal residual frames for HMM-based speech synthesis Narendra N. P., Sreenivasa Rao K. By Mining Intelligence and Knowledge Exploration - (2015)
Indexing and Retrieval of Speech Documents Piyush Kumar P. S., Manjunath K. E., Ravi Kiran R. , Jainath Y. , Sreenivasa Rao K. By Advanced Computing, Networking and Informatics - Volume-1 - (2014)
Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text Independent Speaker Verification Nirmalya S., Patil H. A., Das Mandal S. K., Sreenivasa Rao K. By Mining Intelligence and Knowledge Exploration (LNCS) 780-789 (2013)
Corpus Based Emotional Speech Synthesis in Hindi Ravi Kalyan B., Narendra N. P., Sreenivasa Rao K. By Pattern Recognition and Machine Intelligence (LNCS) 390-395 (2013)
Duration Modeling Using Multi-model Based on Positional Information Ramu Reddy V., Sreenivasa Rao K. By Pattern Recognition and Machine Intelligence (LNCS) 404-409 (2013)
Data-driven Phrase break prediction for Bengali Text-to-Speech system Krishnendu G., Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 118-129 (2012)
Emotion Recognition from Semi Natural Speech using Artificial Neural Networks and Excitation Source Features Shashidhar K. G., Swati D. , Anurag B. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 273-282 (2012)
Intensity modeling for Syllable based Text-to-Speech synthesis Ramu Reddy V., Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 106-117 (2012)
Real Life Emotion Classification from speech using Gaussian Mixture Models Shashidhar K. G., Anurag B. , Swati D. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 250-261 (2012)
Speaker Recognition in Emotional Environment Shashidhar K. G., Kritika S. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Eco-friendly computing and communication systems 117-124 (2012)
Spoken language identification using spectral features Shashidhar K. G., Deepika R. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing - (2012)
Vowel Recognition from Telephonic Speech using MFCC features and Gaussian Mixture Models Shashidhar K. G., Sujata Negi T. , Anurag B. , Manoj Kumar S. , Ramesh S R. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Eco-friendly computing and communication systems 170-177 (2012)
Effect of Noise on Recognition of Consonant-Vowel (CV) Units Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By Communications in Computer and Information Science (CCIS): Contemporary Computing 191-200 (2011)
Effect of Noise on Vowel Onset Point Detection Anil Kumar V., Jainath Y. , Sreenivasa Rao K. , Chakrabarti S. By Communications in Computer and Information Science (CCIS): Contemporary Computing 201-211 (2011)
Effect of speech coding on recognition of Consonant-Vowel (CV) units Anil Kumar V., Sreenivasa Rao K. , Chakrabarti S. By Communications in Computer and Information Science (CCIS): Contemporary Computing 284-294 (2011)
Robust speaker recognition in noisy environments: Using dynamics of speaker-specific prosody Shashidhar K. G., Sreenivasa Rao K. , Ramu Reddy V. , Anil Kumar V. , Chakrabarti S. By Forensic speaker recognition 183-204 (2011)
Segment Specific Concatenation Cost for Syllable Based Bengali TTS Narendra N. P., Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 371-382 (2011)
Text independent emotion recognition using spectral features Rahul C., Jainath Y. , Shashidhar K. G., Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 359-370 (2011)
Emotion Classification Based on Speaking Rate Shashidhar K. G., Sudhin R. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 316-327 (2010)
Exploring Speech Features for Classifying Emotions along Valence Dimension Shashidhar K. G., Sreenivasa Rao K. By Pattern Recognition and Machine Intelligence (LNCS) 537-542 (2009)
IITKGP-SESC: Speech database for emotion analysis Shashidhar K. G., Anil Kumar V. , Chakrabarti S. , Sudhamay M. , Sreenivasa Rao K. By Communications in Computer and Information Science (CCIS): Contemporary Computing 485-492 (2009)
Unit selection using linguistic, prosodic and spectral distance for developing text-to-speech system in Hindi Sreenivasa Rao K., Shashidhar K. G., Sudhamay M. , Amol T. By Pattern Recognition and Machine Intelligence (LNCS) 531-536 (2009)
Modeling supra-segmental features of syllables using neural networks Sreenivasa Rao K. By Biomedical signal processing using neural networks 71-95 (2008)
Voice Transformation by Mapping the Features at Syllable Level Sreenivasa Rao K., Laskar R. H., Shashidhar K. G. By Pattern Recognition and Machine Intelligence (LNCS) 479-486 (2007)
Two-stage duration model for Indian languages using neural networks Sreenivasa Rao K., Yegnanarayana B. By Lecture Notes in Computer Science : Neural Information Processing 1179-1185 (2004)

Book

Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis Sreenivasa Rao K., Narendra N. P. By 1-136 (2019)
Speech Recognition using Articulatory and Excitation Source Features Sreenivasa Rao K., Manjunath K. E. By 1-92 (2017)
Language Identification Using Excitation Source Features Sreenivasa Rao K., Dipanjan N. By 1-119 (2015)
Language Identification using Spectral and Prosodic Features Sreenivasa Rao K., Ramu Reddy V. , Maity S. By 1-98 (2015)
Robust Speaker Recognition in Noisy Environments Sreenivasa Rao K., Sarkar S. By 1-139 (2014)
Speech Processing in Mobile Environments Sreenivasa Rao K., Anil Kumar V. By 1-121 (2014)
Emotion Recognition using Speech Features Sreenivasa Rao K., Shashidhar G K. By 1-124 (2013)
Robust Emotion Recognition using Spectral and Prosodic Features Sreenivasa Rao K., Shashidhar K. G. By 1-118 (2013)
Predicting Prosody from Text for Text-to-Speech Synthesis Sreenivasa Rao K. By 1-130 (2012)

Workshop

Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning Reddy M., Mandal T. , Rao K. S. By Machine Learning for Health (ML4H) Workshop - (2018)

Computer Science and Engineering

Faculty

Ph. D. Students

Abhijit Debnath

Annepu Sai Sriharsha

Aravinda Reddy P N

Arup Kumar Dutta

Haque Arijul

Priya Dharshini G

Saikat Biswas

Soumen Paul

Soumya Majumdar

Sudhakar P

Y Madhu Keerthana