Show simple item record

dc.contributor.author Nallathamby, JD
dc.contributor.author Kariyawasam, KKR
dc.contributor.author Pullaperuma, HD
dc.contributor.author Vithana, DC
dc.contributor.author Jayasena, S
dc.contributor.editor Weerawardhana, S
dc.contributor.editor Madusanka, A
dc.contributor.editor Dilrukshi, T
dc.contributor.editor Aravinda, H
dc.date.accessioned 2022-12-05T07:31:43Z
dc.date.available 2022-12-05T07:31:43Z
dc.date.issued 2011-11
dc.identifier.citation ****** en_US
dc.identifier.uri http://dl.lib.uom.lk/handle/123/19670
dc.description.abstract Although there are widely used Interactive Voice Response (IVR) systems in many languages today, there is no Sinhala language IVR system yet. This paper talks about an approach taken in developing a complete Sinhala IVR system. It talks about the research carried out in this area, the process taken, the overall design and implementation aspects and the future work that can be carried out in this area. deBas IVR is a complete Sinhala IVR with automatic speech recognition (ASR) and text-to-speech (TTS) synthesis modules that work in compliance with Media Resource Control Protocol (MRCP). In the ASR component, training the acoustic model is done with SphinxTrain, and decoding with PocketSphinx, which are based on Hidden Markov Models (HMM). In the TTS component, AMoRA Sinhala TTS knowledge base is used, which uses Festival speech synthesis engine and a female diphonic voice, built using Festvox voice building tools. Asterisk is used as the IVR gateway and dial-plan interpreter. MRCPv2 protocol has been followed in developing the speech resources, which uses Session Initiation Protocol (SIP) for establishing controlled connections to external media streaming devices and Real-time Transport Protocol (RTP) for media delivery. The language model of the ASR component has been restricted to digits from 0-9 that are commonly used in IVR systems and the set of words used for our demo application. The word-error-rate and the sentence-error-rate of the ASR component are reported to be 31.4% and 54% respectively, as observed in our experiments. In addition to these, we also introduce a new intonation model that can be applied to any existing Sinhala diphonic voices. en_US
dc.language.iso en en_US
dc.publisher Computer Science & Engineering Society c/o Department of Computer Science and Engineering, University of Moratuwa. en_US
dc.subject Sinhala en_US
dc.subject Interactive Voice Response en_US
dc.subject Automatic speech recognition en_US
dc.subject Text-to-speech en_US
dc.subject Media resource control protocol en_US
dc.subject Hidden markov model en_US
dc.subject Word-error-rate en_US
dc.subject Sentence-error-rate en_US
dc.title Debas – a sinhala interactive voice response system en_US
dc.type Conference-Full-text en_US
dc.identifier.faculty Engineering en_US
dc.identifier.department Department of Computer Science and Engineering en_US
dc.identifier.year 2011 en_US
dc.identifier.conference CS & ES Conference 2011 en_US
dc.identifier.place Moratuwa. Sri Lanka en_US
dc.identifier.proceeding Proceedings of the CS & ES Conference 2011 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record