
     Research Journal of Applied Sciences, Engineering and Technology


Statistical Parametric Speech Synthesis of Malay Language using Found Training Data

Lau Chee Yong and Tan Tian Swee
Medical Implant Technology Group (MediTEG), Cardiovascular Engineering Center, Material Manufacturing Research Alliance (MMRA), Faculty of Biosciences and Medical Engineering (FBME), Universiti Teknologi Malaysia, Malaysia
Research Journal of Applied Sciences, Engineering and Technology  2014  24:5143-5147
http://dx.doi.org/10.19026/rjaset.7.910  |  © The Author(s) 2014
Received: January 28, 2014  |  Accepted: February 10, 2014  |  Published: June 25, 2014

Abstract

Preparing training data for statistical parametric speech synthesis can be complicated. To ensure good-quality synthetic speech, high-quality, low-noise recordings must be prepared. Preparing the recording script is also demanding, spanning word collection, word selection and sentence design; it requires considerable human effort and time. In this study, we used freely available sources of recordings and text, such as audiobooks and clean speech, as the training data. Some of these free sources provide high-quality, low-noise recordings that are suitable as training data. A statistical parametric speech synthesis method based on the Hidden Markov Model (HMM) was used. To test the reliability of the synthetic speech, perceptual tests were conducted. The naturalness test gave fairly reasonable results, and the intelligibility test gave encouraging results: the Word Error Rate (WER) was below 15% for normal synthetic sentences and around 30% on average for Semantically Unpredictable Sentences (SUS). In short, using free, ready-made sources as training data can ease the preparation of training data while still yielding encouraging synthesis results.
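The abstract does not say how the WER was computed. As an illustration only, the sketch below uses the common definition: the word-level edit distance (substitutions, deletions and insertions) between the sentence played to the listener and the listener's transcription, divided by the number of words in the reference. The function name and the Malay example sentences are assumptions made up for this sketch; they are not taken from the study.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumed definition: WER = (S + D + I) / N, where N is the number
    of words in the reference sentence.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of an extra word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical listening-test item: one of five words is misheard.
played = "saya suka makan nasi goreng"
transcribed = "saya suka makan roti goreng"
print(word_error_rate(played, transcribed))
```

Under this definition, a WER below 15% corresponds to fewer than three word errors for every twenty reference words.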

Keywords:

Hidden Markov Model (HMM), letter-to-sound rule, statistical parametric speech synthesis



Competing interests

The authors have no competing interests.

Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Copyright

© The Author(s) 2014

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Copyright © 2025. MAXWELL Scientific Publication Corp., All rights reserved