Open Speech and Language Resources

Phone: 425 247 4129
(Daniel Povey)

Sinhala TTS

Identifier: SLR30

Summary: Sinhalese multi-speaker TTS corpora

Category: Speech

License: Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

si_lk.tar [259M]   (Audio files )
si_lk.lines.txt [197K]   ( Transcription of the audio )
README.txt [479 bytes]   (Additional readme )
LICENSE.txt [20K]   (Licensing information )

About this resource:

This data set contains multi-speaker high quality transcribed audio data for Sinhalese. The data set consists of wave files, and a TSV file. The file si_lk.lines.txt contains a FileID, which in tern contains the UserID and the Transcription of audio in the file.

The data set has been manually quality checked, but there might still be errors.

This dataset was collected by Google in Sri Lanka.

See LICENSE.txt file for license information.

Copyright 2015, 2016 Google, Inc.