VTML (VoiceText Markup Language) – This Powerful hidden gem in Text to Speech helps you control the generated speech by adding pause, changing the pitch, and other cool voice features.
This post on customizing text to speech Pronunciations received many comments requesting a way to control the generated speech with html like tags.
The interesting thing is that this feature is already there in Captivate! Even more interesting is the fact that it is so simple to use. All you need to do is insert the appropriate tags into the slide notes. You can type in (or copy paste) the tags just like you do any other text.
Just before making this post, I quickly tried few of them:
1. Hello I am here <vtml_pause time=”1000″/> for a break: Introduces a pause of 1 sec in between.
2. <vtml_speed value=”50″> This is my text </vtml_speed>: Helps you control the speed of the text
There are many more similar useful tags. The VTML tags are listed in Appendix C of the document here. It has examples as well on the usage.
Is there a way to display captions for audio on the slide as well as for the video? For example, I have audio for a 30 second slide that I added captions for. However, say, at 15 seconds I show a video for the remaining 15 seconds. I want to show the slide audio captions for 15 seconds and I thought the video would show its own captions that I added from then on – but the first 15 seconds are blank until it gets to the video captions.
Hi Marilyn, can you please share the project with us at CaptivateHelp@adobe.com to investigate the issue.
I have Captivate 5 as a standalone program (not Adobe Elearning Suite). I have the voices installed and didn’t have any problems changing the text to speech to the Kate voice. But it still sounds all wrong…I have no idea how to change the pitch and emphasis on words. The files that might be for that don’t open. ????
The link has the word neospeech attached to the end of it. You need to cut that off. https://ondemand.neospeech.com/vt_eng-Engine-VTML-v3.9.0-3.pdf
Your info is very valuable for me. Thank you very much. Keep writting more!
Your info is very valuable for me. Thank you very much. Keep writting more!
NIce piece of info. Really great! So, go on composing these fine purchase acomplia or purchase xenical or purchase adipex or purchase xanax or buspar articles
NIce piece of info. Really great! So, go on composing these fine purchase acomplia or purchase xenical or purchase adipex or purchase xanax or buspar articles
NIce piece of info. Really great! So, go on composing these fine purchase acomplia or purchase xenical or purchase adipex or purchase xanax or buspar articles
Leive,I can understand your frustration, but the VTML standard was created by the company that created the particular voices/TS Engine used (their engineers were also involved in the SSML standard, go figure), and I’ve included a link to the US version of the VoiceText User’s Guide. https://ondemand.neospeech.com/vt_eng-Engine-VTML-v3.9.0-3.pdfNeospeech also has a web-based service that DOES accept SSML at http://ws.neospeech.com, but for SSML support you do have to pay – but they DO have a free account for folks that have limited TTS conversion needs.Unfortunately for us, there are not very many good quality TTS voices that are within the range of the average developers pocketbook. Neospeech voices were probably choosen for their willingness to sub-license their voices per application – that’s why it works within captivate only (or so I assume).Personally, Loquendo voices are by far the best I’ve heard, and they also have there own emotion mark-up langauge which makes their voices appear very natural – it’s almost spooky. But, their licensing is also very expensive, and unless you plan on creating a lot of e-learning it’s a bit pricey. They also sub-license their voices to individual applications, such as Character Builder (Flash-based character scripting/animation program) http://www.mediasemantics.com
I think it is a shame that a company like Adobe selected a proprietary control method like vtml??? The industry already has a well established standard… SSML (speech synthesis markup language). Why tie customers to a proprietary non-standards based method? Who are the speech engines that support this besides your Korean partner? Will my high-quality Cepstral voices take advantage of this?Even more vexing…vtml is a copy of SSML. Brilliant – take an open standard and slightly change all the descriptors so they don’t work with other engines. How difficult would it have been to support the proper SSML method that ALL speech engines conform to? I’m going to take a second time-out now.
You must be logged in to post a comment.