
Common Questions

This topic provides answers to questions that are commonly asked about Amazon Polly.

General Questions

Q: Can I save the synthesized speech?

When you use the SynthesizeSpeech operation, you can save the output of the synthesis for use on your own system. You can also encrypt the resulting file with any encryption key and store it in Amazon Simple Storage Service (Amazon S3) or any other secure storage. The SynthesizeSpeech call is stateless and is not associated with a customer identity, so you can't retrieve the output from Amazon Polly later.

When you use the StartSpeechSynthesisTask operation (asynchronous synthesis), the output of the synthesis is automatically stored in an Amazon S3 bucket. You can then retrieve it as needed. For more information, see Creating Long Audio Files.
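
As a minimal sketch of the SynthesizeSpeech case, the following Python saves the returned audio stream to a local file. It assumes the AWS SDK for Python (boto3) is installed and that credentials and a Region are configured. The helper names, the voice, and the output path are illustrative and do not come from this guide.

```python
import shutil
from typing import BinaryIO


def save_audio_stream(stream: BinaryIO, path: str) -> int:
    """Copy a file-like audio stream to disk and return the bytes written."""
    with open(path, "wb") as out:
        shutil.copyfileobj(stream, out)
        return out.tell()


def synthesize_to_file(text: str, path: str, voice_id: str = "Joanna") -> int:
    """Call SynthesizeSpeech and save the returned audio locally.

    Assumes boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported here so save_audio_stream works without the SDK

    polly = boto3.client("polly")
    response = polly.synthesize_speech(
        Text=text, OutputFormat="mp3", VoiceId=voice_id
    )
    # AudioStream is a streaming body and can be read only once,
    # so it is written to disk immediately.
    return save_audio_stream(response["AudioStream"], path)
```

Because the call is stateless, the saved file is the only copy. If it is lost, the text has to be synthesized again.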

Content Rendering

Q: I would like to use the same voice in two different languages. Are there any bilingual voices?

Currently, only one voice is bilingual: Aditi (Hindi and Indian English). Using a bilingual voice enables you to synthesize speech that includes either language by itself, or both within a single text. With Aditi, Amazon Polly can also synthesize speech from text that contains both Devanagari (Hindi script) and Romanagari (Latin script).

We are constantly working to improve Amazon Polly's language options, including increasing the number of bilingual voices. For more information, see Bilingual Voices.

Q: What audio format details apply when pcm is selected as the output format?

When pcm is used, the content returned is audio/pcm in a signed 16-bit, 1 channel (mono), little-endian format.
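
Raw PCM has no header, so most players need it wrapped in a container before they can open it. The sketch below uses Python's standard `wave` module to write the bytes into a WAV file with the parameters described above. The function name is illustrative, and the default sample rate of 16000 Hz is an assumption; it must match the sample rate that was actually requested.

```python
import wave


def pcm_to_wav(pcm_bytes: bytes, wav_path: str, sample_rate: int = 16000) -> None:
    """Wrap raw signed 16-bit little-endian mono PCM in a WAV container."""
    with wave.open(wav_path, "wb") as wav:
        wav.setnchannels(1)            # 1 channel (mono)
        wav.setsampwidth(2)            # 16 bits = 2 bytes per sample
        wav.setframerate(sample_rate)  # assumed; must match the requested rate
        wav.writeframes(pcm_bytes)
```

WAV stores 16-bit samples as signed little-endian values, so the bytes can be written unchanged.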

Q: Some of my text is coming out with the stress on the wrong syllable when it is spoken by Amazon Polly. I've even tried using an acute accent (U+0301) to mark the stress, but it is still on the wrong syllable. How can I fix this?

Amazon Polly doesn't currently recognize an acute accent (U+0301) as indicating syllable stress in a word. However, there are two ways you can change the stress in a word. You can use IPA phonemes with SSML tags to alter the pronunciation of the word. For more information, see SSML Tags Supported by Amazon Polly. In some languages, you can also use an apostrophe immediately after the syllable to indicate a change in stress. For instance, in Russian, the words страны́ and стра́ны have different stresses (marked here with an acute accent). However, because of the identical spelling, Amazon Polly will pronounce them both with the stress on the final syllable, according to standard language usage. You can use an apostrophe to mark the alternatively stressed syllable, as in стран'ы, and Amazon Polly will stress the correct syllable.
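
The SSML approach can be sketched as follows. The helper builds a `<phoneme>` element with `alphabet="ipa"`, where the IPA primary-stress mark (ˈ) precedes the stressed syllable. The function name and the example transcription are illustrative assumptions; check the transcription against the phoneme set for your voice.

```python
from xml.sax.saxutils import escape, quoteattr


def ipa_phoneme_ssml(word: str, ipa: str) -> str:
    """Return an SSML document that pronounces `word` using an IPA string."""
    return (
        "<speak>"
        f'<phoneme alphabet="ipa" ph={quoteattr(ipa)}>{escape(word)}</phoneme>'
        "</speak>"
    )


# Example: stress on the second syllable of "pecan" (illustrative transcription).
ssml = ipa_phoneme_ssml("pecan", "pɪˈkɑːn")
```

The resulting string is meant to be submitted as SSML input rather than plain text, for example with `TextType="ssml"` in a SynthesizeSpeech request.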

Q: When I use bullet points in my text, Amazon Polly doesn't render them correctly. It says "minus" every time it encounters one. What do I do?

If you use "-" (a hyphen) as a substitute for a bullet point, in some languages, Amazon Polly renders it as a minus sign. If you want to use hyphens as substitutes for a bullet point, you can do so with a lexicon entry. For more information, see Managing Lexicons.

Q: I use the "/" (forward slash) symbol frequently in my text, especially when saying "and/or" and "yes/no." How does Amazon Polly render this?

In English, Amazon Polly renders "and/or" in speech as "and or." Currently, this rule isn't available in other languages. In languages other than English, Amazon Polly renders "yes/no" as "yes slash no." If you want to change this behavior, you can use a lexicon entry. For more information, see Managing Lexicons.
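
A lexicon for this kind of substitution is a Pronunciation Lexicon Specification (PLS) document. The sketch below generates one that maps written forms to spoken aliases. The function name and the example entry are hypothetical. Whether a grapheme containing "/" or "-" is matched depends on how Amazon Polly tokenizes the text for a given language, so treat the entry as an assumption to test rather than a confirmed fix.

```python
from typing import Mapping
from xml.sax.saxutils import escape, quoteattr

PLS_NS = "http://www.w3.org/2005/01/pronunciation-lexicon"


def build_alias_lexicon(aliases: Mapping[str, str], lang: str = "en-US") -> str:
    """Build a PLS lexicon that maps each grapheme to a spoken alias."""
    lexemes = "".join(
        f"  <lexeme><grapheme>{escape(g)}</grapheme>"
        f"<alias>{escape(a)}</alias></lexeme>\n"
        for g, a in aliases.items()
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<lexicon version="1.0" xmlns="{PLS_NS}" '
        f'alphabet="ipa" xml:lang={quoteattr(lang)}>\n'
        f"{lexemes}"
        "</lexicon>\n"
    )


# Hypothetical entry: speak "yes/no" as "yes or no".
lexicon_xml = build_alias_lexicon({"yes/no": "yes or no"})
```

The generated document can be uploaded with the PutLexicon operation and then named in the `LexiconNames` parameter of a synthesis request. A lexicon applies only to voices whose language matches its `xml:lang` value.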

Q: When I use text from an existing source to synthesize speech using the AWS CLI on a Linux machine, some UTF-8 characters do not seem to work with Amazon Polly, even though the same characters seem to work properly in the console. What is happening?

This is caused by how the Unix shell handles Unicode and isn't an Amazon Polly-specific problem. Two options are available: you can find the problem characters and replace them in the input text, or you can use an alternate means of accessing Amazon Polly that does not experience this issue, such as the PHP interface. This is a known issue that we are working to address, and only a few uncommon Unicode characters are affected.
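
For the first option, a small script can help locate candidate characters before the text is passed to the CLI. This is a sketch only: this topic does not say which characters are affected, so the code simply lists every non-ASCII character with its code point and Unicode name, and offers a helper for substituting the ones you choose. Both function names are illustrative.

```python
import unicodedata
from typing import List, Mapping, Tuple


def find_non_ascii(text: str) -> List[Tuple[int, str, str]]:
    """Return (index, code point, Unicode name) for each non-ASCII character."""
    return [
        (i, f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for i, ch in enumerate(text)
        if ord(ch) > 127
    ]


def replace_chars(text: str, replacements: Mapping[str, str]) -> str:
    """Replace single characters according to a caller-supplied mapping."""
    return text.translate(str.maketrans(dict(replacements)))
```

For example, `find_non_ascii` can be run over the input file first, and the characters that turn out to cause trouble can then be mapped to plain equivalents with `replace_chars`.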

Q: When I try to synthesize text from a source containing International Phonetic Alphabet (IPA) symbols, Amazon Polly doesn't recognize them and even tries to pronounce some of them. How do I fix this?

Amazon Polly does not recognize IPA symbols unless SSML (Speech Synthesis Markup Language) is used to mark them up. However, because a short run of IPA symbols usually serves as a pronunciation guide for a reader, in many cases you can safely delete it from the input text. You can also use a lexicon to change the way these symbols are rendered by Amazon Polly. For more information, see Generating Speech from SSML Documents and Managing Lexicons.
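
The deletion approach can be sketched with a regular expression. The heuristic below is an assumption, not an Amazon Polly feature: it removes spans written between slashes or square brackets, but only if they contain at least one character from the Unicode IPA Extensions or Spacing Modifier Letters blocks. Transcriptions made up entirely of ordinary Latin letters are not detected.

```python
import re

# Characters from the IPA Extensions and Spacing Modifier Letters blocks.
_IPA_CHARS = "\u0250-\u02ff"

# A /.../ or [...] span on one line that contains at least one IPA character,
# together with any whitespace immediately before it.
_IPA_SPAN = re.compile(
    rf"\s*(?:/[^/\n]*[{_IPA_CHARS}][^/\n]*/|\[[^\]\n]*[{_IPA_CHARS}][^\]\n]*\])"
)


def strip_ipa_guides(text: str) -> str:
    """Remove slash- or bracket-delimited pronunciation guides from text."""
    return _IPA_SPAN.sub("", text)
```

Text such as "and/or" or a bracketed citation number is left alone because it contains no characters from those blocks. Review the output before synthesis, since the pattern can still misfire on unusual input.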

Data Security and Confidentiality

Q: Can I opt out of request logging with request APIs?

Yes, you can request that by contacting AWS Support.

Q: Can I choose to mask certain data fields so that they are not stored? (For instance, if I convert text that contains sensitive data but don't want it stored on AWS systems, can I mask it?)

No. Amazon Polly doesn’t currently support this functionality.

Q: The text I want to use with Amazon Polly is confidential. How is my data protected?

All text submissions are protected by Secure Sockets Layer (SSL) while in transit, and are stored using RSA encryption. We keep the service logs and text separate, so that the content can't be linked with the customer ID. As a result, Amazon Polly does not associate text submissions with customer identity.

Q: How long is data retained?

Amazon Polly retains data for 14 days. After that, it's automatically deleted from our system.

Q: Can I request that data be wiped earlier?

Yes, you can request that by contacting AWS Support.