Try our new research platform with insights from 80,000+ expert users

Amazon Polly vs Google Cloud Text-to-Speech comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 8, 2024
 

Categories and Ranking

Amazon Polly
Ranking in Text-To-Speech Services
1st
Average Rating
7.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
Google Cloud Text-to-Speech
Ranking in Text-To-Speech Services
2nd
Average Rating
8.6
Number of Reviews
2
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Text-To-Speech Services category, the mindshare of Amazon Polly is 40.2%, up from 38.5% compared to the previous year. The mindshare of Google Cloud Text-to-Speech is 33.7%, up from 32.2% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Text-To-Speech Services
 

Featured Reviews

SB
Helps modify text-to-speech by controlling speech patterns, but putting more tags gives an error
We can play text-to-speech when we need to play some prompts inside Amazon Connect. Any number we provide will be played as digits when we play text-to-speech. By using SSML tags in a conversational manner in Amazon Polly, we can convert those digits into numbers. We can use the SSML tags in Amazon Polly to modify text-to-speech by controlling speech patterns and behaviour.
Rafael Keller - PeerSpot reviewer
A stable solution that is used to vocalize the text written on the API to the client
We use Google Cloud Text-to-Speech when we have an IVR and need to say something written on the API through the IVR. For example, if I'm on the IVR and need to say something to the customer that is written on the API, I need to vocalize this text. I use Google Cloud Text-to-Speech to take this text…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We can use the SSML tags in Amazon Polly to modify text-to-speech by controlling speech patterns and behaviour."
"Amazon Polly is useful because it's helpful to hear the words on top of it when I can't take in information in a general way. Sometimes, it's very taxing if I'm trying to read cases. They have the neural voices, and they're so realistic. You don't even know that a person is not reading to you, making things much better. I know that they do have the ability to provide you with your own lexicon that's personal to you. I like that you can adjust the pitch and the speed of the voice because some people talk way too fast. Or if you're reading, I read slowly, so that's always helpful. One of the functions that I find helpful is that when reading material on the web, it's like it has its own browser. You go to the URL, and you don't have to read the whole thing, and you can stick the cursor on the place where you want it to start. Then if you want it to skip over something, you put it somewhere else, and that's ideal for reading case law because you skip around a lot. You don't really read it from start to finish. It helps if someone's going to read all those citations because they definitely want to be able to skip that."
"It's not complex to set up."
"Precision is the most valuable feature of Google Cloud Text-to-Speech because the text is perfectly voiced."
 

Cons

"The price could be better. I wish it weren't so expensive to do because it's really cool. I would love to see them have lexicon packages of them like, this is for lawyers, this is for accountants, and it's going to have a lot of things in it. I also think they could do a better job at showing use cases other than telemarketing or contact center stuff like bots that are very commercial. I know that's where the money is, but it's such a huge hole that's missing for people with disabilities that are even worse than mine. Some people cannot see or hear at all, but they're not just cognitively impaired."
"When you put more tags inside Amazon Polly to define break time and instruct the speech to be conversational, sometimes it gives you an error."
"We had some problems with Dialogflow."
"Google Cloud Text-to-Speech has just one female voice and one male voice in Brazil, while it has a lot of voices in other countries."
 

Pricing and Cost Advice

"The solution has a pay-as-you-go pricing model, where you must pay according to your usage."
"The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more than one, it's so expensive to have to listen to a bunch of cases on my phone and have the neural voice read to me. It really wouldn't be worth it. It'd be paying probably more than what I make in the case. Right now, I'm on the free tier, and I think the number of minutes that you get is reasonable as long as you're not doing this all the time and you're using it judiciously. I have some credits that I think I can use, but I don't know how fast they'll go through."
"I rate Google Cloud Text-to-Speech three out of ten for pricing."
report
Use our free recommendation engine to learn which Text-To-Speech Services solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
16%
Financial Services Firm
10%
Manufacturing Company
8%
University
7%
Computer Software Company
14%
Financial Services Firm
11%
University
8%
Manufacturing Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Amazon Polly?
The solution has a pay-as-you-go pricing model, where you must pay according to your usage.
What needs improvement with Amazon Polly?
When you put more tags inside Amazon Polly to define break time and instruct the speech to be conversational, sometimes it gives you an error.
What is your primary use case for Amazon Polly?
We use Amazon Polly to check the SSML tags to see how they play the prompt.
What do you like most about Google Cloud Text-to-Speech?
Precision is the most valuable feature of Google Cloud Text-to-Speech because the text is perfectly voiced.
What needs improvement with Google Cloud Text-to-Speech?
Google Cloud Text-to-Speech has just one female voice and one male voice in Brazil, while it has a lot of voices in other countries.
 

Overview

 

Sample Customers

GoAnimate, Duolingo, Bandwidth
Home Depot, Paypal, Target, HSBC, McKesson
Find out what your peers are saying about Amazon Polly vs. Google Cloud Text-to-Speech and other solutions. Updated: October 2024.
816,406 professionals have used our research since 2012.