I'm trying to find out how I may punctuate in cloud Cloud Speech-to-Text, not in English, but another language. This is a basic requirement for my use case. I'm sure google has thought of it.
Has anybody experience of this?
Currently, automatic punctuation is only available for US English only (en-US). It is likely that this feature will be available for other languages at some point, but I would recommend you to ask GCP about your particular language(s) by filling a Feature Request using this form.
Related
I have a text in japanese that I'm turning into an mp3 with the Google Cloud Text to Speech functionality.
I also want to have word timestamps for the mp3 that gets returned by Google.
Google Speech to Text offers this functionality but when I submit the files I get from TTS to STT, the result is not always good.
What is the best way to also get word timestamps for the TTS mp3?
Google Cloud Speech-to-Text it's a ML based service, so it's expected that the results are not always as "good" as you may expect them, it has it's limitations.
What I could suggest is to take a look at their relevant documentation about this topic like the best practices, the guide and the basics page that talk about it. Additionally, you could take a look at the issues within their issue tracker platform, like for example this issue for additional information on it and even if you find a reproducible issue within the service you can publish it there, so their team can be aware of it.
The new Google Cloud Speech-to-Text API is said to be the best in the market. Does it provide speaker annotation (or other speaker information) at all? Like who says what at what time. I can't seem to find anywhere on its documentation or examples provided that mentions it.
Both IBM and Amazon do it.
I'd be appreciated if anyone can let me know, thanks!
Individual speaker recognition is not currently a feature provided by the API. It’s noted in the issue tracker [1] as a feature request, however there’s no ETA for it currently. I’d recommend starring the issue to receive future comments and updates regarding it.
[1] https://issuetracker.google.com/35901846
Programming a "question answering engine" based on an internal or cloud DB, is a very complicated thing to do, especially when the questions are asked in natural language.
However, Google does it prety well recently. Is there a way to use Googles high performance for that? Does Google give that service to use their platform for enterprise QA?
Thank you
Gal
If you want to provide answers in natural language to questions in natural language, you should check out the Wolfram Alpha API's or the WebKnox API.
I'm trying to improve the semantic web services' discovery to satisfy better the user. For that, I choose to use OWL-S. So I created an ontology using Protégé then I generated semantic web services using OWL-S editor within Eclipse (these services have been in part generated automatically since they are already developed in Java). What I can't do now is the execution of these services: I don't know how to proceed. I need your help please, I didn't find good documentation or tutorials about the life cycle of OWL-S services. Could you please help me and suggest me some useful tools, tutorials..?
PS: do you think that OWL-S is still used or do you suggest something else? What is the most common way to develop semantic web services nowadays?
Thank you!
OWl-S is a W3C consortium standard, therefore documentation is available regarding that. Check the below links
http://www.w3.org/Submission/OWL-S/
http://www.daml.org/services/owl-s/
http://www.ai.sri.com/daml/services/owl-s/examples.html
Web services and web APIs have managed to increase the accessibility of the information stored and catalogued on the internet. They have also opened up a vast array of enterprise power functionality for smaller thin client applications.
By taping into these services developers can provide functionality that would have taken them months perhaps years to set up. They can combine them into single applications that make life generally easier for its users.
Whether displaying information about the music being played, finding items of interest in the locale of the user or just simply tweeting and blogging from the same application - the possibilities are growing everyday.
I want to know about the most interesting or useful services that are out there, especially ones that most of us may not have heard about yet. Do you maintain an API or service? or do you have a clever mash up that provides even more benefits than the originals?
YQL - Yahoo provide a tool that lets you query many different API's across the web, even for sites that don't provide an API as such.
From the site:
The Yahoo! Query Language is an
expressive SQL-like language that lets
you query, filter, and join data
across Web services.
...
With YQL, developers can access and
shape data across the Internet through
one simple language, eliminating the
need to learn how to call different
APIs.
The World Bank API is pretty cool. Google uses it in search results. My favourite implementations are the cartograms at worldmapper.
(source: worldmapper.org)
It's very niche, but I happen to think the OpenCongress API is amazing.
Less niche: Google Translate has an API which will guess the language of something. You'd be AMAZED how frequently this comes in handy (even though it's not as tweakable as you'd like and is not trained on small samples).
I was just about to have a stab at using the SoundCloud API
I know many people who already use for sharing their musical masterpieces and its a pretty good site. Hopefully the api will be as well!
I like the RESTful API for weather.com. It's free and very useful for the new age of location-aware apps: https://registration.weather.com/ursa/xmloap/step1
It does require registration, but they don't spam you or anything - it's just to provide you a key to use the API.
Ah yes - here's another one I've been meaning to check out but haven't tried yet
The BBC offer a bunch of apis/feeds that look very promising
http://ideas.welcomebackstage.com/data
They include apis for accessing schedule data for both TV and Radio listings along with all kinds of news searches. It even looks like they'll be offering some sort of geo-location service soon so it will be interesting to see what that has to offer
Another interesting one for liberal brits! ;)
The Guardian news paper have their own api
http://www.guardian.co.uk/open-platform
MuiscBrainz
Excellent service for music mashups.
Not so many knows that Last.FM initial database was scraped from this service.
The United States Postal Service offers a web service that does address standardization. Quite useful in reducing clutter and cleaning data before it gets put into your database.