I would like to use dutch english russian and portugese in Pandora chatbots during the same session - aiml

We use pandorabots in Dutch and English and would like to use Portugese & Ukranian in same chatbot.

Related

Pepper - QiChat: pronunciation of a word in a different language

I'd doing an italian dialog on Pepper with qichat language
I have to change the pronunce of a single word (i.e. Engineering) to English.
What is the qichat instruction that I have to write in the dialog box?
Thanks,
Debora
I think you have two options there:
Try different spellings (e.g. "Ingeniireng", ...) until Peppers pronunciation satisfies your expectations.
Inserting phonetic text doc

Translation of Accelerators into Chinese

When translating an app (MFC in this case) into Chinese, what do I do with Accelerators?
Is F1 still used for Help?
What about things like CTRL-A? Will the translator know what to do those?
Any advice or links appreciated.
From Wikipedia:
Chinese keyboards are usually in US layout with/without Chinese input
method labels printed on keys.
They also have F-Keys, so F1 for help is fine.
Don't translate accelerators, keep them in latin alphabet. Ampersand accelerators within text are usually moved to the right of the text, changed to uppercase and wrapped in parenthesis. For example "E&nter the text:" becomes "输入文字(&N):". There is no whitespace between Chinese text and the first parenthesis.
This is how the Windows "run" dialog looks in Chinese (simplified):
And this is Notepad's menu:
You can see for yourself by installing a Chinese language pack and changing the primary display language to Chinese via the Windows settings app.

NLTK synset with other languages

Right now i'm trying to compare words from two different files, one english, one chinese. I have to identify if any of the english words are related to the chinese words and if they are, are they equal or is one a hypernym of the other. I can use synsets for english but what can i do about the chinese words?
It looks like there is a Chinese (cmn) WordNet available from a university in Taiwan: http://casta-net.jp/~kuribayashi/multi/ . If this WordNet has the same format as the English WordNet, then you can probably use the WordNetCorpusReader (http://nltk.googlecode.com/svn/trunk/doc/api/nltk.corpus.reader.wordnet-pysrc.html#WordNetCorpusReader) in NLTK to import the Mandarin data. I don't know how you're doing your alignments or translations between the two datasets, but assuming you can map English to Chinese, this should help you figure out how the relation between two English words compares to the relation between two Mandarin words. Note that if your data uses the simplified script, you may also need to convert to the traditional script before using this cmn WordNet.

How to get same result in C or C++ same as toLowerCase in Java or string.lower() in Python?

I need a C or C++ function(or library) which works like String.toLowerCase() in Java.
I know that I can use tolower() for English, but what I need is a function (or library) can cover global language. (Actually, It needs to cover 9 languages listed below.)
language list
Dutch
English
French
German
Italian
Portuguese
Russian
Spanish
Ukrainian
Add, These characters in the first line below are Input and the second line is expected result
LINE 1:
AÁÀÂÄĂĀÃÅÆBCĆČÇDEÉÈÊËĚĘFGHℏIÍÌÎÏJKLŁMNŃŇÑOÓÒÔÖÕØŒPQRŘSŚŠŞTŢUÚÙÛÜŪVWXYÝZŹŽΑΔΕΘΛΜΝΠΡΣΣΦΩАБВГҐДЕЁЄЖЗИЙІЇКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ
LINE 2:
aáàâäăāãåæbcćčçdeéèêëěęfghℏiíìîïjklłmnńňñoóòôöõøœpqrřsśšştţuúùûüūvwxyýzźžαδεθλμνπρσςφωабвгґдеёєжзийіїклмнопрстуфхцчшщъыьэюя
I verified results from Java toLowerCase() and Python string.lower()
and both are correct.
Is there any way to translate to lowercase letter in C or C++?
And Important thing is that the letters are read from a file encoded 'UTF-8'!
Please help me. My English is not very good, so please use simple English as much as you can.
I think you will find what you need in the Boost libraries - see http://www.boost.org/doc/libs/1_51_0/libs/locale/doc/html/conversions.html
Quoting from their website:
Boost.Locale gives powerful tools for development of cross platform
localized software - the software that talks to user in its language.
Provided Features:
Correct case conversion, case folding and normalization.
Collation (sorting), including support for 4 Unicode collation levels.
....
You get the idea, I hope. The function you need is
Boost::Locale::to_lower(yourUTF8String)

geocode webservice address parameter written in another language

I try to use the following google map webservice in order to locate greek addresses:
http://maps.google.com/maps/api/geocode/json?address=Ακαδημίας 16&sensor=false and it does not work.
If I hit the same exactly address but written with latin alphabet characters: maps.google.com/maps/api/geocode/json?address=akadimias 16&sensor=false, it works and returns the right result. Could somebody help with this? (To use this service with greek letters as language parameter)
Thank you in advance,
Nicholas
You should encode the characters. In python you could do:
import urllib
address = 'Ακαδημίας'
encoded_address = urllib.quote(address)
query = 'http://maps.google.com/maps/api/geocode/json?address='+encoded_address+'&sensor=false'
As I am not familiar with greek I don't know if it should read 'Ακαδημίας' or 'Ακαδημίας 16', but the example above gives the encoded address to be
'%CE%91%CE%BA%CE%B1%CE%B4%CE%B7%CE%BC%CE%AF%CE%B1%CF%82'
and seems to work. I guess you have similar methods in PHP or whatever you want to use. For PHP have a look at Problem with greek url characters in IE