Languages Supported in Amazon Comprehend
Amazon Comprehend supports a wide variety of languages for its various features. The languages supported and the features that support them can be seen in the following tables.
Supported Languages
Amazon Comprehend (except the Detect Dominant Language feature) supports the following languages for one or more features.
Code | Language |
---|---|
de |
German |
en |
English |
es |
Spanish |
it |
Italian |
pt |
Portuguese |
fr |
French |
ja |
Japanese |
ko |
Korean |
hi |
Hindi |
ar |
Arabic |
zh |
Chinese (simplified) |
zh-TW |
Chinese (traditional) |
Amazon Comprehend identifies the language using identifiers from RFC 5646 — if there is a 2-letter ISO 639-1 identifier, with a regional subtag if necessary, it uses that. Otherwise, it uses the ISO 639-2 3-letter code. For more information about RFC 5646, see the IETF Tools web site.
Languages Supported by Amazon Comprehend Features
Feature |
Supported Languages |
---|---|
All supported languages. |
|
All supported languages. |
|
English |
|
All supported languages. |
|
German (de), English (en), Spanish (es), French (fr), Italian (it), and Portuguese (pt). |
|
Not dependent on the language used. |
|
German (de), English (en), Spanish (es), French (fr), Italian (it), and Portuguese (pt). |
|
German (de), English (en), Spanish (es), French (fr), Italian (it), and Portuguese (pt). |
Languages Supported by Detect Dominant Language
The Detect the Dominant Language feature can detect the following languages
Code | Language | Code | Language | Code | Language |
---|---|---|---|---|---|
af | Afrikaans | hy | Armenian | ps | Pushto |
am | Amharic | ilo | Iloko | qu | Quechua |
ar | Arabic | id | Indonesian | ro | Romanian |
as | Assamese | is | Icelandic | ru | Russian |
az | Azerbaijani | it | Italian | sa | Sanskrit |
ba | Bashkir | jv | Javanese | si | Sinhala |
be | Belarusian | ja | Japanese | sk | Slovak |
bn | Bengali | kn | Kannada | sl | Slovenian |
bs | Bosnian | ka | Georgian | sd | Sindhi |
bg | Bulgarian | kk | Kazakh | so | Somali |
ca | Catalan | km | Central Khmer | es | Spanish |
ceb | Cebuano | ky | Kirghiz | sq | Albanian |
cs | Czech | ko | Korean | sr | Serbian |
cv | Chuvash | ku | Kurdish | su | Sundanese |
cy | Welsh | la | Latin | sw | Swahili |
da | Danish | lv | Latvian | sv | Swedish |
de | German | lt | Lithuanian | ta | Tamil |
el | Greek | lb | Luxembourgish | tt | Tatar |
en | English | ml | Malayalam | te | Telugu |
eo | Esperanto | mr | Marathi | tg | Tajik |
et | Estonian | mk | Macedonian | tl | Tagalog |
eu | Basque | mg | Malagasy | th | Thai |
fa | Persian | mn | Mongolian | tk | Turkmen |
fi | Finnish | ms | Malay | tr | Turkish |
fr | French | my | Burmese | ug | Uighur |
gd | Scottish Gaelic | ne | Nepali | uk | Ukrainian |
ga | Irish | new | Newari | ur | Urdu |
gl | Galician | nl | Dutch | uz | Uzbek |
gu | Gujarati | no | Norwegian | vi | Vietnamese |
ht | Haitian | or | Oriya | yi | Yiddish |
he | Hebrew | pa | Punjabi | yo | Yoruba |
hi | Hindi | pl | Polish | zh | Chinese (Simplified) |
hr | Croatian | pt | Portuguese | zh-TW | Chinese (Traditional) |
hu | Hungarian |