Amazon Comprehend
Developer Guide

Languages Supported in Amazon Comprehend

Amazon Comprehend supports a wide variety of languages for its various features. The languages supported and the features that support them can be seen in the following tables.

Supported Languages

Amazon Comprehend (except the Detect Dominant Language feature) supports the following languages for one or more features.

Code Language Code Language Code Language

de

German

en

English

es

Spanish

it

Italian

pt

Portuguese

fr

French

ja

Japanese

ko

Korean

hi

Hindi

ar

Arabic

zh

Chinese (simplified)

zh-TW

Chinese (traditional)

Note

Amazon Comprehend identifies the language using identifiers from RFC 5646 — if there is a 2-letter ISO 639-1 identifier, with a regional subtag if necessary, it uses that. Otherwise, it uses the ISO 639-2 3-letter code. For more information about RFC 5646, see the IETF Tools web site.

Languages Supported by Amazon Comprehend Features

Feature

Supported Languages

Detect the Dominant Language

See Languages Supported by Detect Dominant Language

Detect Entities

Available in all supported languages.

Detect Key Phrases

Available in all supported languages.

Determine Sentiment

Available in all supported languages.

Analyze Syntax

Available in German (de), English (en), Spanish (es), French (fr), Italian (it), and Portuguese (pt).

Topic Modeling

Not dependent on the language used.

Custom Classification

Available in German (de), English (en), Spanish (es), French (fr), Italian (it), and Portuguese (pt).

Custom Entity Recognition

Available in English (en) only.

Languages Supported by Detect Dominant Language

The Detect the Dominant Language feature can detect the following languages

Code Language Code Language Code Language
af Afrikaans hy Armenian ps Pushto
am Amharic ilo Iloko qu Quechua
ar Arabic id Indonesian ro Romanian
as Assamese is Icelandic ru Russian
az Azerbaijani it Italian sa Sanskrit
ba Bashkir jv Javanese si Sinhala
be Belarusian ja Japanese sk Slovak
bn Bengali kn Kannada sl Slovenian
bs Bosnian ka Georgian sd Sindhi
bg Bulgarian kk Kazakh so Somali
ca Catalan km Central Khmer es Spanish
ceb Cebuano ky Kirghiz sq Albanian
cs Czech ko Korean sr Serbian
cv Chuvash ku Kurdish su Sundanese
cy Welsh la Latin sw Swahili
da Danish lv Latvian sv Swedish
de German lt Lithuanian ta Tamil
el Greek lb Luxembourgish tt Tatar
en English ml Malayalam te Telugu
eo Esperanto mr Marathi tg Tajik
et Estonian mk Macedonian tl Tagalog
eu Basque mg Malagasy th Thai
fa Persian mn Mongolian tk Turkmen
fi Finnish ms Malay tr Turkish
fr French my Burmese ug Uighur
gd Scottish Gaelic ne Nepali uk Ukrainian
ga Irish new Newari ur Urdu
gl Galician nl Dutch uz Uzbek
gu Gujarati no Norwegian vi Vietnamese
ht Haitian or Oriya yi Yiddish
he Hebrew pa Punjabi yo Yoruba
hi Hindi pl Polish zh Chinese (Simplified)
hr Croatian pt Portuguese zh-TW Chinese (Traditional)
hu Hungarian