‚Äč

Cloud OCR SDK Documentation

Recognition Languages

ABBYY Cloud OCR SDK supports the following recognition languages:

Internal nameRecognition languageCan be used for OCRCan be used for ICRCan be used for BCR
Abkhaz Abkhaz +
Adyghe Adyghe +  
Afrikaans Afrikaans + +
Agul Agul +  
Albanian Albanian + +
Altaic Altaic +  
Arabic Arabic (Saudi Arabia) +  
ArmenianEastern Armenian (Eastern) +  
ArmenianGrabar Armenian (Grabar) +  
ArmenianWestern Armenian (Western) +  
Awar Avar +  
Aymara Aymara + +
AzeriCyrillic Azerbaijani (Cyrillic) +  
AzeriLatin Azerbaijani (Latin) + +
Bashkir Bashkir +  
Basque Basque + +
Belarusian Belarussian +  
Bemba Bemba + +
Blackfoot Blackfoot + +
Breton Breton + +
Bugotu Bugotu + +
Bulgarian Bulgarian + +
Buryat Buryat + +
Catalan Catalan +  
Chamorro Chamorro + +
Chechen Chechen +  
ChinesePRC Chinese Simplified +   +
ChineseTaiwan Chinese Traditional +   +
Chukcha Chukcha +  
Chuvash Chuvash +  
CMC7 For MICR CMC-7 text type +
Corsican Corsican + +
CrimeanTatar Crimean Tatar + +
Croatian Croatian + +
Crow Crow + +
Czech Czech + + +
Danish Danish + + +
Dargwa Dargwa +  
Digits Numbers* + +
Dungan Dungan +  
Dutch Dutch (Netherlands) + + +
DutchBelgian Dutch (Belgium) + +
E13B For MICR (E-13B) text type +
English English + + +
EskimoCyrillic Eskimo (Cyrillic) +  
EskimoLatin Eskimo (Latin) +
Esperanto Esperanto +
Estonian Estonian + + +
Even Even + +
Evenki Evenki + +
Farsi Farsi +
Faeroese Faeroese +
Fijian Fijian + +
Finnish Finnish + + +
French French + + +
Frisian Frisian + +
Friulian Friulian + +
GaelicScottish Scottish Gaelic + +
Gagauz Gagauz +
Galician Galician + +
Ganda Ganda + +
German German + + +
GermanLuxembourg German (Luxembourg) + +
GermanNewSpelling German (new spelling) + +
Greek Greek + + +
Guarani Guarani + +
Hani Hani + +
Hausa Hausa +
Hawaiian Hawaiian + +
Hebrew Hebrew +
Hungarian Hungarian + + +
Icelandic Icelandic +
Ido Ido + +
Indonesian Indonesian + + +
Ingush Ingush +  
Interlingua Interlingua + +
Irish Irish + +
Italian Italian + + +
Japanese Japanese + +
Kabardian Kabardian +
Kalmyk Kalmyk +  
KarachayBalkar Karachay-Balkar + +
Karakalpak Karakalpak +  
Kasub Kasub + +
Kawa Kawa + +
Kazakh Kazakh + +
Khakas Khakas +  
Khanty Khanty +  
Kikuyu Kikuyu +
Kirgiz Kirghiz + +
Kongo Kongo + +
Korean Korean +   +
KoreanHangul Korean (Hangul) +
Koryak Koryak +
Kpelle Kpelle + +
Kumyk Kumyk + +
Kurdish Kurdish + +
Lak Lak +
Lappish Sami (Lappish) + +
Latin Latin + +
Latvian Latvian + +
LatvianGothic Latvian language written in Gothic script +  
Lezgin Lezgin +
Lithuanian Lithuanian + +
Luba Luba + +
Macedonian Macedonian +
Malagasy Malagasy + +
Malay Malay +
Malinke Malinke + +
Maltese Maltese +
Mansi Mansi +
Maori Maori + +
Mari Mari +
Maya Maya + +
Miao Miao + +
Minankabaw Minangkabau + +
Mohawk Mohawk + +
Mongol Mongol + +
Mordvin Mordvin + +
Nahuatl Nahuatl + +
Nenets Nenets + +
Nivkh Nivkh + +
Nogay Nogay + +
Norwegian NorwegianNynorsk + NorwegianBokmal + + +
NorwegianBokmal Norwegian (Bokmal) + + +
NorwegianNynorsk Norwegian (Nynorsk) + + +
Nyanja Nyanja + +
Occidental Occidental +
Ojibway Ojibway + +
OldEnglish Old English + +
OldFrench Old French + +
OldGerman Old German + +
OldItalian Old Italian + +
OldSlavonic Old Slavonic +
OldSpanish Old Spanish + +
Ossetic Ossetian +  
Papiamento Papiamento + +
PidginEnglish Tok Pisin + +
Polish Polish + + +
PortugueseBrazilian Portuguese (Brazil) + + +
PortugueseStandard Portuguese (Portugal) + + +
Provencal Provencal +
Quechua Quechua + +
RhaetoRomanic Rhaeto-Romanic + +
Romanian Romanian + +
RomanianMoldavia Romanian (Moldavia) + +
Romany Romany + +
Ruanda Ruanda + +
Rundi Rundi + +
RussianOldSpelling Russian (old spelling) +  
Russian Russian + + +
Samoan Samoan + +
Selkup Selkup + +
SerbianCyrillic Serbian (Cyrillic) + +
SerbianLatin Serbian (Latin) + +
Shona Shona +
Sioux Sioux (Dakota) + +
Slovak Slovak + +
Slovenian Slovenian + +
Somali Somali + +
Sorbian Sorbian +  
Sotho Sotho + +
Spanish Spanish + + +
Sunda Sunda +
Swahili Swahili + +
Swazi Swazi + +
Swedish Swedish + + +
Tabassaran Tabassaran +
Tagalog Tagalog + +
Tahitian Tahitian + +
Tajik Tajik + +
Tatar Tatar +
Thai Thai +  
Tinpo Jingpo + +
Tongan Tongan + +
Tswana Tswana + +
Tun Tun + +
Turkish Turkish + + +
Turkmen Turkmen +  
Tuvin Tuvan + +
Udmurt Udmurt +  
UighurCyrillic Uighur (Cyrillic) +
UighurLatin Uighur (Latin) + +
Ukrainian Ukrainian + + +
UzbekCyrillic Uzbek (Cyrillic) +
UzbekLatin Uzbek (Latin) + +
Vietnamese Vietnamese +  
Visayan Cebuano + +
Welsh Welsh +  
Wolof Wolof + +
Xhosa Xhosa + +
Yakut Yakut +  
Yiddish Yiddish +  
Zapotec Zapotec + +
Zulu Zulu +  

* Besides the ten digits 0123456789, the Digits predefined language contains the following characters:

  • punctuation marks ()+,-./:=
  • #$([{¢£€ characters are allowed to precede the word
  • %).]}°¼½¾ characters are allowed after the word

So you can recognize sequences like "$450" or "12%" using this predefined language.