Languages Supported
Nanonets supports character and word recognition in more than 40 languages, including but not limited to:
- Afrikaans
- Albanian
- Arabic
- Armenian
- Belarusian
- Bengali
- Bulgarian
- Catalan
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Filipino
- Finnish
- French
- German
- Greek
- Gujarati
- Hebrew
- Hindi
- Hungarian
- Icelandic
- Indonesian
- Italian
- Japanese
- Kannada
- Khmer
- Korean
- Lao
- Latvian
- Lithuanian
- Macedonian
- Malay
- Malayalam
- Marathi
- Nepali
- Norwegian
- Persian
- Polish
- Portuguese
- Punjabi
- Romanian
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- Yiddish
- Amharic
- Ancient Greek
- Assamese
- Azerbaijani
- Basque
- Bosnian
- Burmese
- Cebuano
- Cherokee
- Dhivehi
- Dzongkha
- Esperanto
- Galician
- Georgian
- Haitian Creole
- Irish
- Javanese
- Kazakh
- Kirghiz
- Latin
- Maltese
- Mongolian
- Oriya
- Pashto
- Sanskrit
- Sinhala
- Swahili
- Syriac
- Tibetan
- Tigrinya
- Urdu
- Uzbek
- Welsh
- Zulu
- Acehnese
- Acholi
- Adangme
- Akan
- Algonquian
- Araucanian/Mapuche
- Asturian
- Athabaskan
- Aymara
- Balinese
- Bambara
- Bantu
- Bashkir
- Batak
- Bemba
- Bikol
- Bislama
- Breton
- Chechen
- Chinese (Mandarin, Simplified)
- Chinese (Mandarin, Traditional)
- Chinese (Mandarin, Hong Kong)
- Choctaw
- Chuvash
- Cree
- Creek
- Crimean Tatar
- Dakota
- Duala
- Efik
- English (British)
- Ewe
- Faroese
- Fijian
- Fon
- French (Canadian)
- Fulah
- Ga
- Ganda
- Gayo
- Gilbertese
- Gothic
- Guarani
- Hausa
- Hawaiian
- Herero
- Hiligaynon
- Iban
- Igbo
- Iloko
- Kabyle
- Kachin
- Kalaallisut
- Kamba
- Kanuri
- Kara-Kalpak
- Khasi
- Kikuyu
- Kinyarwanda
- Komi
- Kongo
- Kosraean
- Kuanyama
- Lingala
- Low German
- Lozi
- Luba-Katanga
- Luo
- Madurese
- Malagasy
- Mandingo
- Manx
- Maori
- Marshallese
- Mende
- Middle English
- Middle High German
- Minangkabau
- Mohawk
- Mongo
- Nahuatl
- Navajo
- Ndonga
- Niuean
- North Ndebele
- Northern Sotho
- Nyanja
- Nyankole
- Nyasa Tonga
- Nzima
- Occitan
- Ojibwa
- Old English
- Old French
- Old High German
- Old Norse
- Old Provencal
- Ossetic
- Pampanga
- Pangasinan
- Papiamento
- Portuguese (European)
- Quechua
- Romansh
- Romany
- Rundi
- Sakha
- Samoan
- Sango
- Scots
- Scottish Gaelic
- Shona
- Songhai
- Southern Sotho
- Spanish (Latin American)
- Sundanese
- Swati
- Tahitian
- Tajik
- Tatar
- Temne
- Tongan
- Tsonga
- Tswana
- Turkmen
- Udmurt
- Venda
- Votic
- Western Frisian
- Wolof
- Xhosa
- Yoruba
- Zapotec
Can a single model support multiple different languages?
Yes. You can build a single model that handles documents in multiple languages. A shared model can improve accuracy and is easier to deploy than a separate model per language.
Is handwritten text supported?
Yes, we support handwritten documents, with some important notes:
- Handwriting must be legible to the human eye. If a human reader struggles to understand the handwriting, AI models may also have difficulty extracting text reliably.
- Results may vary depending on writing style, image clarity, and document structure
- We recommend testing a few samples to evaluate performance for your specific use case
Tip: Neat, well-scanned handwritten forms (e.g., block letters or clearly written fields) tend to produce better results
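One way to act on the recommendation to test a few samples is to compare the extracted text against text you have transcribed by hand and compute a character error rate (CER). The sketch below is illustrative only: it does not call any Nanonets API, and the sample pairs and the function names `edit_distance` and `character_error_rate` are hypothetical. It assumes you have already collected the extracted text for each sample.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-character edits to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute ca with cb
        prev = curr
    return prev[-1]


def character_error_rate(expected: str, extracted: str) -> float:
    """Edit distance divided by the length of the expected text."""
    if not expected:
        return 0.0 if not extracted else 1.0
    return edit_distance(expected, extracted) / len(expected)


# Hypothetical samples: (hand-transcribed text, text returned by the model)
samples = [
    ("John Smith", "John Smith"),
    ("42 Baker Street", "42 Baker Streel"),
    ("2024-03-15", "2024-08-15"),
]

rates = [character_error_rate(expected, extracted) for expected, extracted in samples]
for (expected, extracted), cer in zip(samples, rates):
    print(f"{expected!r} -> {extracted!r}: CER {cer:.2%}")
print(f"Average CER: {sum(rates) / len(rates):.2%}")
```

A lower CER means the extraction is closer to the handwritten original. What counts as acceptable depends on your use case, so treat any threshold as your own assumption rather than a product guarantee.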