Languages Supported
Nanonets supports character/word recognition in over 200+ languages including but not limited to:
- Acehnese
- Acholi
- Adangme
- Afrikaans
- Akan
- Albanian
- Algonquinian
- Amharic
- Ancient Greek
- Arabic
- Araucanian/Mapuche
- Armenian
- Assamese
- Asturian
- Athabaskan
- Azerbaijani
- Aymara
- Balinese
- Bambara
- Bantu
- Bashkir
- Basque
- Batak
- Belarusian
- Bemba
- Bengali
- Bikol
- Bislama
- Bosnian
- Breton
- Bulgarian
- Burmese
- Catalan
- Cebuano
- Chechen
- Cherokee
- Chinese
- Chinese (Mandarin, Hong Kong)
- Chinese (Mandarin, Simplified)
- Chinese (Mandarin, Traditional)
- Choctaw
- Chuvash
- Cree
- Creek
- Crimean Tatar
- Croatian
- Czech
- Dakota
- Danish
- Dhivehi
- Duala
- Dutch
- Dzonkha
- Efik
- English
- English (British)
- Esperanto
- Estonian
- Ewe
- Faroese
- Fijian
- Filipino
- Finnish
- Fon
- French
- French (Canadian)
- Fulah
- Ga
- Galician
- Ganda
- Gayo
- Georgian
- German
- Gilbertese
- Gothic
- Greek
- Guarani
- Gujarati
- Haitian Creole
- Hausa
- Hawaiian
- Hebrew
- Herero
- Hiligaynon
- Hindi
- Hungarian
- Iban
- Icelandic
- Igbo
- Iloko
- Indonesian
- Irish
- Italian
- Japanese
- Javanese
- Kabyle
- Kachin
- Kalaallisut
- Kamba
- Kannada
- Kanuri
- Kara-Kalpak
- Kazakh
- Khmer
- Khasi
- Kikuyu
- Kinyarwanda
- Kirghiz
- Komi
- Kongo
- Korean
- Kosraean
- Kuanyama
- Lao
- Latin
- Latvian
- Lingala
- Lithuanian
- Low German
- Lozi
- Luba-Katanga
- Luo
- Macedonian
- Madurese
- Malagasy
- Malay
- Malayalam
- Maltese
- Mandingo
- Manx
- Maori
- Marathi
- Marshallese
- Mende
- Middle English
- Middle High German
- Minangkabau
- Mohawk
- Mongo
- Mongolian
- Nahuatl
- Navajo
- Ndonga
- Nepali
- Niuean
- North Ndebele
- Northern Sotho
- Norwegian
- Nyanja
- Nyankole
- Nyasa Tonga
- Nzima
- Occitan
- Ojibwa
- Old English
- Old French
- Old High German
- Old Norse
- Old Provencal
- Oriya
- Ossetic
- Pampanga
- Pangasinan
- Papiamento
- Pashto
- Persian
- Polish
- Portuguese
- Portuguese (European)
- Punjabi
- Quechua
- Romanian
- Romansh
- Romany
- Rundi
- Russian
- Sakha
- Samoan
- Sango
- Sanskrit
- Scots
- Scottish Gaelic
- Serbian
- Shona
- Sinhala
- Slovak
- Slovenian
- Songhai
- Southern Sotho
- Spanish
- Spanish (Latin American)
- Sundanese
- Swahili
- Swati
- Swedish
- Syriac
- Tagalog
- Tahitian
- Tajik
- Tamil
- Tatar
- Telugu
- Temne
- Thai
- Tibetan
- Tigirinya
- Tongan
- Tsonga
- Tswana
- Turkish
- Turkmen
- Udmurt
- Ukrainian
- Urdu
- Uzbek
- Venda
- Vietnamese
- Votic
- Welsh
- Western Frisian
- Wolof
- Xhosa
- Yiddish
- Yoruba
- Zapotec
- Zulu
Can a single model support multiple different languages?
Yes. You can build a single model to handle documents in multiple languages. It contributes to better accuracy and is easy to deploy.
Is handwritten text supported?
Yes, we support handwritten documents — with some important notes:
- Handwriting must be legible to the human eye. If a human reader struggles to understand the handwriting, AI models may also have difficulty extracting text reliably
- Results may vary depending on writing style, image clarity, and document structure
- We recommend testing a few samples to evaluate performance for your specific use case
Tip: Neat, well-scanned handwritten forms (e.g., block letters or clearly written fields) tend to produce better results
Updated 17 days ago
