Language list - unicode-org/inflection GitHub Wiki
List of languages in priority order
Our end goal is to cover all the languages, but if you are looking for a prioritized list to contribute to, see the table below.
List is derived from CLDR database and refers to languages marked as modern in column "Target level".
Data Quantity Key
This is the data quantity of lexemes from Wikidata on July 30, 2025
Symbol | Description |
---|---|
✅ | >10000 lexemes |
⚠️ | >1000 lexemes |
❌ | <1000 lexemes |
N/A | Not applicable or not needed |
Languages
Code | Script code | Description | Data Quantity | Supported |
---|---|---|---|---|
en | Latn | English | ✅ | ✅ |
zh | Hans | Chinese (Simplified, Mandarin) | N/A | ✅ |
es | Latn | Spanish | ✅ | ✅ |
fr | Latn | French | ✅ | ✅ |
pt | Latn | Portuguese | ⚠️ | ✅ |
hi | Deva | Hindi | ⚠️ | ✅ |
ar | Arab | Arabic (Modern Standard) | ⚠️ | ✅ |
ru | Cyrl | Russian | ✅ | ✅ |
de | Latn | German | ✅ | ✅ |
ja | Jpan | Japanese | N/A | ✅ |
it | Latn | Italian | ✅ | ✅ |
id | Latn | Indonesian | N/A | ✅ |
vi | Latn | Vietnamese | N/A | ✅ |
pl | Latn | Polish | ⚠️ | ❌ |
ko | Kore | Korean | ⚠️ | ✅ |
tr | Latn | Turkish | ⚠️ | ✅ |
nl | Latn | Dutch | ⚠️ | ✅ |
zh | Hant | Chinese (Traditional, Mandarin) | N/A | ✅ |
sv | Latn | Swedish | ✅ | ✅ |
ro | Latn | Romanian | ❌ | ❌ |
bn | Beng | Bangla (Bengali) | ✅ | ❌ |
th | Thai | Thai | N/A | ✅ |
cs | Latn | Czech | ✅ | ❌ |
hu | Latn | Hungarian | ❌ | ❌ |
no | Latn | Norwegian (Bokmål) | ✅ | ✅ |
el | Grek | Greek | ✅ | ❌ |
fi | Latn | Finnish | ⚠️ | ❌ |
da | Latn | Danish | ✅ | ✅ |
sk | Latn | Slovak | ✅ | ❌ |
uk | Cyrl | Ukrainian | ✅ | ❌ |
bg | Cyrl | Bulgarian | ❌ | ❌ |
hr | Latn | Croatian | ❌ | ❌ |
iw | Hebr | Hebrew | ✅ | ✅ |
lt | Latn | Lithuanian | ❌ | ❌ |
sl | Latn | Slovenian | ❌ | ❌ |
ms | Latn | Malay | N/A | ✅ |
ca | Latn | Catalan | ❌ | ❌ |
kk | Cyrl | Kazakh | ❌ | ❌ |
fa | Arab | Persian | ✅ | ❌ |
ur | Arab | Urdu | ⚠️ | ❌ |
sw | Latn | Swahili | ❌ | ❌ |
lv | Latn | Latvian | ❌ | ❌ |
et | Latn | Estonian | ✅ | ❌ |
te | Telu | Telugu | ❌ | ❌ |
ta | Taml | Tamil | ❌ | ❌ |
mr | Deva | Marathi | ❌ | ❌ |
fil | Latn | Filipino | ❌ | ❌ |
gu | Gujr | Gujarati | ❌ | ❌ |
is | Latn | Icelandic | ❌ | ❌ |
kn | Knda | Kannada | ❌ | ❌ |
ml | Mlym | Malayalam | ✅ | ❌ |
sr | Cyrl | Serbian (Cyrillic) | ❌ | ✅ |
pa | Guru | Punjabi | ⚠️ | ❌ |
or | Orya | Odia | ❌ | ❌ |
my | Mymr | Burmese (Myanmar) | ❌ | ❌ |
uz | Latn | Uzbek | ❌ | ❌ |
mk | Cyrl | Macedonian | ❌ | ❌ |
az | Latn | Azerbaijani | ❌ | ❌ |
hy | Armn | Armenian | ❌ | ❌ |
as | Beng | Assamese | ❌ | ❌ |
eu | Latn | Basque | ✅ | ❌ |
si | Sinh | Sinhala | ❌ | ❌ |
af | Latn | Afrikaans | ❌ | ❌ |
ka | Geor | Georgian | ❌ | ❌ |
ne | Deva | Nepali | ❌ | ❌ |
sq | Latn | Albanian | ⚠️ | ❌ |