tesseract: add new languages and others

Tagalog was replaced with Filipino [1] in newer tesseract versions, so it
makes sense for us to use the new name and map it to the old "tgl" code
(Tagalog) under the hood.

Language names obtained from tesseract's man page [2].

[1]: 58f7a72f00
[2]: https://github.com/tesseract-ocr/tesseract/blob/main/doc/tesseract.1.asc
deeplow 2023-02-28 19:36:54 +00:00
parent d8d83ff036
commit 58332fdd6e
GPG key ID: 577982871529A52A

@@ -24,6 +24,7 @@
 "Frankish": "frk",
 "French, Middle (ca.1400-1600)": "frm",
 "Galician": "glg",
+"Greek, Ancient, to 1453": "grc",
 "Hebrew": "heb",
 "Hindi": "hin",
 "Croatian": "hrv",
@@ -50,14 +51,16 @@
 "Russian": "rus",
 "Slovakian": "slk",
 "Spanish": "spa",
-"Spanish": "spa_old",
+"Spanish; Castilian - Old": "spa_old",
 "Albanian": "sqi",
 "Serbian": "srp",
 "Swahili": "swa",
 "Swedish": "swe",
 "Tamil": "tam",
 "Telugu": "tel",
+"Filipino": "tgl",
 "Thai": "tha",
 "Turkish": "tur",
-"Ukrainian": "ukr"
+"Ukrainian": "ukr",
+"Vietnamese": "vie"
 }
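
The diff gives the old-Spanish entry its own display name, where previously two entries were both named "Spanish". A minimal Python sketch of why that matters, assuming the map is read with the standard `json` module: by default `json.loads` silently keeps the last value for a repeated key, so "Spanish" would have resolved to "spa_old". The names `OCR_LANGUAGES_JSON`, `load_languages`, and `tesseract_code` are hypothetical and only illustrate the idea; they are not taken from the project.

```python
import json

# Hypothetical excerpt of the display-name -> tesseract-code map after this commit.
OCR_LANGUAGES_JSON = """
{
    "Greek, Ancient, to 1453": "grc",
    "Spanish": "spa",
    "Spanish; Castilian - Old": "spa_old",
    "Filipino": "tgl",
    "Ukrainian": "ukr",
    "Vietnamese": "vie"
}
"""


def load_languages(text):
    """Parse the map and raise ValueError on a repeated display name."""

    def reject_duplicates(pairs):
        seen = {}
        for name, code in pairs:
            if name in seen:
                raise ValueError(f"duplicate language name: {name!r}")
            seen[name] = code
        return seen

    # object_pairs_hook receives every key/value pair, including repeats.
    return json.loads(text, object_pairs_hook=reject_duplicates)


def tesseract_code(languages, display_name):
    """Return the tesseract language code for a user-facing name."""
    return languages[display_name]


if __name__ == "__main__":
    languages = load_languages(OCR_LANGUAGES_JSON)
    # The new display name "Filipino" still maps to the old "tgl" code.
    print(tesseract_code(languages, "Filipino"))
```

A check like `reject_duplicates` could run in a unit test so that a repeated display name fails loudly instead of overriding an earlier entry.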