Skip to content

Commit b85d8c8

Browse files
authored
feat: Added Czech to the set of vocabularies in datasets/vocabs.py (#885)
1 parent 2c697ff commit b85d8c8

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

doctr/datasets/vocabs.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,7 @@
3131
VOCABS['german'] = VOCABS['english'] + 'äöüßÄÖÜẞ'
3232
VOCABS['arabic'] = (VOCABS['digits'] + VOCABS['hindi_digits'] + VOCABS['arabic_letters'] + VOCABS['persian_letters'] +
3333
VOCABS['arabic_diacritics'] + VOCABS['arabic_punctuation'] + VOCABS['punctuation'])
34+
VOCABS['czech'] = VOCABS['english'] + 'áčďéěíňóřšťúůýžÁČĎÉĚÍŇÓŘŠŤÚŮÝŽ'
3435
VOCABS['vietnamese'] = (VOCABS['english'] +
3536
'áàảạãăắằẳẵặâấầẩẫậéèẻẽẹêếềểễệóòỏõọôốồổộỗơớờởợỡúùủũụưứừửữựiíìỉĩịýỳỷỹỵ' +
3637
'ÁÀẢẠÃĂẮẰẲẴẶÂẤẦẨẪẬÉÈẺẼẸÊẾỀỂỄỆÓÒỎÕỌÔỐỒỔỘỖƠỚỜỞỢỠÚÙỦŨỤƯỨỪỬỮỰIÍÌỈĨỊÝỲỶỸỴ')

0 commit comments

Comments
 (0)