ucto

ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuatio

C++65gpl-3.0

6 days ago

computational-linguisticsfolialanguage

python-ucto

python-ucto

This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first

Cython29

2 months ago

computational-linguisticsfolianlp