Reviews
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Search similar apps
License
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Creator
Related apps
python-frog
Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger,
Cython47gpl-3.0
24 days ago
colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with
C++124gpl-3.0
3 days ago
c-plus-pluscomputational-linguisticscorpus
pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Proc
Python479gpl-3.0
last year
computational-linguisticsevaluation-metricsfolia
python-timbl
python-timbl, originally developed by Sander Canisius, is a Python extension mod
Python18gpl-3.0
22 days ago
k-nearest-neighboursknnmachine-learning