std::string str("w0rd, token-izer. pup's, U.S.a., us., hel.lo");
TermTokenizer tokenizer(str);
std::vector<std::string> tokens(tokenizer.begin(), tokenizer.end());
pombredanne / tokenizer Goto Github PK
View Code? Open in Web Editor NEWThis project forked from pisa-engine/tokenizer
License: Apache License 2.0