Skip to content

Latest commit

 

History

History
7 lines (6 loc) · 195 Bytes

File metadata and controls

7 lines (6 loc) · 195 Bytes

Tokenizer

std::string str("w0rd, token-izer. pup's, U.S.a., us., hel.lo");
TermTokenizer tokenizer(str);
std::vector<std::string> tokens(tokenizer.begin(), tokenizer.end());