![How to get N-grams and TF-IDF count from Chinese documents](https://blogs.sas.com/content/sgf/files/2018/07/ngram_feature-702x230.png)
SAS Visual Text Analytics provides dictionary-based, non-domain-specific tokenization for Chinese documents, but sometimes you still want N-gram tokens. These can be especially helpful when the documents are domain-specific and most of their tokens are not included in the SAS-provided Chinese dictionary. What is an N-gram? An