SAS Text Analytics Archives

Patricia Neri | May 22, 2019 | 3

Analysis of Movie Reviews using Visual Text Analytics

This post shows how the automatically generated concepts and categories in Visual Text Analytics (VTA) can be refined using LITI and Boolean rules. I will use a data set that contains information on 1,527 randomly selected movies: their titles, reviews, MPAA ratings, main genre classifications, and viewer ratings.

Advanced Analytics | Programming Tips

Emily Gao | July 26, 2018 | 3

How to tokenize documents into sentences

SAS Visual Analytics includes text parsing actions that can help tokenize sentences, and SAS Visual Text Analytics provides even better, more sophisticated methods. This article contains code samples and cites papers for more details.

Advanced Analytics | Programming Tips

Emily Gao | July 18, 2018 | 0

How to sample textual data with SAS

See how to sample unstructured (text) data using SAS Viya and CAS actions. This post includes complete code to cluster the text documents via k-means, and treats the cluster memberships as strata for analysis.
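The stratified-sampling idea in this teaser can be sketched in plain Python. This is only an illustration, not the SAS Viya code from the post: the `stratified_sample` helper, the sampling fraction, and the hard-coded cluster labels are all assumptions, and the clustering step itself (k-means in the post) is taken as already done.

```python
import random
from collections import defaultdict


def stratified_sample(docs, labels, fraction, seed=0):
    """Draw roughly `fraction` of the documents from each stratum.

    `labels` are assumed to be cluster memberships produced elsewhere
    (for example by k-means); this function does not cluster anything.
    """
    rng = random.Random(seed)

    # Group documents by their cluster label (one stratum per cluster).
    strata = defaultdict(list)
    for doc, label in zip(docs, labels):
        strata[label].append(doc)

    sample = []
    for label in sorted(strata):
        members = strata[label]
        # Keep at least one document per stratum so small clusters survive.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical input: ten documents, six in cluster 0 and four in cluster 1.
docs = [f"doc{i}" for i in range(10)]
labels = [0] * 6 + [1] * 4
sample = stratified_sample(docs, labels, fraction=0.5)
```

Sampling within each cluster, rather than across the whole collection, keeps small clusters represented in the sample.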

Advanced Analytics | Programming Tips

Emily Gao | July 5, 2018 | 1

How to get N-grams and TF-IDF count from Chinese documents

SAS Visual Text Analytics provides dictionary-based, non-domain-specific tokenization for Chinese documents; however, sometimes you still want N-gram tokens. This can be especially helpful when the documents are domain-specific and most of the tokens are not included in the SAS-provided Chinese dictionary. What is an N-gram? An
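As a rough illustration of N-gram tokens for text without a dictionary, the sketch below builds character N-grams and weights them with one common TF-IDF variant (raw term frequency times `log(N / df)`). The function names, the sample strings, and the choice of weighting formula are assumptions for illustration; the post itself may compute these differently in SAS.

```python
import math
from collections import Counter


def char_ngrams(text, n):
    """Return overlapping character N-grams, ignoring whitespace."""
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]


def tf_idf(docs, n=2):
    """Weight each N-gram in each document by tf * log(N / df)."""
    counts = [Counter(char_ngrams(doc, n)) for doc in docs]

    # Document frequency: in how many documents each N-gram appears.
    df = Counter()
    for c in counts:
        df.update(c.keys())

    total = len(docs)
    return [
        {gram: tf * math.log(total / df[gram]) for gram, tf in c.items()}
        for c in counts
    ]


# Hypothetical two-document corpus.
docs = ["文本分析", "文本挖掘"]
bigrams = char_ngrams(docs[0], 2)
weights = tf_idf(docs, n=2)
```

With this weighting, an N-gram that occurs in every document gets a weight of zero, so only the N-grams that distinguish documents carry weight.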

Advanced Analytics | Machine Learning

Emily Gao | March 1, 2017 | 0

How to extract domain-specific sentiment lexicons

In 2011, Loughran and McDonald applied a general sentiment word list to accounting and finance topics, and this led to a high rate of misclassification. They found that about three-fourths of the negative words in the Harvard IV TagNeg dictionary of negative words are typically not negative in a financial

Advanced Analytics

Emily Gao | January 25, 2017 | 4

Analyzing Trump v. Clinton text data at Reddit

Recently a colleague told me Google had published new, interesting data sets at BigQuery. I found a lot of Reddit data as well, so I quickly tried running BigQuery with these text data to see what I could produce. After getting some pretty interesting results, I wanted to see if

Advanced Analytics

Emily Gao | January 4, 2017 | 0

Word scatter plot with SAS

In my last post, I showed you how to generate a word cloud of PDF collections. Word clouds show you which terms appear in your documents and how frequently they occur. However, word clouds cannot lay out words from a semantic or linguistic perspective.

Advanced Analytics

Emily Gao | December 13, 2016 | 4

Fun with SAS Text Analytics: A qualitative analysis of IALP papers

Last week, I attended the IALP 2016 conference (20th International Conference on Asian Language Processing) in Taiwan. After the conference, each presenter received a USB flash drive with all accepted papers in PDF format. So when I got back to Beijing, I began going through the papers to extend my learning. Usually, when

Advanced Analytics

Waynette Tubbs | August 31, 2012 | 0

Friday's Innovation Inspiration - Digging in to social data

A super hot topic in most organizations is how to make the most of the troves of social data available. This Post-It Note author isn't specific about the SAS solution that is being used, so I'm going to speculate that he or she is taking advantage of SAS Text Miner, SAS Text
