Blogs

Author

Daria Rostovtseva RSS
Principal Data Scientist

Daria Rostovtseva is a Principal Data Scientist on the SAS Fraud and Security Intelligence team. In her role, she helps government and private organizations leverage the power of analytics to fight fraud and improve services to their constituents.

Programming Tips

Daria RostovtsevaAugust 16, 2021 0

Classifying messy documents: A common-sense approach (Part II)

In Part I of this blog post, I provided an overview of the approach my team and I took tackling the problem of classifying diverse, messy documents at scale. I shared the details of how we chose to preprocess the data and how we created features from documents of interest

English

Advanced Analytics | Analytics | Machine Learning

Daria RostovtsevaAugust 4, 2021 0

Classifying messy documents: A common-sense approach (Part I)

Unstructured text data is ubiquitous in both business and government and extracting value from it at scale is a common challenge. Organizations that have been around for a while often have vast paper archives. Digitizing these archives does not necessarily make them usable for search and analysis, since documents are

English

Programming Tips

Daria RostovtsevaDecember 7, 2020 0

Append and Replace Records in a CAS Table

In my previous blog post, I talked about using PROC CAS to accomplish various data preparation tasks. Since then, my colleague Todd Braswell and I worked through some interesting challenges implementing an Extract, Transform, Load (ETL) process that continuously updates data in CAS. (Todd is really the brains behind getting

English

1 2 Next