![Classifying messy documents: A common-sense approach (Part I) Figure 5. Candidate feature examples](https://blogs.sas.com/content/sgf/files/2021/07/candidateFeature-702x333.jpg)
Unstructured text data is ubiquitous in both business and government, and extracting value from it at scale is a common challenge. Organizations that have been around for a while often have vast paper archives. Digitizing these archives does not necessarily make them usable for search and analysis, since documents are