![Figure 1. Sample document](https://blogs.sas.com/content/sgf/files/2021/07/sampleDocImage.jpg)
In Part I of this blog post, I provided an overview of the approach my team and I took to classifying diverse, messy documents at scale. I shared the details of how we chose to preprocess the data and how we created features from the documents of interest.