Change is one of the few constants in our fast-moving world. Organisations must respond rapidly and effectively to changes in their environment, including among their customers. Being able to address new problems and issues, especially at speed, means that innovation is now essential for companies to survive and thrive. However,
Tag: data culture and fluency
Jason Colón was hoping to work on a data for good project – any data for good project, really – offered through SAS’ analytic volunteer program. He had no idea the one he would be asked to do would hit so close to home for him. The organization – which
Using such features and Natural Language Processing capabilities like text parsing and information extraction in SAS Visual Text Analytics (VTA) helps us uncover emerging trends and unlock the value of unstructured text data.
To find exact duplicates, matching all string pairs is the simplest approach, but it is not a very efficient or sufficient technique. Using the MD5 or SHA-1 hash algorithms can get us a correct outcome with a faster speed, yet near-duplicates would still not be on the radar. Text similarity is useful for finding files that look alike. There are various approaches to this and each of them has its own way to define documents that are considered duplicates. Furthermore, the definition of duplicate documents has implications for the type of processing and the results produced. Below are some of the options. Using SAS Visual Text Analytics, you can customize and accomplish this task during your corpus analysis journey either with Python SWAT package or with PROC SQL in SAS.
As head of the SAS Data Ethics Practice, I spend a lot of time contemplating the social implications of AI. Considering its benefits like augmenting medical decisions and pitfalls, making decisions based on biased data results in dire consequences for patients. Such implications have the potential to impact society in a variety
Are you looking to broaden your data analytics skills to land your dream job or propel your career? After looking at job posting statistics and the country's labor market, the data shows that now is the time to jump on board. As the demand for data skills is growing, the
AI has, for many years, been the stuff of fantasy. From the monster in Mary Shelley’s Frankenstein to the dystopian futures depicted in films such as Metropolis, the Matrix and Minority Report, the idea of intelligent machines has been capturing the imagination of writers for centuries. Our ability to store
The SAS Batting Lab was recently featured on NBC’s Today Show. If you missed it, you can watch the segment in the video above. For more about The Batting Lab, get a firsthand look at the experience of the batting cage and learn more about the data literacy value of
The SAS Batting Lab is a six-week program designed to help improve kids’ understanding of data while also helping them improve their baseball and softball swings. Using analytics in an interactive, AI-powered batting cage, kids can compare their swings to batting stars. During the program, the participants also became more
Building a data and analytics culture in higher education means equipping key stakeholders with the skills necessary to analyze and leverage insights extracted from data. Doing so can drive faster, more accurate decision-making. When I hear “data and analytics culture,” I immediately think of the work Jason Simon and his team
In the face of rapid digitalization and modernization, data scientists in Cameroon joined the SAS Hackathon seeking a way to preserve indigenous African languages.
Six scholars from North Carolina A&T State University in technology– or STEM-focused majors helped foster the next generation of data-literate students while also donating to those in need. SAS recently facilitated a donation drive with students from the Wake County Young Men's Leadership Academy (WYMLA) in Raleigh, North Carolina and scholars from
Higher education institutions are some of the most data-rich entities in the world. Postsecondary leaders need high-quality, consistent and accurate insights to make the best decisions for their institution, students and constituents. Data governance is a topic that may seem technical in nature and perhaps important to only the IT
Corpus analysis is a technique widely used by data scientists because it provides an understanding of a document collection and provides insights into the text.
Our traditional assumptions about data are evolving, and so is our understanding of data literacy. Data is more than numbers, charts and graphs. And data literacy is not just for data scientists. “If you’re talking with people who aren’t already data fluent, you have to make them aware that data is