In part 1 of this post, we looked at setting up Spark jobs from SAS Cloud Analytic Services (CAS) to load and save data to and from Hadoop. Now we move on to the next step in the analytic cycle: scoring data in Hadoop and executing SAS code as a…
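As a rough sketch of what that workflow can look like from Python, the snippet below uses the SWAT package to connect to CAS, register a Hadoop caslib that transfers data through Spark, and load a Hive table into memory. The host, credentials, and data-source option values are illustrative assumptions, not the exact code from the post.

```python
# Hypothetical sketch (not the exact code from the post): connect to CAS,
# define a Hadoop caslib whose data transfer runs through Spark, and load
# a Hive table into an in-memory CAS table ready for scoring.
import swat

conn = swat.CAS("cas-controller.example.com", 5570)  # assumed controller host/port

conn.table.addCaslib(
    name="hdplib",
    dataSource=dict(
        srcType="hadoop",
        server="hive.example.com",       # assumed Hive server
        username="hive",                 # assumed account
        schema="default",
        dataTransferMode="parallel",     # assumed: parallel transfer via the data connect accelerator
        platform="spark",                # assumed: run the transfer as a Spark job
    ),
)

# Load a Hive table into CAS for in-memory analytics and scoring.
conn.table.loadTable(
    caslib="hdplib",
    path="sales",
    casout=dict(name="sales", caslib="casuser"),
)
```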
This article is not a tutorial on Hadoop, Spark, or big data, and no prior knowledge of these technologies is required to follow along. We’ll give you enough background before diving into the details. In simplest terms, the Hadoop framework maintains the data and Spark controls and…
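A minimal PySpark sketch of that division of labor, assuming a CSV file already sitting in HDFS (the path and column name below are placeholders): Hadoop holds the data, and Spark reads it into memory and runs the computation.

```python
# Illustrative only: HDFS (Hadoop) stores the file; Spark pulls it in and processes it.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("hadoop-plus-spark").getOrCreate()

# Read the file from HDFS and run a parallel aggregation in Spark.
df = spark.read.csv("hdfs:///data/transactions.csv", header=True, inferSchema=True)
df.groupBy("region").count().show()

spark.stop()
```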
David Loshin discusses big data identity resolution in a programming and execution environment.
As the big data era continues to evolve, Hadoop remains the workhorse for distributed computing environments. MapReduce has been the dominant processing framework in Hadoop, but Spark, thanks to its superior in-memory performance, is seeing rapid adoption. As the Hadoop ecosystem matures, users need the flexibility to use either traditional MapReduce…
I've been investigating Apache Spark, and I'm particularly intrigued by the concept of the resilient distributed dataset, or RDD. According to the Apache Spark website, an RDD is “a fault-tolerant collection of elements that can be operated on in parallel.” Two aspects of the RDD are particularly…
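To make that definition concrete, here is a small PySpark sketch that builds an RDD from a local range, spreads it across partitions, and runs a transformation and an action in parallel; the numbers themselves are arbitrary.

```python
# Small sketch of the RDD idea quoted above: a collection split into partitions
# that can be transformed in parallel, with fault tolerance coming from the
# recorded lineage of transformations rather than from data replication.
from pyspark import SparkContext

sc = SparkContext(appName="rdd-sketch")

numbers = sc.parallelize(range(1, 11), numSlices=4)  # distributed across 4 partitions
squares = numbers.map(lambda x: x * x)               # transformation, evaluated lazily
total = squares.reduce(lambda a, b: a + b)           # action triggers parallel execution

print(total)  # 385
sc.stop()
```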