What's your favorite function in SAS?

By Stacey Syphus on SAS Learning Post | September 22, 2016

Last time I checked, there are well over 500 functions and call routines in SAS. I’ve taught SAS programming courses for 15 years, and I’ll admit that occasionally my students will ask me about a particular function that I have honestly never heard of. I remember the first time this happened, a student told me he thought the SPEDIS function was the greatest thing in SAS. As a new instructor, I was a bit embarrassed I had never heard of the SPEDIS function, so at a break I asked the other three local instructors, who probably had a combined 50+ years of SAS experience. I felt a little better when none of them had heard of it either!

With so many functions available, it is easy for a new programmer to get overwhelmed. I was asked to consolidate the LONG list into a more accessible list of favorites. After putting the request out to my fellow instructors world-wide, these are our collective favorites:

Category	Function	Description
Character	CAT	Does not remove leading or trailing blanks, and returns a concatenated character string.
	CATS	Removes leading and trailing blanks, and returns a concatenated character string.
	CATX	Removes leading and trailing blanks, inserts delimiters, and returns a concatenated character string.
	COMPBL	Compresses runs of multiple blanks in a character string into a single blank.
	COMPRESS	Returns a character string with specified characters removed from the original string.
	FIND	Searches for a specific substring of characters within a character string.
	LEFT	Left-aligns a character string.
	LENGTH	Returns the length of a non-blank character string, excluding trailing blanks, and returns 1 for a blank character string.
	LOWCASE	Converts all letters in an argument to lowercase.
	PROPCASE	Converts all words in an argument to proper case.
	SCAN	Returns the nth word from a character string.
	SUBSTR	Extracts a substring from an argument.
	TRANWRD	Replaces all occurrences of a substring in a character string.
	TRIM	Removes trailing blanks from a character string, and returns one blank if the string is missing.
	UPCASE	Converts all letters in an argument to uppercase.
Date & Time	DATEPART	Extracts the date from a SAS datetime value.
	INTCK	Returns the number of interval boundaries of a given kind that lie between two dates, times, or datetime values.
	INTNX	Increments a date, time, or datetime value by a given time interval, and returns a date, time, or datetime value.
	MDY	Returns a SAS date value from month, day, and year values.
	MONTH	Returns the month from a SAS date value.
	QTR	Returns the quarter of the year from a SAS date value.
	TODAY	Returns the current date as a numeric SAS date value.
	WEEK	Returns the week-number value.
	WEEKDAY	From a SAS date value, returns an integer that corresponds to the day of the week.
	YEAR	Returns the year from a SAS date value.
	YRDIF	Returns the difference in years between two dates according to specified day count conventions; returns a person’s age.
Descriptive Statistics	LARGEST	Returns the kth largest nonmissing value.
	MAX	Returns the largest value.
	MEAN	Returns the arithmetic mean (average).
	MEDIAN	Returns the median value.
	MIN	Returns the smallest value.
	N	Returns the number of nonmissing numeric values.
	NMISS	Returns the number of missing numeric values.
	SMALLEST	Returns the kth smallest nonmissing value.
	STD	Returns the standard deviation of the nonmissing arguments.
	SUM	Returns the sum of the nonmissing arguments.
Special	INPUT	Returns the value that is produced when SAS converts an expression by using the specified informat. (Used for converting character columns to numeric)
	PUT	Returns a value using a specified format. (Used for converting numeric columns to character)
Truncation	CEIL	Returns the smallest integer that is greater than or equal to the argument, fuzzed to avoid unexpected floating-point results.
	FLOOR	Returns the largest integer that is less than or equal to the argument, fuzzed to avoid unexpected floating-point results.
	INT	Returns the integer value, fuzzed to avoid unexpected floating-point results.
	ROUND	Rounds the first argument to the nearest multiple of the second argument, or to the nearest integer when the second argument is omitted.
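
As a small illustration of how the three concatenation functions in the table differ, here is a minimal sketch. The variable names and sample values are made up for the example.

```sas
data _null_;
   length first last $ 10;
   first = 'Ada';                 /* stored with 7 trailing blanks */
   last  = 'Lovelace';            /* stored with 2 trailing blanks */

   c  = cat(first, last);         /* blanks are kept between the two values */
   cs = cats(first, last);        /* leading and trailing blanks are removed */
   cx = catx(', ', last, first);  /* blanks removed, delimiter inserted */

   put c= / cs= / cx=;
run;
```

Comparing the three lines in the log shows why CATS and CATX usually replace the older combination of TRIM, LEFT and the concatenation operator.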

Are we missing any of your favorites?

About Author

Stacey Syphus
Technical Trainer/SAS Enterprise Guide Curriculum Manager

Stacey Syphus is a senior manager and instructor for SAS Education. Her areas of expertise include SAS programming, SAS Enterprise Guide and SAS Studio. Occasionally she gets back to her roots and teaches a statistics course. When not teaching or writing SAS training, she is likely chauffeuring her 3 children to various activities, visiting fun places near her Northern California home, or watching college football or basketball.

46 Comments

Tony on November 18, 2016 11:18 am

I have to give another vote to STRIP and COALESCE(C) as well. Also, I'm a frequent user of NOTDIGIT for checking if it's safe to perform character-to-numeric conversion using INPUT.
David Rosenfeld on November 9, 2016 10:35 am

What about the LAG function? When understood and used properly (not inside a test) it makes cross-record comparison easy, especially when used in conjunction with the retain statement, as when one needs to collapse clumps of records while testing for a break in time between the end date of one and the start date of another.
David R.
Terry Eastman on November 8, 2016 9:11 pm

STRIP = TRIM( LEFT ( ... ) ) less is more :)
Philip R Holland on November 8, 2016 1:59 pm

My favourites are probably STRIP() and COALESCE(), which I use mostly in PROC SQL, but CHOOSEN() and CHOOSEC() are also useful in PROC SQL instead of the CASE WHEN ELSE END when you have sequential numeric choices.
DarthPathos on October 20, 2016 9:36 pm

So many cool functions I've never used!

Two of my favourites have already been mentioned - GEODIST and PRXMATCH.

One of my other favourites is SOUNDEX (similar to SPEDIS); however, I have ditched it for COMPGED. If you do a lot of text-based analytics, check out my post https://communities.sas.com/t5/SAS-Communities-Library/PROC-SQL-Continued-Basic-Text-Analytics-Using-Song-Titles/ta-p/241007 where I use SOUNDEX, and then update it with COMPGED.

Now I'm curious about UUIDGEN - something to research tomorrow at work :-)
Chris
Robert Allison on October 13, 2016 7:48 am

I find myself reading lots of oddball data into SAS from text, and the scan() function really comes in handy for parsing the data in various flexible ways.
Prasanna Sondur on October 6, 2016 2:39 pm

Good one to refer. Thanks for sharing! Would have wished to see index() and strip() as well in the list.
- Stacey Syphus on October 6, 2016 3:09 pm
  
  Thanks! I agree I overlooked STRIP(), but I left out INDEX on purpose... Did you know the FIND function was introduced in SAS 9, and does exactly what INDEX does, but has 2 additional arguments that allow you to make the search case insensitive and select a start position? Along with the CAT functions, I consider it to be one of the great "new" additions.
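
For readers who have not tried the extra FIND arguments mentioned in the reply above, a minimal sketch might look like this. The sample string and the start position of 30 are arbitrary choices for illustration.

```sas
data _null_;
   text = 'Favorite SAS Functions: find, FIND, Find';

   pos_index  = index(text, 'FIND');         /* case-sensitive, searches from position 1 */
   pos_nocase = find(text, 'FIND', 'i');     /* 'i' modifier ignores case */
   pos_later  = find(text, 'FIND', 'i', 30); /* same search, starting at position 30 */

   put pos_index= pos_nocase= pos_later=;
run;
```

The three values in the log should differ, because each call either matches a different spelling or starts looking at a different position.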
Otto Schramek on October 5, 2016 7:13 am

Very good choice! :-)
Some of you already mentioned MISSING and COALESCE[C].
I want to add just one more: REVERSE
For example, You can get the last word of a string:
last_word=reverse(scan(reverse(), 1))
- Otto Schramek on October 5, 2016 7:16 am
  
  Sorry, I have forgotten the string... ;-)
  last_word=reverse(scan(reverse(text), 1))
  - Stacey Syphus on October 5, 2016 4:50 pm
    
    REVERSE is cool, but did you know you can use negative numbers with the SCAN function to count from the right? So SCAN(var, -1) will give you the last word. I love that trick!
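
The two approaches in this thread can be put side by side in a short sketch. The sample sentence is made up, and both calls rely on the default SCAN delimiters.

```sas
data _null_;
   text = 'What is your favorite function';

   last_reverse = reverse(scan(reverse(text), 1)); /* reverse, take first word, reverse back */
   last_scan    = scan(text, -1);                  /* negative count scans from the right */

   put last_reverse= last_scan=;
run;
```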
loredana on October 5, 2016 5:31 am

I absolutely love the SCAN function, especially its macro version (%SCAN). Combined with an iterative step, I can extract elements in a macro variable that represent key words, such as client IDs or variable names.
- Stacey Syphus on October 5, 2016 4:50 pm
  
  Maybe I'll have to do a separate post for macro functions :)
Sunil on October 4, 2016 7:10 pm

Thanks Stacey for posting this blog on favorite SAS functions! I also like the additional SAS functions posted by users.
tc on September 26, 2016 11:33 pm

RESOLVE is so insanely great I wrote a whole paper on it years ago:
A Better SYSIN Than SYSIN: Instream Files on Any Platform
http://www2.sas.com/proceedings/sugi30/034-30.pdf

And thanks to PROC FCMP, you can now author your own favorite SAS function:
Gee! No, GTL! Visualizing Data With The SAS Graph Template Language (featuring my DIY "GetDeltas" function)
http://support.sas.com/resources/papers/proceedings13/286-2013.pdf
- Stacey Syphus on October 5, 2016 4:54 pm
  
  Thanks, Grandpa Ted! (you have to check out his papers to get that reference...)
Larissa Martin on September 23, 2016 12:21 pm

all of the above are my favorite! and +500 that were not mentioned, b/c I am testing all of mva and tk functions :-).
- Stacey Syphus on October 5, 2016 4:55 pm
  
  Sounds like that will be PLENTY to keep you busy for a full career :)
Wes Patton on September 23, 2016 11:34 am

No love for coalesce/coalesceC? So much utility, replaces any instance of 'if not missing X then X else if not missing Y then Y... etc'.
- David Pope on September 26, 2016 3:14 pm
  
  Wes - you beat me to it, I've always found coalesce/coalescec very useful.
- Peter Lancashire on November 9, 2016 7:53 am
  
  Yes, I was thinking of voting for that. It is invaluable in complex SQL outer joins to get default processing right. Saves a ton of obscure DATA step code.
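
As a rough sketch of the pattern this thread describes, COALESCE returns the first nonmissing numeric argument and COALESCEC does the same for character arguments. The variable names and values below are made up.

```sas
data _null_;
   length a b c $ 8;
   x = .;   y = .;        z = 42;
   a = ' '; b = 'backup'; c = 'last';

   first_num  = coalesce(x, y, z);    /* first nonmissing numeric value: 42 */
   first_char = coalescec(a, b, c);   /* first nonmissing character value: backup */

   put first_num= first_char=;
run;
```

Each call stands in for a chain of IF/ELSE IF tests on missing values.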
Bill Csont on September 23, 2016 11:03 am

How about strip() which combines left and trim.
- Stacey Syphus on October 5, 2016 4:58 pm
  
  Very good point... I probably should have included that one! I always mention it in class.
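
Several commenters compare STRIP with the older nested calls, so here is a minimal sketch of the two side by side. The brackets are only there to make any remaining blanks visible in the log.

```sas
data _null_;
   padded = '   SAS   ';

   with_strip = '[' || strip(padded) || ']';       /* removes leading and trailing blanks */
   with_nest  = '[' || trim(left(padded)) || ']';  /* left-align first, then trim */

   put with_strip= with_nest=;
run;
```

For a nonblank value like this one, both expressions produce [SAS].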
Warren Kuhfeld on September 23, 2016 9:58 am

You mention the CAT* functions, which make string processing much easier. Two other newer functions (I started with SAS 79, so they seem newer to me) that I use a lot now are IFC and IFN.
- Stacey Syphus on October 5, 2016 4:57 pm
  
  Can't say I was doing anything with SAS 79, but I do still think of the CAT functions as new! What a great addition...
Michelle Homes on September 23, 2016 4:37 am

I recall this great blog post that outlined how to do fuzzy matching and the spedis function which your readers may like http://blogs.sas.com/content/sgf/2015/01/27/how-to-perform-a-fuzzy-match-using-sas-functions/

I've always loved the intnx and intck functions for financial date calculations and the Perl regular expression functions for character manipulation. Handy tip sheet at https://support.sas.com/rnd/base/datastep/perl_regexp/regexp-tip-sheet.pdf
- Stacey Syphus on September 23, 2016 10:26 am
  
  Great links! Thanks for sharing.
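
To give a flavor of the INTNX and INTCK date calculations mentioned above, here is a small sketch. The dates are arbitrary, and the 'end' alignment argument is just one of the available choices.

```sas
data _null_;
   start = '15JAN2016'd;

   /* move forward one month and align to the end of that month: 29FEB2016 */
   next_month_end = intnx('month', start, 1, 'end');

   /* count month boundaries crossed between the two dates: 8 */
   months_between = intck('month', start, '22SEP2016'd);

   put next_month_end= date9. months_between=;
run;
```

Note that INTCK counts interval boundaries, not elapsed full months, which matches the description in the table.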
Chris Brooks on September 22, 2016 7:22 pm

If I'm allowed more than one I'd have to choose the PRX family of functions. The ability to use Perl Regular Expressions in SAS can be a huge help if you're doing a lot of complex text processing.
- Stacey Syphus on September 23, 2016 10:35 am
  
  Absolutely... We have a new 1/2 day Live Web class called Take Your SAS Programming Skills to the Next Level where we cover PRX functions. It was pretty popular this year at SGF. Chris, sounds like you are already a PRX expert, but maybe other readers would like to learn more and this class would be the right fit! We should have public course dates scheduled soon.
  - Peter on October 5, 2016 4:48 am
    
    regarding PRX functions,
    I would suggest all interested in these have a look at how PRX can be "built-in" in a user-defined informat with PROC FORMAT. The first doc appeared in Rick Langston's paper at http://support.sas.com/resources/papers/proceedings12/245-2012.pdf
    
    That prompts me also, to adapt the description for the "special" function INPUT(), I think it should read
    "The most amazing facility to parse text"
    >>>>>--------->> any conversion you could wish for!
    There probably should be something similar for PUT()
- Peter Timusk on October 27, 2016 9:52 pm
  
  Also PRX are favorites for me and PRXMatch in particular which allows for data driven programming.
Quentin on September 22, 2016 3:19 pm

Thanks to Chris Hemedinger, I learned of the UUIDGEN() function a few years back on his blog (http://blogs.sas.com/content/sasdummy/2012/10/19/creating-a-somewhat-unique-id-using-the-uuidgen-function/), and have been happily using it ever since. I'd guess it is one of the less-known functions. lexjansen.com only comes up with two papers that mention it! http://www.lexjansen.com/search/searchresults.php?q=uuidgen
- Stacey Syphus on September 23, 2016 10:29 am
  
  I also have Chris to thank for LOTS of things I have learned! I must have missed the UUIDGEN function post, but just checked it out. Very cool!
jesse smedley on September 22, 2016 3:18 pm

My favorites are INTNX and SUBSTR.
- Stacey Syphus on September 23, 2016 10:25 am
  
  Those 2 are definitely in my top 5. Thanks, Jesse!
Adraine Upshaw on September 22, 2016 2:36 pm

I like PUT and INPUT; although I always get them mixed up and have to look up the documentation to make sure I'm using the correct one.
- Stacey Syphus on September 23, 2016 10:24 am
  
  LOL! For at least a year I had a post-it on my monitor reminding me which function to use for which type of variable conversion... You're not alone :)
- Gemma on September 26, 2016 3:25 pm
  
  Hi Adraine,
  
  I was taught to "use the alphabet" to remember which function to use for the different variable conversions (thanks Sam!). So, take the first letters of the following words (character, numeric, input and put) and put them in alphabetical order:
  Character
  Input
  Numeric
  Put
  Character
  
  The functions are in the middle of the variable types!
  So input does character to numeric and put does numeric to character.
  There are probably plenty of other ways to remember it but this stuck with me.
- Kip Hayden-Sr on November 8, 2016 10:53 am
  
  I just remember that "numeric" is shorter than "character", and PUT is shorter than INPUT. So if you're starting with a numeric to convert to character, it's the shorter one--PUT. And if you're starting with a character to convert to numeric, it's the longer one--INPUT.
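
Whichever memory trick you prefer, the two conversions can be sketched as follows. The variable names, the 8. informat and the Z5. format are illustrative choices.

```sas
data _null_;
   char_amount = '1234';
   num_id      = 42;

   num_amount = input(char_amount, 8.);  /* character to numeric, using an informat */
   char_id    = put(num_id, z5.);        /* numeric to character, using a format: 00042 */

   put num_amount= char_id=;
run;
```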
Ron Cody on September 22, 2016 1:05 pm

My two favorites are Missing and Compress

Ron Cody
- Stacey Syphus on September 23, 2016 10:39 am
  
  You would know! I'll give a plug for your great book, SAS Functions by Example. Thanks, Ron!
  - Duff Cooper on September 29, 2016 2:06 pm
    
    Yep, love that book. And all of Ron's books!
- Mark Jordan on September 26, 2016 1:54 pm
  
  I had been using compress for years to remove a character or two. But when I took Ron's "SAS Functions by Example" course, I learned the true power of this amazing function! With modifiers, COMPRESS is the most powerful text cleanup function you can use without learning PERL regular expressions. Thanks, Ron, for teaching this Jedi a new trick!
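
As one small example of the COMPRESS modifiers praised above, this sketch keeps only the digits of a value. The phone number is made up; the second argument is left empty so that the modifiers alone decide what is kept.

```sas
data _null_;
   phone = '(919) 555-0100';

   /* k = keep the listed characters instead of removing them, d = digits */
   digits_only = compress(phone, , 'kd');

   put digits_only=;   /* 9195550100 */
run;
```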
Susan Slaughter on September 22, 2016 12:34 pm

Stacey,

Thanks for a nice blog. I can understand why you didn't include it, but I have always thought the GEODIST function was very cool. For anyone who doesn't know, it returns the geodetic distance in kilometers or miles between two latitude and longitude coordinates. Also the various ZIP functions (ZIPCITY, ZIPCITYDISTANCE, ZIPFIPS, ZIPNAME, ZIPNAMEL, ZIPSTATE) can be very useful. ZIPCITYDISTANCE even gives you the distance between two zip codes.

Susan
- mike zdeb on September 22, 2016 4:35 pm
  
  FYI ... ZIPCITYDISTANCE gets the lat/long of the zips used in the function from the SASHELP.ZIPCODE data set and uses the same method to calculate distance as the GEODIST function. One thing to watch for is that ZIPCITYDISTANCE returns miles, while the default for GEODIST (as already stated) is kilometers. You can also get miles from GEODIST by using the M option in the function call.
- Stacey Syphus on September 23, 2016 10:23 am
  
  I am a big fan of the ZIP functions! I know I've had several students light up when I mentioned the ZIPCITY function.
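
The unit difference described in this thread can be seen in a short sketch. The coordinates are approximate and the ZIP codes are only illustrative; neither comes from the post. ZIPCITYDISTANCE assumes the SASHELP.ZIPCODE data set is available.

```sas
data _null_;
   /* approximate coordinates in degrees, for illustration only */
   lat1 = 35.79; long1 = -78.78;
   lat2 = 37.77; long2 = -122.42;

   dist_km    = geodist(lat1, long1, lat2, long2);       /* default unit: kilometers */
   dist_miles = geodist(lat1, long1, lat2, long2, 'M');  /* M option: miles */

   /* looks up both ZIP codes in SASHELP.ZIPCODE and returns miles */
   zip_miles  = zipcitydistance('27513', '94105');

   put dist_km= dist_miles= zip_miles=;
run;
```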
