SAS 9 on Microsoft Windows
|Stable release||9.4 / July 10, 2013|
|Operating system||Windows, IBM mainframe, Unix/Linux, OpenVMS Alpha|
SAS (Statistical Analysis System; not to be confused with SAP) is a software suite developed by SAS Institute for advanced analytics, business intelligence, data management, and predictive analytics. It is the largest market-share holder for advanced analytics.
SAS was developed at North Carolina State University from 1966 until 1976, when SAS Institute was incorporated. SAS was further developed in the 1980s and 1990s with the addition of new statistical procedures, additional components and the introduction of JMP. A point-and-click interface was added in version 9 in 2004. A social media analytics product was added in 2010.
Technical overview and terminology
SAS is a software suite that can mine, alter, manage and retrieve data from a variety of sources and perform statistical analysis on it. SAS provides a graphical point-and-click user interface for non-technical users and more advanced options through the SAS programming language. SAS programs have a DATA step, which retrieves and manipulates data, usually creating a SAS data set, and a PROC step, which analyzes the data.
Each step consists of a series of statements. The DATA step has executable statements that result in the software taking an action, and declarative statements that provide instructions to read a data set or alter the data's appearance. The DATA step has two phases, compilation and execution. In the compilation phase, declarative statements are processed and syntax errors are identified. Afterwards, the execution phase processes each executable statement sequentially. Data sets are organized into tables with rows called "observations" and columns called "variables". Additionally, each piece of data has a descriptor and a value.
The PROC step consists of PROC statements that call upon named procedures. Procedures perform analysis and reporting on data sets to produce statistics, analyses and graphics. There are more than 300 procedures and each one contains a substantial body of programming and statistical work. PROC statements can also display results, sort data or perform other operations. SAS Macros are pieces of code or variables that are coded once and referenced to perform repetitive tasks.
SAS data can be published in HTML, PDF, Excel and other formats using the Output Delivery System, which was first introduced in 2007. The SAS Enterprise Guide is SAS' point-and-click interface. It generates code to manipulate data or perform analysis automatically and does not require SAS programming experience to use.
- Base SAS - Basic procedures and data management
- SAS/STAT - Statistical analysis
- SAS/GRAPH - Graphics and presentation
- SAS/OR - Operations research
- SAS/ETS - Econometrics and Time Series Analysis
- SAS/IML - Interactive matrix language
- SAS/AF - Applications facility
- SAS/QC - Quality control
- SAS/INSIGHT - Data mining
- SAS/PH - Clinical trial analysis
- Enterprise Miner - data mining
Sample code - "Hello World!"
data _null_; put "Hello World!"; run; %put "Hello World!";
DATA distance; Miles = 100; Kilometers = 1.61 * Miles; RUN;
DATA step using an existing table:
DATA distance_mi; SET distance_km; Miles = 0.62137*Kilometers; RUN;
The development of SAS began in 1966 after North Carolina State University re-hired Anthony Barr to program his analysis of variance and regression software so that it would run on IBM-360 computers. The project was funded by the National Institute of Health and was originally intended to analyze agricultural data to improve crop yields. Barr was joined by student James Goodnight, who developed the software's statistical routines, and the two became project-leaders. In 1968, Barr and Goodnight integrated new multiple regression and analysis of variance routines. In 1972, after issuing the first release of SAS, the project lost its funding. According to Goodnight, this was because NIH only wanted to fund projects with medical applications. Goodnight continued teaching at the university for a salary of $1 and access to mainframe computers for use with the project, until it was funded by the University Statisticians of the Southern Experiment Stations the following year. John Sall joined the project in 1973 and contributed to the software's econometrics, time series, and matrix algebra. Another early participant, Caroll G. Perkins, contributed to SAS' early programming. Jolayne W. Service and Jane T. Helwig created SAS' first documentation.
The first versions of SAS were named after the year in which they were released. In 1971, SAS 71 was published as a limited release. It was used only on IBM mainframes and had the main elements of SAS programming, such as the DATA step and the most common procedures in the PROC step. The following year a full version was released as SAS 72, which introduced the MERGE statement and added features for handling missing data or combining data sets. In 1976, Barr, Goodnight, Sall, and Helwig took the project out of North Carolina State and incorporated SAS Institute, Inc.
SAS was re-designed in SAS 76 with an open architecture that allowed for compilers and procedures. The INPUT and INFILE statements were improved so they could read most data formats used by IBM mainframes. Generating reports was also added through the PUT and FILE statements. The ability to analyze general linear models was also added as was the FORMAT procedure, which allowed developers to customize the appearance of data. In 1979, SAS 79 added support for the CMS operating system and introduced the DATASETS procedure. Three years later, SAS 82 introduced an early macro language and the APPEND procedure.
SAS version 4 had limited features, but made SAS more accessible. Version 5 introduced a complete macro language, array subscripts, and a full-screen interactive user interface called Display Manager. In 1985, SAS was rewritten in the C programming language. This allowed for the SAS' Multivendor Architecture that allows the software to run on UNIX, MS-DOS, and Windows. It was previously written in PL/I, Fortran, and assembly language.
In the 1980s and 1990s, SAS released a number of components to complement Base SAS. SAS/GRAPH, which produces graphics, was released in 1980, as well as the SAS/ETS component, which supports econometric and time series analysis. A component intended for pharmaceutical users, SAS/PH-Clinical, was released in the 1990s. The Food and Drug Administration standardized on SAS/PH-Clinical for new drug applications in 2002. Vertical products like SAS Financial Management and SAS Human Capital Management (then called CFO Vision and HR Vision respectively) were also introduced. JMP was developed by SAS co-founder John Sall and a team of developers to take advantage of the graphical user interface introduced in the 1984 Apple Macintosh and shipped for the first time in 1989. Updated versions of JMP were released continuously after 2002 with the most recent release being from 2012.
SAS version 6 was used throughout the 1990s and was available on a wider range of operating systems, including Macintosh, OS/2, Silicon Graphics, and Primos. SAS introduced new features through dot-releases. From 6.06 to 6.09, a user interface based on the windows paradigm was introduced and support for SQL was added. Version 7 introduced the Output Delivery System (ODS) and an improved text editor. ODS was improved upon in successive releases. For example, more output options were added in version 8. The number of operating systems that were supported was reduced to UNIX, Windows and z/OS, and Linux was added. SAS version 8 and SAS Enterprise Miner were released in 1999.
In 2002 the Text Miner software was introduced. Text Miner analyzes text data like emails for patterns in Business Intelligence applications. In 2004 SAS Version 9.0 was released, which was dubbed “Project Mercury” and was designed to make SAS accessible to a broader range of business users. Version 9.0 added custom user interfaces based on the user’s role and established the point-and-click user interface of SAS Enterprise Guide as the software's primary graphical user interface (GUI). The Customer Relationship Management (CRM) features were improved in 2004 with SAS Interaction Management. In 2008 SAS announced Project Unity, designed to integrate data quality, data integration and master data management.
SAS sued World Programming, the developers of a competing implementation, World Programming System, alleging that they had infringed SAS's copyright in part by implementing the same functionality. This case was referred from the United Kingdom's High Court of Justice to the European Court of Justice on 11 August 2010. In May 2012, the European Court of Justice ruled in favor of World Programming, finding that "the functionality of a computer program and the programming language cannot be protected by copyright."
A free version was introduced for students in 2010. SAS Social Media Analytics, a tool for social media monitoring, engagement and sentiment analysis, was also released that year. SAS Rapid Predictive Modeler (RPM), which creates basic analytical models using Microsoft Excel, was introduced that same year. JMP 9 in 2010 added a new interface for using the R programming language from JMP and an add-in for Excel. The following year, a High Performance Computing appliance was made available in a partnership with Teradata and EMC Greenplum. In 2011, the company released Enterprise Miner 7.1.
As of 2011 SAS' largest set of products is its line for customer intelligence. Numerous SAS modules for web, social media and marketing analytics may be used to profile customers and prospects, predict their behaviors and manage and optimize communications. SAS also provides the SAS Fraud Framework. The framework's primary functionality is to monitor transactions across different applications, networks and partners and use analytics to identify anomalies that are indicative of fraud. SAS Enterprise GRC (Governance, Risk and Compliance) provides risk modeling, scenario analysis and other functions in order to manage and visualize risk, compliance and corporate policies. There is also a SAS Enterprise Risk Management product-set designed primarily for banks and financial services organizations.
SAS' products for monitoring and managing the operations of IT systems are collectively referred to as SAS IT Management Solutions. SAS collects data from various IT assets on performance and utilization, then creates reports and analyses. SAS' Performance Management products consolidate and provide graphical displays for key performance indicators (KPIs) at the employee, department and organizational level. The SAS Supply Chain Intelligence product suite is offered for supply chain needs, such as forecasting product demand, managing distribution and inventory and optimizing pricing. There is also a "SAS for Sustainability Management" set of software to forecast environmental, social and economic effects and identify causal relationships between operations and an impact on the environmental or ecosystem.
Comparison to other products
In a 2005 article for the Journal of Marriage and Family comparing statistical packages from SAS and its competitors Stata and SPSS, Alan C. Acock wrote that SAS programs provide "extraordinary range of data analysis and data management tasks," but were difficult to use and learn. SPSS and Stata, meanwhile, were both easier to learn (with better documentation) but had less capable analytic abilities, though these could be expanded with paid (in SPSS) or free (in Stata) add-ons. Acock concluded that SAS was best for power users, while occasional users would benefit most from SPSS and Stata. A comparison by the University of California, Los Angeles, gave similar results.
Competitors such as Revolution Analytics and Alpine Data Labs advertise their products as considerably cheaper than SAS'. In a 2011 comparison, Doug Henschen of InformationWeek found that start-up fees for the three are similar, though he admitted that the starting fees were not necessarily the best basis for comparison. SAS' business model is not weighted as heavily on initial fees for its programs, instead focusing on revenue from annual subscription fees.
SAS is the largest market-share holder in advanced analytics with 36.2 percent of the market as of 2012. It is the fifth largest market-share holder for BI software with a 6.9% share and the largest independent vendor. It competes in the BI market against conglomerates, such as SAP BusinessObjects, IBM Cognos, SPSS Modeler, Oracle Hyperion, and Microsoft BI. SAS has been named in the Gartner Leader's Quadrant for Data Integration Tools and for Business Intelligence and Analytical Platforms. A study published in 2011 in BMC Health Services Research found that SAS was used in 42.6 percent of data analyses in health service research, based on a sample of 1,139 articles drawn from three journals.
- Comparison of numerical analysis software
- Comparison of OLAP Servers
- JMP (statistical software), also from SAS Institute Inc.
- SAS language
- "Encyclopedia of Research Design Encyclopedia of research design". 2010. doi:10.4135/9781412961288. ISBN 9781412961271.
- SAS Institute Inc. and World Programming Limited (England and Wales High Court (Chancery Division) July 23, 2010). Text
- Lora D. Delwiche; Susan J. Slaughter (2012). The Little SAS Book: A Primer : a Programming Approach. SAS Institute. p. 6. ISBN 978-1-61290-400-9.
- Arthur Li (10 April 2013). Handbook of SAS DATA Step Programming. CRC Press. p. 149. ISBN 978-1-4665-5238-8.
- Buck, Debbie. "A Hands-On Introduction to SAS DATA Step Programming". SUGI 30: SAS Institute. Retrieved October 2, 2013.
- N. Jyoti Bass; K. Madhavi Lata & Kogent Solutions (1 September 2007). Base Sas Programming Black Book, 2007 Ed. Dreamtech Press. pp. 365–. ISBN 978-81-7722-769-7.
- Tolbert, William (December 1, 2010). "How to Win Friends and Influence People with the SAS Output Delivery System". Clinical Medicine & Research 8 (3–4): 189–190. doi:10.3121/cmr.2010.943.c-c1-04. Retrieved October 2, 2013.
- Der, G.; B. S. Everittt (March 10, 2009). "Basic Statistics using SAS Enterprise Guide". Journal of the Royal Statistical Society: Series A (Statistics in Society) 172 (2): 530. doi:10.1111/j.1467-985X.2009.00588_2.x.
- John R. Schermerhorn (11 October 2011). Exploring Management. John Wiley & Sons. p. 3. ISBN 978-0-470-87821-7.
- Spector, Phil. "An Introduction to the SAS System" (PDF). Berkeley. Archived from the original on October 12, 2013. Retrieved October 4, 2013.
- Chapman, David (2012). "Determine what SAS Version and Components are available" (PDF). NESUG.
- Hallahan, C. (1995). "Data Analysis Using SAS". Sociological Methods & Research 23 (3): 373–391. doi:10.1177/0049124195023003006.
- Nourse, E. Shepley; Greenberg, Bernard G.; Cox, Gertrude M.; Mason, David D.; Grizzle, James E.; Johnson, Norman L.; Jones, Lyle V.; Monroe, John; Simons, Gordon D. (1978). "Statistical Training and Research: The University of North Carolina System". International Statistical Review / Revue Internationale de Statistique 46 (2): 171. doi:10.2307/1402812. ISSN 0306-7734. JSTOR 1402812.
- Alan Agresti; Xiao-Li Meng (2 November 2012). Strength in Numbers: The Rising of Academic Statistics Departments in the U. S.: The Rising of Academic Statistics Departments in the U.S.. Springer. p. 177. ISBN 978-1-4614-3649-2.
- SAS Institute FDA Intellectual Partnership for Efficient Regulated Research Data Archival and Analyses, Presented at Duke University, April 12, 2000, retrieved September 28, 2011
- Dalesio, Emery (May 5, 2001). "Little-known software giant to raise its profile". Associated Press. Retrieved April 8, 2014.
- Kaplan, David (January 22, 2010). "SAS: A new no. 1 best employer". Fortune. Retrieved April 8, 2014.
- "SAS corporate timeline". March 3, 2011. Retrieved October 17, 2011.
- Barr, Anthony; James Goodnight (1976). "The SAS Staff".
SAS 72 and SAS 76 are attributed to Barr, Goodnight, Service, Perkins, and Helwig
- (Barr & Goodnight et al. 1979:front matter) Attribution of the development of various parts of the system to Barr, Goodnight, and Sall.
- Oral History Interview with Jim Goodnight, Oral Histories of the American South, July 22, 1999, retrieved April 8, 2014
- Aster, Rick. "History of SAS version". Global Statements. Retrieved October 4, 2013.
- Barr, Anthony; James Goodnight and James Howard (1971). "Statistical analysis system". North Carolina State University. Retrieved October 3, 2013.
- Service, Jolayne (1972). "A User's Guide to the Statistical Analysis System". North Carolina State University.
- Shacklett, Mary (September 5, 2013). "See if the R language fits in your big data toolkit". The New Republic. Retrieved October 3, 2013.
- Barr, Anthony; James Goodnight, James Sall, John Helwig and Jane T (1979). OCLC 4984363 SAS Programmer's Guide, 1979 Edition. SAS Institute, Inc.
- "SAS history". SAS Institute. Retrieved October 4, 2013.
- Ian Cox; Marie A. Gaudard; Philip J. Ramsey; Mia L. Stephens, Leo Wright (21 December 2009). Visual Six Sigma: Making Data Analysis Lean. John Wiley & Sons. pp. 23–. ISBN 978-0-470-50691-2. Retrieved 16 November 2012.
- "Growing up". JMP Forward (JMP). p. 5. Retrieved October 4, 2013.
- Saul, John (Winter 2010). "JMP is 20 Years Old". JMPer Cable. Retrieved October 13, 2011.
- "Launches SAS JMP 8 for Mac and Linux". Ti Journal. April 11, 2009. Retrieved December 30, 2012.
- "New Features in JMP 9". JMP. Retrieved December 30, 2012.
- Bridgewater, Adriian (November 3, 2010). "JMP Genomics 5: Data Visualization & Exploration". Dr. Dobbs Journal. Retrieved May 31, 2012.
- Shipp, Charles; Kirk Paul Lafler. "Proficiency in JMP Visualization". PharmaSUG 2012. Retrieved December 30, 2012.
- Taylor, James (August 10, 2011). "First Look – JMP Pro". JTonEDM. Retrieved May 31, 2012.
- "Supported Operating Systems". SAS Institute. Retrieved October 6, 2012.
- Dalesio, Emery (February 5, 2002). "Text Miner program to bolster business intelligence". Associated Press. Retrieved April 8, 2014.
- Steven, Dave (July 29, 2002). "SAS is Starting to Look Even Better...". Pennsylvania State University. Retrieved October 17, 2011.
- Whiting, Rick (March 31, 2004). "SAS Extends Business Intelligence to the Masses". InformationWeek. Retrieved October 17, 2011.
- Callaghan, Dennis (September 26, 2002). "SAS to Add to Analytical CRM Arsenal". eWeek. Retrieved October 17, 2011.
- Gonsalves, Antone (October 10, 2008). "SAS, DataFlux Unveil 'Project Unity'". InformationWeek. Retrieved October 17, 2011.
- "Reference for a preliminary ruling from High Court of Justice (Chancery Division) (England and Wales) made on 11 August 2010 - SAS Institute Inc. v World Programming Ltd". European Court of Justice. Retrieved May 19, 2012.
- "The functionality of a computer program and the programming language cannot be protected by copyright". European Court of Justice. Retrieved May 19, 2012.
- Hardy, Quentin (June 9, 2011). "SAS-We Spurned IBM, Now to Win". Forbes. Retrieved October 17, 2011.
- Greenberg, Paul (December 31, 2010). "The CRM Watchlist Part II: The Usual Suspects". ZDNet. Retrieved October 4, 2013.
- "Computing Seminars: Introduction to SAS Macro Language". UCLA Academic Technology Services. Retrieved October 4, 2013.
- Howson, Cindi (September 7, 2010). "SAS Takes Predictive Analytics Mainstream". InformationWeek. Retrieved October 4, 2013.
- New Features in JMP 9, JMP, retrieved December 30, 2012
- Bridgewater, Adriian (November 3, 2010). "JMP Genomics 5: Data Visualization & Exploration". Dr. Dobbs Journal. Retrieved May 31, 2012.
- Laskowski, Nicole (October 26, 2011). "SAS ups 'big data' ante with high-performance computing platform". SearchBusinessAnalytics. Retrieved October 4, 2013.
- Sheina, Madan; Surya Mukherjee (October 17, 2011). "SAS adds in-memory to high-performance computing". Ovum.
- Taylor, James (November 11, 2011). "First Look – SAS Enterprise Miner 7.1". Retrieved October 4, 2013.
- Whiting, Rick (May 11, 2004). "SAS Ships Customer-Intelligence App". InformationWeek. Retrieved January 7, 2014.
- Yasin, Rutrell (May 10, 2013). "Social network analysis, predictive coding enlisted to fight fraud". Government Computer News. Retrieved December 11, 2013.
- Hunsberger, Brent (August 23, 2013). "Credit card blocked? Get used to false positive fraud alerts for now". The Oregonian. Retrieved December 11, 2013.
- Albergotti, Reed (July 22, 2013). "IRS, States Call on IBM, LexisNexis, SAS to Fight Tax Fraud". The Wall Street Journal.
- Kenealy, Bill (March 25, 2009). "Assessing Business Intelligence". Insurance Networking News. Retrieved December 11, 2013.
- Stephani, Justin (May 10, 2012). "SAS Upgrades GRC Software Quicker Compliance". Information Management.
- McClean, Chris (November 30, 2011). "The Forrester Wave™: Enterprise, Governance, Risk, And Compliance Platforms, Q4 2011". Retrieved December 12, 2013.
- Solution Overview, Gleanster, retrieved December 12, 2013
- Hui Pan, Editor. Iraq Telecom Monthly Newsletter November 2009. Information Gatekeepers Inc. pp. 5–. GGKEY:ZPZ8BD13DR8.
- "SAS Launches Suite Of Solutions For IT". ITManagementNews. September 22, 2004. Retrieved December 12, 2013.
- Gary Cokins; Karl D. Schubert; Michael H. Hugos; Randy Betancourt, Alyssa Farrell, Bill Flemming, Jonathan Hujsak (24 September 2010). CIO Best Practices: Enabling Strategic Value With Information Technology. John Wiley & Sons. pp. 64–. ISBN 978-0-470-91255-3.
- Brown, Timothy, Using SAS Strategically: A Case Study, NESUG, retrieved December 12, 2013
- "Product Review SAS SPM Strategic Performance Management". FSN. October 8, 2006. Retrieved December 12, 2013.
- Callaghan, Dennis. "SAS to Extend Its Supply Chain Offerings". eWeek. Retrieved December 12, 2013.
- Li, Kang (May 5, 2008). "Software for sustainability management unveiled". New Straits Times. p. 41.
- "Products & Solutions Index". SAS. Retrieved December 12, 2013.
- Acock, Alan C (November 2005). "SAS, Stata, SPSS: A Comparison". Journal of Marriage and Family 67 (4): 1093–1095. doi:10.1111/j.1741-3737.2005.00196.x.
- "Compare Packages". University of California, Los Angeles. Retrieved 12 January 2014.
- Henschen, Dough (July 26, 2011). "Low-Cost Options For Predictive Analytics Challenge SAS, IBM". InformationWeek. Retrieved January 12, 2014.
- Ranii, David (January 20, 2012). "SAS' revenue up 12% in 2011". The News & Observer (Raleigh, North Carolina). Retrieved January 12, 2014.
- Turchin, Brian. "SAS Profile -- Going Its Own Way". Software Business Online.
- Vesset, Dan; David Schubmehl, Brian McDonough and Mary Wardley (June 2013). "Worldwide Business Analytics Software 2013-2017 Forecast and 2012 Vendor Shares". IDC. Retrieved October 2, 2013.
- Pendse, Nigel (March 7, 2008). "Consolidations in the BI industry". The OLAP Report.
- Thoo, Eric; Ted Friedman and Mark Beyer (July 17, 2013). "Magic Quadrant for Data Integration Tools". Gartner.
- Schlegel, Kurt; Rita Sallam, Daniel Yuen and Joao Tapadinhas (February 5, 2013). "Magic Quadrant for Business Intelligence and Analytics Platforms". Gartner. Retrieved October 1, 2013.
- Dembe, A. E.; Partridge, J. S.; Geist, L. C. (2011). "Statistical software applications used in health services research: Analysis of published studies in the U.S". BMC Health Services Research 11: 252. doi:10.1186/1472-6963-11-252. PMC 3205033. PMID 21977990.
- Greenberg, Bernard G.; Cox, Gertrude M.; Mason, David D.; Grizzle, James E.; Johnson, Norman L.; Jones, Lyle V.; Monroe, John; Simmons, Gordon D., Jr. (1978). Nourse, E. Shepley, ed. "Statistical Training and Research: The University of North Carolina System". International Statistical Review 46 (2): 171–207. doi:10.2307/1402812
|Wikiquote has quotations related to: SAS (software)|
|Wikibooks has a book on the topic of: SAS|
- SAS homepage
- A Glossary of SAS terminology
- The SAS customer community Wiki
- Wikiversity:Data Analysis using the SAS Language