IBM Data Science Experience

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Data Science Experience (DSX) is IBM’s platform for data science, a workspace that includes multiple collaboration and open-source tools for use in data science.[1]

In DSX, a data scientist can create a project with a group of collaborators, all having access to various analytics models and using various languages (R/Python/Scala). DSX brings together staple open source tools including RStudio, Spark and Python in an integrated environment, along with additional tools such as a managed Spark service and data shaping facilities, in a secure and governed environment.[2]

Data Science Experience provides access to data sets that are available through Watson Data Platform, on-premises or on the cloud. The platform also has a large community and embedded resources such as articles on the latest developments from the data science world and public data sets. DSX is available in on-premises, cloud, and desktop forms.

Microsoft and other technology vendors also have products in the machine learning market, which is developing rapidly.[3]

History[edit]

IBM announced the launch of DSX at the Spark Summit 2016 in San Francisco. IBM invested $300 million in efforts to make Spark the analytics operating system for all of the company's big data efforts.[4]

In June 2017, Hortonworks and IBM announced their partnership to collaborate on IBM's Data Science Experience. DSX addresses every step of the Data Science lifecycle, offering notebooks, collaboration spaces, tutorials, Machine Learning models and support for Spark, R, Python, and other ML languages.[5] Hortonworks previously had a partnership relationship with Microsoft.[6]

See also[edit]

References[edit]