|Developer(s)||University of Utah, NYU-Poly|
2.2.4 / May 3, 2016
|Type||Scientific workflow management; Scientific visualization|
|License||BSD License 3-clause|
VisTrails is a scientific workflow management system developed at the Scientific Computing and Imaging Institute at the University of Utah that provides support for data exploration and visualization. It is written in Python and employs Qt via PyQt bindings. The system is open source, released under the GPL v2 license. The pre-compiled versions for Windows, Mac OS X, and Linux come with an installer and several packages, including VTK, matplotlib, and ImageMagick. VisTrails also supports user-defined packages.
VisTrails is a new system that provides provenance management support for exploratory computational tasks. It combines features of workflow and visualization systems. Similar to workflow systems, it allows the combination of loosely coupled resources, specialized libraries, and grid and Web services. Similar to some visualization systems, it provides a mechanism for parameter exploration and comparison of different results. But unlike these other systems, VisTrails was designed to manage exploratory processes in which computational tasks evolve over time as a user iteratively formulates and tests hypotheses. A key distinguishing feature of VisTrails is its comprehensive provenance infrastructure that maintains detailed history information about the steps followed in the course of an exploratory task. VisTrails leverages this information to provide novel operations and user interfaces that streamline this process.
VisTrails has been developed for exploratory visualization, but the system is general, and provides functionality in the following areas:
- Querying and re-using history.
- Support for collaborative exploration.
- Scalable derivation of data products, parameter exploration, multi-view and comparative visualization.
VisTrails is the result of a collaborative effort between computer scientists Cláudio Silva and Juliana Freire. Initial development began in 2004 by graduate students at the University of Utah. Although the first prototypes were implemented in C++, the current version of VisTrails is written in Python. The first public release was in September 2007.
A common use for VisTrails is scientific visualization. Visualizations generated as part of a workflow are rendered in a spreadsheet-style interface, allowing multiple visualizations from different versions of a workflow to be viewed and compared simultaneously. The VisTrails spreadsheet currently supports VTK and HTML rendering.
VisTrails supports four basic modes, or views. Each view interacts with the underlying workflow in a different way.
- The Pipeline view lets the user create, view and edit a visual pipeline of a workflow. The pipelines are typically rendered from top to bottom, with each module's output connecting to the input of the module(s) below it. A module corresponds roughly to a data type or a function.
- The History view displays a tree structure representing the various versions of the pipeline. Each time a change is made in the Pipeline view, a new node is added to the version tree in the History view. The user can optionally assign names, or tags, to one or more nodes of the tree. To conserve space, VisTrails does not render a node unless it is a root or leaf node or has a tag.
- The Query view is similar in appearance to the Pipeline view. Here, the user constructs partial pipelines. When the query is executed, VisTrails identifies the pipeline versions which contain the specified partial pipeline. This is analogous to the Query By Example method of performing database queries.
- The Exploration view automates the process of creating several workflow pipelines that differ only by a few parameter values.
In 2007, the University of Utah formed VisTrails, Inc., a spinoff company intended to commercialize VisTrails technology. Development for the free version of VisTrails is currently funded by the University of Utah and VisTrails, Inc. The company's first product is a plugin for the 3D modeling software Maya. While the main VisTrails distribution is free software, the VisTrails plugin for Maya is distributed under a closed-source/proprietary license.
Version release dates history
- 2.2: April 2015
- 2.1: December 2013
- 2.0: May 2012
- 1.7: June 2011
- 1.6.2: April 2011
- 1.5.1: August 2010
- 1.4.2: March 2010
- 1.4: January 2010
- 1.3: July 2009
- 1.2: July 2008
- 1.1: May 2008
- 1.0: September 2007
- Bioinformatics workflow management systems
- Kepler scientific workflow system
- Scientific visualization
- "LICENSE file in code repository". github.com.
- Cláudio T. Silva, Juliana Freire, and Steven Callahan. "Provenance for Visualizations: Reproducibility and Beyond" (PDF). Computing in Science & Engineering, 9(5), pp. 82-90, 2007.
- Juliana Freire, David Koop, Emanuele Santos, and Cláudio T. Silva. "Provenance for Computational Tasks: A Survey" (PDF). Computing in Science & Engineering, 10(3), pp. 11-21, 2008.
- Carlos E. Scheidegger, David Koop, Emanuele Santos, Huy T. Vo, Steven P. Callahan, Juliana Freire, and Cláudio T. Silva. "Tackling the Provenance Challenge one layer at a time" (PDF). Concurrency and Computation: Practice and Experience, 20(5), pp. 473-483, 2008.
- Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire and Cláudio T. Silva. "Querying and Creating Visualizations by Analogy" (PDF). IEEE Transactions on Visualization and Computer Graphics, 13(6), pp. 1560-1567, 2007.
- Tommy Ellkvist, David Koop, Erik Anderson, Juliana Freire, and Cláudio T. Silva. "Using Provenance to Support Real-Time Collaborative Design of Workflows" (PDF). Proceedings of International Provenance and Annotation Workshop (IPAW), 2008.
- Louis Bavoil, Steven P. Callahan, Patricia J. Crossno, Juliana Freire, Carlos E. Scheidegger, Cláudio T. Silva, and Huy T. Vo. "VisTrails: Enabling Interactive Multiple-View Visualizations" (PDF). Proceedings of IEEE Visualization, pp. 135-142, 2005.
- "Announcement on VisTrails, Inc. website". www.vistrails.com.