= Visual analytics =

Visual analytics is a multidisciplinary science and technology field that emerged from information visualization and scientific visualization. It focuses on how analytical reasoning can be facilitated by interactive visual interfaces.

== Overview ==
Visual analytics is "the science of analytical reasoning facilitated by interactive visual interfaces." It can address problems whose size, complexity, and need for closely coupled human and machine analysis may make them otherwise intractable. Visual analytics advances scientific and technological development across multiple domains, including analytical reasoning, human–computer interaction, data transformations, visual representation for computation and analysis, analytic reporting, and the transition of new technologies into practice. As a research agenda, visual analytics brings together several scientific and technical communities from computer science, information visualization, cognitive and perceptual sciences, interactive design, graphic design, and social sciences.

Visual analytics integrates new computational and theory-based tools with innovative interactive techniques and visual representations to enable human-information discourse. The design of the tools and techniques is based on cognitive, design, and perceptual principles. This science of analytical reasoning provides the reasoning framework upon which one can build both strategic and tactical visual analytics technologies for threat analysis, prevention, and response. Analytical reasoning is central to the analyst's task of applying human judgments to reach conclusions from a combination of evidence and assumptions.

Visual analytics has some overlapping goals and techniques with information visualization and scientific visualization. There is currently no clear consensus on the boundaries between these fields, but broadly speaking the three areas can be distinguished as follows:

- Scientific visualization deals with data that has a natural geometric structure (e.g., MRI data, wind flows).
- Information visualization handles abstract data structures such as trees or graphs.
- Visual analytics is especially concerned with coupling interactive visual representations with underlying analytical processes (e.g., statistical procedures, data mining techniques) such that high-level, complex activities can be effectively performed (e.g., sense making, reasoning, decision making).

Visual analytics seeks to marry techniques from information visualization with techniques from computational transformation and analysis of data. Information visualization forms part of the direct interface between user and machine, amplifying human cognitive capabilities in six basic ways:

1. by increasing cognitive resources, such as by using a visual resource to expand human working memory,
2. by reducing search, such as by representing a large amount of data in a small space,
3. by enhancing the recognition of patterns, such as when information is organized in space by its time relationships,
4. by supporting the easy perceptual inference of relationships that are otherwise more difficult to induce,
5. by perceptual monitoring of a large number of potential events, and
6. by providing a manipulable medium that, unlike static diagrams, enables the exploration of a space of parameter values.

These capabilities of information visualization, combined with computational data analysis, can be applied to analytic reasoning to support the sense-making process.

== History ==
As an interdisciplinary approach, visual analytics has its roots in information visualization, cognitive sciences, and computer science. The term and scope of the field were defined in the early 2000s by researchers such as Jim Thomas, Kristin A. Cook, John Stasko, Pak Chung Wong, Daniel A. Keim, and David S. Ebert. In reaction to the September 11, 2001 attacks, the United States Department of Homeland Security was established in late 2002, combining dozens of previously separate government agencies. Building upon earlier work on visual data mining by Daniel A. Keim starting in the late 1990s, this simultaneously led to the development of a research agenda for visual analytics.

As part of these efforts, the National Visualization and Analytics Center (NVAC) was established at Pacific Northwest National Laboratory in 2004. Its charter was to develop systems to mitigate information overload in the intelligence community after the September 11, 2001 attacks. Its research work determined core challenges, posed open research questions, and positioned visual analytics as a new research domain, in particular through the 2005 research agenda Illuminating the Path.
In 2006, the IEEE VIS community, led by Pak Chung Wong and Daniel A. Keim, launched the annual IEEE Conference on Visual Analytics Science and Technology (VAST), providing a dedicated venue for research into visual analytics, which in 2020 merged to form the IEEE Visualization conference. In 2008, the scope and challenges of visual analytics were conceptually defined by Daniel A. Keim and Jim Thomas in their influential book about visual data mining. The domain was further refined as part of the European Commission's FP7 VisMaster program in the late 2000s.

== Topics ==

=== Scope ===

Visual analytics is a multidisciplinary field that includes the following focus areas:

- Analytical reasoning techniques that enable users to obtain deep insights that directly support assessment, planning, and decision making
- Data representations and transformations that convert all types of conflicting and dynamic data in ways that support visualization and analysis
- Techniques to support production, presentation, and dissemination of the results of an analysis to communicate information in the appropriate context to a variety of audiences.
- Visual representations and interaction techniques that take advantage of the human eye's broad bandwidth pathway into the mind to allow users to see, explore, and understand large amounts of information at once.

=== Analytical reasoning techniques ===

Analytical reasoning techniques are the methods by which users obtain deep insights that directly support situation assessment, planning, and decision making. Visual analytics must facilitate high-quality human judgment with a limited investment of the analysts’ time. Visual analytics tools must enable diverse analytical tasks such as:

- Understanding past and present situations quickly, as well as the trends and events that have produced current conditions
- Identifying possible alternative futures and their warning signs
- Monitoring current events for emergence of warning signs as well as unexpected events
- Determining indicators of the intent of an action or an individual
- Supporting the decision maker in times of crisis.

These tasks will be conducted through a combination of individual and collaborative analysis, often under extreme time pressure. Visual analytics must enable hypothesis-based and scenario-based analytical techniques, providing support for the analyst to reason based on the available evidence.

=== Data representations ===

Data representations are structured forms suitable for computer-based transformations. These structures must exist in the original data or be derivable from the data themselves. They must retain the information and knowledge content and the related context within the original data to the greatest degree possible. The structures of underlying data representations are generally neither accessible nor intuitive to the user of the visual analytics tool. They are frequently more complex in nature than the original data and are not necessarily smaller in size than the original data. The structures of the data representations may contain hundreds or thousands of dimensions and be unintelligible to a person, but they must be transformable into lower-dimensional representations for visualization and analysis.
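As an illustration of transforming a high-dimensional representation into a lower-dimensional one for display, the following is a minimal sketch of a random projection. The text above does not name a specific technique, so the choice of random projection, the function name `random_projection`, and the sample data are all assumptions made for this example.

```python
import math
import random


def random_projection(rows, target_dim=2, seed=0):
    """Project each row onto target_dim random Gaussian directions."""
    rng = random.Random(seed)
    source_dim = len(rows[0])
    # Scale so that squared vector lengths are preserved in expectation.
    scale = 1.0 / math.sqrt(target_dim)
    directions = [
        [rng.gauss(0.0, 1.0) * scale for _ in range(source_dim)]
        for _ in range(target_dim)
    ]
    return [
        tuple(sum(x * w for x, w in zip(row, d)) for d in directions)
        for row in rows
    ]


# Hypothetical data: 5 records with 300 attributes each.
data_rng = random.Random(1)
records = [[data_rng.random() for _ in range(300)] for _ in range(5)]

# Each record becomes a 2-D point that could be drawn in a scatter plot.
points_2d = random_projection(records)
```

Random projection is only one option. Methods such as principal component analysis or multidimensional scaling serve the same purpose and may preserve structure better for a given data set.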

=== Theories of visualization ===
Theories of visualization include:
- Jacques Bertin's Semiology of Graphics (1967)
- Nelson Goodman's Languages of Art (1977)
- Jock D. Mackinlay's Automated design of optimal visualization (APT) (1986)
- Leland Wilkinson's Grammar of Graphics (1998)
- Hadley Wickham's Layered Grammar of Graphics (2010)

=== Visual representations ===

Visual representations translate data into a visible form that highlights important features, including commonalities and anomalies. These visual representations make it easy for users to perceive salient aspects of their data quickly. Augmenting the cognitive reasoning process with perceptual reasoning through visual representations permits the analytical reasoning process to become faster and more focused.

== Process ==

The inputs to the visual analytics process are heterogeneous data sources (e.g., the internet, newspapers, books, scientific experiments, expert systems). From these rich sources, the data sets S = S_{1}, ..., S_{m} are chosen, where each S_{i}, i ∈ {1, ..., m}, consists of attributes A_{i1}, ..., A_{ik}. The goal or output of the process is insight I. Insight is either directly obtained from the set of created visualizations V or through confirmation of hypotheses H as the results of automated analysis methods. This formalization of the visual analytics process is illustrated in the following figure. Arrows represent the transitions from one set to another one.

More formally, the visual analytics process is a transformation F: S → I, where F is a concatenation of functions f ∈ {D_{W}, V_{X}, H_{Y}, U_{Z}} defined as follows:

D_{W} describes the basic data pre-processing functionality with D_{W} : S → S and W ∈ {T, C, SL, I} including data transformation functions D_{T}, data cleaning functions D_{C}, data selection functions D_{SL} and data integration functions D_{I} that are needed to make analysis functions applicable to the data set.

V_{X}, X ∈ {S, H} symbolizes the visualization functions, which are either functions visualizing data V_{S} : S → V or functions visualizing hypotheses V_{H} : H → V.

H_{Y}, Y ∈ {S, V} represents the hypotheses generation process. We distinguish between functions that generate hypotheses from data H_{S} : S → H and functions that generate hypotheses from visualizations H_{V} : V → H.

Moreover, user interactions U_{Z}, Z ∈ {V, H, CV, CH} are an integral part of the visual analytics process. User interactions can either affect only visualizations U_{V} : V → V (e.g., selecting or zooming), or can affect only hypotheses U_{H} : H → H by generating new hypotheses from given ones. Furthermore, insight can be concluded from visualizations U_{CV} : V → I or from hypotheses U_{CH} : H → I.
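To make the idea of F as a concatenation of functions concrete, the following toy sketch chains one function from each family: D_{C}, V_{S}, H_{V} and U_{CH}. The data, the text-based "visualization" and the rule standing in for the analyst's judgment are hypothetical placeholders. In a real system, the steps H_{V} and U_{CH} involve a human reading the display.

```python
from functools import reduce
from typing import List, Optional


def d_c(s: List[Optional[float]]) -> List[float]:
    """D_C : S -> S, data cleaning: drop missing values."""
    return [x for x in s if x is not None]


def v_s(s: List[float]) -> List[str]:
    """V_S : S -> V, a text bar chart with one bar per value."""
    return ["#" * round(x) for x in s]


def h_v(v: List[str]) -> str:
    """H_V : V -> H, a hypothesis read off the chart (longest bar)."""
    longest = max(range(len(v)), key=lambda i: len(v[i]))
    return f"item {longest} is an outlier"


def u_ch(h: str) -> str:
    """U_CH : H -> I, the analyst accepts the hypothesis as insight."""
    return f"insight: {h}"


def compose(*fs):
    """Concatenate functions left to right: compose(f, g)(x) == g(f(x))."""
    return lambda x: reduce(lambda acc, f: f(acc), fs, x)


# F : S -> I as a concatenation of the four functions above.
F = compose(d_c, v_s, h_v, u_ch)
insight = F([2.0, None, 3.0, 9.0, 2.0])
print(insight)  # insight: item 2 is an outlier
```

The item index refers to the cleaned data set, since the missing value is removed before the chart is drawn.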

The typical data pre-processing, which applies data cleaning, data integration and data transformation functions, is defined as D_{P} = D_{T}(D_{I}(D_{C}(S_{1}, ..., S_{m}))). After the pre-processing step, either automated analysis methods H_{S} = {f_{s1}, ..., f_{sq}} (e.g., statistics, data mining) or visualization methods V_{S} : S → V, V_{S} = {f_{v1}, ..., f_{vs}} are applied to the data, in order to reveal patterns as shown in the figure above.
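The nesting in D_{P} can be read as a pipeline: clean each source, integrate the sources, then transform the result. The following is a minimal sketch under assumed choices: cleaning drops records with missing values, integration is plain concatenation, and transformation is min-max scaling. The function names and sample records are hypothetical.

```python
def clean_sources(sources):
    """D_C: drop records that contain a missing value, per source."""
    return [[r for r in s if None not in r.values()] for s in sources]


def integrate(sources):
    """D_I: merge all sources into one data set (here: concatenation)."""
    return [r for s in sources for r in s]


def transform(records):
    """D_T: rescale the "value" attribute to the range [0, 1]."""
    values = [r["value"] for r in records]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid division by zero for constant data
    return [{**r, "value": (r["value"] - lo) / span} for r in records]


# Two hypothetical sources S_1 and S_2.
s1 = [{"id": "a", "value": 10.0}, {"id": "b", "value": None}]
s2 = [{"id": "c", "value": 30.0}, {"id": "d", "value": 20.0}]

# D_P = D_T(D_I(D_C(S_1, S_2)))
d_p = transform(integrate(clean_sources([s1, s2])))
print(d_p)
```

Here record "b" is removed during cleaning, and the remaining values 10, 30 and 20 are scaled to 0.0, 1.0 and 0.5.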

In general the following paradigm is used to process the data:

Analyse First – Show the Important – Zoom, Filter and Analyse Further – Details on Demand
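The four stages of this paradigm can be sketched as four small functions. This is an illustrative toy, not a reference implementation: the z-score analysis, the function names and the sample records are assumptions, and a real tool would render views and respond to user input rather than return lists.

```python
from statistics import mean, pstdev

# Hypothetical records.
records = [
    {"id": 1, "region": "north", "value": 10.0},
    {"id": 2, "region": "north", "value": 12.0},
    {"id": 3, "region": "north", "value": 11.0},
    {"id": 4, "region": "south", "value": 50.0},
    {"id": 5, "region": "south", "value": 13.0},
    {"id": 6, "region": "south", "value": 12.0},
]


def analyse(rows):
    """Analyse first: score each record by its absolute z-score."""
    values = [r["value"] for r in rows]
    mu, sigma = mean(values), pstdev(values) or 1.0
    return {r["id"]: abs(r["value"] - mu) / sigma for r in rows}


def show_important(scores, k=1):
    """Show the important: ids of the k highest-scoring records."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


def zoom_filter(rows, keep):
    """Zoom and filter: keep only the rows the analyst selects."""
    return [r for r in rows if keep(r)]


def details(rows, record_id):
    """Details on demand: return the full record for one id."""
    return next(r for r in rows if r["id"] == record_id)


overview = show_important(analyse(records))
# Set the first finding aside and analyse the remaining records further.
rest = zoom_filter(records, lambda r: r["id"] not in overview)
second_look = show_important(analyse(rest))
print(overview, second_look, details(records, overview[0]))
```

Removing the dominant outlier before the second analysis lets smaller deviations surface, which is the point of the "analyse further" step.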

== See also ==
=== Related subjects ===

- Cartography
- Computational visualistics
- Critical thinking
- Decision-making
- Google Analytics
- Interaction design
- Interactive visual analysis
- Interactivity
- Social network analysis software
- Software visualization
- Starlight Information Visualization System
- Text analytics
- Traffic analysis
- Visual reasoning

=== Related scientists ===

- Cecilia R. Aragon
- Robert E. Horn
- Daniel A. Keim
- Theresa-Marie Rhyne
- Lawrence J. Rosenblum
- Ben Shneiderman
- John Stasko
- Jim Thomas
- Pak Chung Wong

=== Related software ===
- imc FAMOS (1987), graphical data analysis
