FanLens: Dynamic Hierarchical Exploration of Tabular Data Xinghua Lou ∗

Shixia Liu †

Tianshu Wang ‡

IBM China Research Lab

A BSTRACT Tabular data is a very popular format for storing information from various domains. However, as the data grows in size, it becomes increasingly difficult to discover its intrinsic structure or see the comparison among cells from the traditional column-row presentation. To address the problem, we propose an enhanced technique based on traditional radial, space-filling visualization (e.g. Sunburst) named FanLens, which helps users to explore tabular data by dynamically specifying the hierarchies and then visualizing them. Our work is an improvement upon existing approaches in terms of flexibility, context preservation and interaction.

• High-level start-up When visualizing the hierarchy, do not lay out the entire hierarchy initially but display only several high levels showing the summarization information (Figure 1(a)). • Expanding/collapsing mechanism Users can drill down into lower levels by expanding one branch from the higher level. The newly expanded branch will be incrementally laid out around the periphery of its parent slice, radially, and is regarded as the focus (Figure 1(b)(c)(d)).

Keywords: Tabular data visualization, dynamic hierarchy specification, radial space-filling visualization, fisheye distortion. 1

I NTRODUCTION

Tabular data is usually a matrix-like structure of columns and rows containing data cells and is one of the most popular formats for data storage. Although the table is the simplest and most straightforward way to present the tabular data, it becomes awkward when data volume increases, since there is no emphasis on its organization or the distribution. [1]. Here we present FanLens, which enables dynamic exploration of tabular data with an incremental, radial space-filling visualization technique. Compared with traditional radial space-filling visualization (e.g. Sunburst [2]), our technique can provide more flexibility and better preserve the context. The typical thin slice problem is addressed with a new fisheye distortion based interaction. 2

DATA T RANSFORMATION

Generally speaking, the tabular data can be divided into two categories, categorical data and quantitative data. Naturally, hierarchy can be structured by breaking down the tabular data in order of its categorical data. FanLens further improves this feature by allowing the users to bin the quantitative data into different ranges and impose the results on structuring the hierarchy and by allowing grouping the categorical data by the instance of its value. FanLens also supports dynamic visual data transformation, namely two separate dimensions of data can be mapped to the angle and color of the slices in the visualization, respectively. The source of the mapping could be one attribute or a mathematical expression of some selected attributes. 3

V ISUALIZATION D ESIGN

3.1 Incremental Layout Incremental layout is the primary feature of FanLens, which follows two principles: ∗ [email protected][email protected][email protected]

Figure 1: Examples of layout principles. (a) High-level start-up; (b)(c)(d) Expanding/collapsing mechanism.

This incremental layout firstly brings flexibility. Users can focus on the branch of interest and also see the overview by redefining the base levels to cover the entire hierarchy (which creates the classic Sunburst visualization). The readability is also improved because of the expanding/collapsing mechanism which offers the users with a clear view of the exploration path and structure of the focus. 3.2 Zooming and Picking Zooming is used to deal with the thin-slice problem and is implemented by enlarging the sweep angle of the focus so all the thin slices in it are enlarged as well. Former designs [2, 3] applying this method have difficulties with preserving the context. Our solution follows the same idea but, benefiting from the expanding/collapse mechanism, preserves the context better because the focus is enlarged where it is and need not be repositioned (Figure 2). Former solution to picking is zooming before picking, which surely works but also lower the efficiency. Sometimes users need to locate one slice quickly and precisely to find some detailed information or drill down from it. We brought an interesting solution to this problem by applying fisheye transformation to distort the angles of slices in the focus. The goal is to ensure that users can have a clear view of this current selection. The transformation algorithm is prepared when the mouse is moving within the focus, but is only executed when it gets close to or into a thin slice (Figure (3)).

Figure 2: An example of focus zooming.

Figure 5: Explore the 3-point shooting abilities and find the special pattern of the Phoenix Suns team.

Figure 3: (a) Picking without fisheye transformation. (b) Picking with fisheye transformation.

4 C ASE S TUDY The case contains the statistics of NBA players for one season. 4.1 Overall Evaluation Figure 4 shows the result of structuring the data in order of Conference, Division, Team and Player and defining the base levels to cover the entire hierarchy. The slice angle and slice color are mapped to player’s offensive ability and defensive ability, respectively. This overall evaluation indicates that, even though player’s offensive and defensive abilities vary a lot, the league is quite balanced in all levels of Conference, Division and Team. That is one important reason why NBA game is usually exciting because it is always a close matchup.

4.3 Hypothesis Testing We may also use Figure 4 to analyze player’s scoring ability (PPG) and mistakes (TO, Turnover); however, this visualization is not really intrinsic but guides us to hypothesize that players with high scoring ability also have more turnovers. To test this hypothesis, we specify the hierarchy by ranging the players into several categories according to their PPG (see the following table) and visualizing the new hierarchy with the same visual presentation configuration, as shown in Figure 6(b). This new visualization proves our hypothesis that players with stronger scoring ability also give more turnovers.

Figure 6: Testing the hypothesis on players’ scoring ability and turnovers.

Figure 4: Use overview to evaluate the balance of the NBA league.

4.2 Special Pattern Discovery Figure 5 shows the same hierarchy but is dedicated to analyze the 3point shooting ability. The angle is mapped to 3PM(3-Points Made per game) and the color is mapped to 3P%(3-Point shooting Percentage). The large angle of Pacific Divison guides us to explore it and find the team Phoenix Suns which is actually wild about shooting 3-points. In addition, a special pattern is noted that most of its players have close 3-Point shooting percentage which explains why they dare shooting so many 3-point baskets.

5 C ONCLUSIONS We have introduced the FanLens, an approach for dynamic and hierarchical exploration of tabular data. Our primary contribution is an incremental, radial space-filling visualization technique which better preserves the context, provides extra flexibility and has an unique fisheye transformation supported picking interaction. In the future, we plan to enrich the interactions, e.g. Drag and Drop operation on the display to specify the hierarchy and multiple expansion to compare different branches. We also intend to support automatically hierarchy specification using methods such as SOMs and clustering. R EFERENCES [1] G. Chintalapani, C. Plaisant, and B. Shneiderman. Extending the utility of treemaps with flexible hierarchy. In Proc. of the Eighth International Conference on Information Visualisation, pages 335–344, 2004. [2] J. Stasko and E. Zhang. Focus+context display and navigation techniques for enhancing radial, space-filling hierarchy visualisations. In IEEE Symposium on information visualisation, pages 9–12, 2000.

FanLens: Dynamic Hierarchical Exploration of Tabular Data

Keywords: Tabular data visualization, dynamic hierarchy specifi- cation, radial .... The large angle of Pacific Divison guides us to explore it and find the team ...

379KB Sizes 0 Downloads 230 Views

Recommend Documents

FanLens: Dynamic Hierarchical Exploration of Tabular Data
Keywords: Tabular data visualization, dynamic hierarchy specifi- cation, radial space-filling visualization, fisheye distortion. 1 INTRODUCTION. Tabular data is ...

Hierarchical Dynamic Neighborhood Based Particle ...
Abstract— Particle Swarm Optimization (PSO) is arguably one of the most popular nature-inspired algorithms for real parameter optimization at present. In this article, we introduce a new variant of PSO referred to as Hierarchical D-LPSO (Dynamic. L

Javascript Data Exploration - GitHub
Apr 20, 2016 - Designers. I'm a sort of. « social data scientist ». Paris. Sciences Po médialab. I just received a CSV. Let me grab my laptop ... Page 9 ...

Life-Long Spatio-Temporal Exploration of Dynamic ...
The primary purpose of robotic exploration is to auto- ... for a separate training and deployment phase, and allows .... best location to visit at each time slot. Then ...

Dynamic Local Clustering for Hierarchical Ad Hoc ... - IEEE Xplore
Hierarchical, cluster-based routing greatly reduces rout- ing table sizes compared to host-based routing, while reduc- ing path efficiency by at most a constant factor [9]. More importantly, the amount of routing related signalling traffic is reduced

Dynamic Model Selection for Hierarchical Deep ... - Research at Google
Figure 2: An illustration of the equivalence between single layers ... assignments as Bernoulli random variables and draw a dif- ..... lowed by 50% Dropout.

Temporal Visualisation of Hierarchical Data June ...
They can manipulate the CVS repository in the same way as the command line tools. Examples of this ... log command and parses it to produce an analysis of the work that each developer has done within the .... Time Machine is a new backup technology t

Temporal Visualisation of Hierarchical Data June 2007 ...
Jun 18, 2007 - folders on your hard drive is Disk Inventory X [10]. It does have the ... Time Machine is a new backup technology that Apple are going to introduce into Leopard, ... A search of other computer visualisations was also undertaken.

HW 2: Chapter 1. Data Exploration - GitHub
OI 1.8: Smoking habits of UK Residents: A survey was conducted to study the smoking habits ... create the scatterplot here. You can use ... Go to the Spurious Correlations website: http://tylervigen.com/discover and use the drop down menu to.

An exploration of the offset hypothesis using disaggregate data: The ...
Page 1 ... data: The case of airbags and antilock brakes ... We test the hypothesis using disaggregate data to analyze the effects of airbags and antilock.

A Hierarchical Approach to Represent Relational Data ...
making the processing of these data a difficult task. As a consequence of the widespread use of relational databases, the use of data mining methods to discover.

[PDF Online] Dynamic Mode Decomposition: Data-Driven Modeling of ...
Buy Dynamic Mode Decomposition Data Driven Modeling of Chapter 1 Dynamic Mode Decomposition Data Driven Modeling of Complex Systems by J Nathan ...

Automatic Non-rigid Registration of 3D Dynamic Data ...
istration of 3D dynamic facial data using least-squares con- formal maps, and ..... cial expressions that provides a good representation of facial motion. Isomap ...

Path Consolidation for Dynamic Right-Sizing of Data ...
nificant energy savings can be achieved via path consolidation in the network. We present an offline formulation for the flow assignment in a data center network and develop an online algorithm by path consolidation for dynamic right-sizing of the ne

Inference of Dynamic Discrete Choice Models under Incomplete Data ...
May 29, 2017 - directly identified by observed data without structural restrictions. ... Igami (2017) and Igami and Uetake (2016) study various aspects of the hard. 3. Page 4. disk drive industry where product quality and efficiency of production ...