/
Trust Me, I’m Partially Right: Incremental Visualization Trust Me, I’m Partially Right: Incremental Visualization

Trust Me, I’m Partially Right: Incremental Visualization - PowerPoint Presentation

trish-goza
trish-goza . @trish-goza
Follow
447 views
Uploaded On 2016-06-13

Trust Me, I’m Partially Right: Incremental Visualization - PPT Presentation

Shengliang Dai Background Queries over large scale petabyte data bases often mean waiting overnight for a result to come back Scale costs time Potential avenues of exploration are ignored because the costs are perceived to be too high to run or even propose them ID: 360877

incremental queries analysis data queries incremental data analysis large confidence users visualizations visualization values query uncertainty datasets time intervals

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Trust Me, I’m Partially Right: Increme..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Trust Me, I’m Partially Right: Incremental Visualization Lets Analysts Explore Large Datasets Faster

Shengliang

DaiSlide2

Background

Queries over large scale (petabyte) data bases often mean waiting overnight for a result to come back.

Scale costs time.

Potential

avenues of exploration are ignored because the costs are perceived to be too high to run or even propose them.

S

ampleAction

:

A

ccelerate and open up the query process with incremental visualizations. Slide3

ProblemsTrading

off

speed

of

exploring

and

richness

of

questions

for

time

and

resources

when

running

queries

over

vast

arrays

of

data.

T

he

number and types of queries

are

still

restricted.

I

ncremental queries

:

Analysts

are accustomed to seeing precise figures, rather than probabilistic results Slide4

Goals

In order

to

let

incremental

analysis to be a viable technique

C

omplement

ing

technical aspects of the back-end with an investigation of the interaction design

V

isualize

estimates on incremental data. Slide5

METHOD

H

ypothesis

:

U

sers working with incremental visualizations will be able to interpret the confidence intervals comfortably

.

This

will

allow

them

to act rapidly on their queries.

I

ncremental results will allow users to carry out exploratory queries.

S

ampleAction

S

imulat

ing

the experience of using a very large dataset.

I

ncrementally

displaying results based on ever-larger portions of the dataset. Slide6

SampleAction

S

imulat

ing

the effects of interacting with very large datasets while supporting an iterative query interaction for large aggregates.

E

rror bars

:

show

the values of the estimate. Slide7
Slide8

SampleAction shows how the bounds are changing over time Slide9

Bounded uncertainty based on samples Slide10

The Back-End Database Industrial

DBMS

do not currently support incremental queries of the type required

C

onstraine

this initial evaluation to deploying

sampleAction

on a database small enough to query interactively: Slide11

USER STUDY Bob: Server

Operations

Allan: Online Game Reporting

Sam

: Twitter Analytics Slide12

ANALYSIS

The value of seeing a first record fast

users found value in getting a quick response to their

queries

:

Sam

and Allan realized they had entered an incorrect query, and were able to repair it quickly by adding appropriate filters. Slide13

ANALYSISNew Behaviors around Data data in a static, non-interactive form

real exploration of the dataset

If the first few samples had not converged, they would decide whether it was worth the trade-off of waiting longer, sometimes checking the convergence view

to

decide. Slide14

ANALYSISDifficulties with Error Bar Convergence

Big

variance

Past

literature on visualizing uncertainty

has

emphasized visualizations that fit the entire uncertainty range on screen; these were not sufficient for some of

bounds

.

N

oisy

values

Incremental systems can be slowed by datasets that are not

clean

.

Solution:

Using additional domain knowledge during the

execution

,

such

as

discarding values that fall outside meaningful constraints

Slide15

ANALYSISNon-Expert Views of Confidence Intervals

E

rror

bars

sometimes

are

confusing

for

users.

For

example,

the

interval would shrink toward a converged value.

T

wo

very different adjacent columns might have identical confidence intervals

.

Slide16

Implications

U

sers

seem to be able to interpret confidence intervals,

which

opens

opportunities for using uncertainty visualization tied to probabilistic datasets. Slide17

Limitations of Incremental Visualization

T

here

some genres of queries that are structurally going to be difficult.

Outlier Values

For

example,

t

here

is no probabilistic answer to “which item has the highest value”.

Table Joins

When

joins against a rare or unique key, using samples from joining tables may not work at all. Slide18

Future Work

R

epresentations

of

confidence

,

eliminating

downsides of error bars

M

ore

types of visualizations

M

ore

types of data analysis Slide19

Conclusion

While the concept of approximate queries has been known for some time, the visualization implications have not been explored with users.

S

howing

the utility of these approximations will encourage further research on both the front- and back-ends of these systems.

HCI researchers have also been limited in their ability to explore these

concepts

.

S

imulating

large data systems may help them explore realistic front-ends without needing to build full-scale computation back-ends.