Slide 1
QRS 2017, Beihang University, China
A Controlled Experiment on Android Applications: Which Factor Impacts GUI Traversal-Based Test Case Generation Technique Most?
Bo Jiang & Yaoyue Zhang, Beihang University
W.K. Chan, City University of Hong Kong
Zhenyu Zhang, Institute of Software, Chinese Academy of Sciences
Slide 2: Outline
1. BACKGROUND & RELATED WORK
2. TEST CASE GENERATION FRAMEWORK
3. DESIGN FACTORS
4. CONTROLLED EXPERIMENT
5. CONCLUSION
Slide 3: Outline (Part 1: Background & Related Work)
Slide 4: Background
1. Android applications require testing to ensure quality.
2. Automated test case generation techniques are a major research focus.
3. State-traversal-based generation is one of the most popular types of techniques.
Slide 5: Related Work
1. Automated Test Input Generation for Android: Are We There Yet? S. R. Choudhary et al. (ASE 2015)
2. PUMA: Programmable UI-Automation for Large-Scale Dynamic Analysis of Mobile Apps. S. Hao, B. Liu et al. (MobiSys 2014)
3. SwiftHand: Guided GUI Testing of Android Apps with Minimal Restart and Approximate Learning. W. Choi et al. (OOPSLA 2013)
4. ACTEve: Automated Concolic Testing of Smartphone Apps. S. Anand et al. (FSE 2012)
Slide 6: Outline (Part 2: Test Case Generation Framework)
Slide 7: Test Case Generation Framework
[Figure: the PUMA framework]
Slide 8: Test Case Generation Framework
The generic GUI-traversal-based test case generation framework of PUMA exposes three design factors:
1. State equivalence
2. Search strategy
3. Waiting time
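In code, the framework on this slide boils down to one traversal loop with three plug points. Below is a minimal Java sketch under illustrative names of our own (App, StateEquivalence, Frontier, WaitPolicy); it is not PUMA's actual API:

    import java.util.ArrayList;
    import java.util.List;

    // Minimal sketch of a generic GUI-traversal test generation loop.
    // All names are illustrative; PUMA's real implementation differs.
    final class GuiTraversalDriver<S, E> {
        interface App<S, E> {                  // abstraction of the app under test
            S currentState();
            List<E> enabledEvents(S state);
            void fire(E event);
        }
        interface StateEquivalence<S> { boolean equivalent(S a, S b); }     // factor 1
        interface Frontier<S> {                                             // factor 2
            void add(S state);
            boolean isEmpty();
            S next();
        }
        interface WaitPolicy { void await() throws InterruptedException; }  // factor 3

        private final App<S, E> app;
        private final StateEquivalence<S> eq;
        private final WaitPolicy waitPolicy;
        private final List<S> visited = new ArrayList<>();

        GuiTraversalDriver(App<S, E> app, StateEquivalence<S> eq, WaitPolicy waitPolicy) {
            this.app = app;
            this.eq = eq;
            this.waitPolicy = waitPolicy;
        }

        // Explore from the start state until the frontier of unexplored states is empty.
        void explore(S start, Frontier<S> frontier) throws InterruptedException {
            visited.add(start);
            frontier.add(start);
            while (!frontier.isEmpty()) {
                S s = frontier.next();           // the search strategy picks the next state
                for (E e : app.enabledEvents(s)) {
                    app.fire(e);                 // send the input event
                    waitPolicy.await();          // the waiting time decides how long to pause
                    S t = app.currentState();
                    if (isNew(t)) {              // state equivalence decides novelty
                        visited.add(t);
                        frontier.add(t);
                    }
                }
            }
        }

        private boolean isNew(S t) {
            for (S v : visited) {
                if (eq.equivalent(v, t)) return false;
            }
            return true;
        }
    }

A real driver would also need to restore the app to state s before firing each event (for example by restarting the app and replaying the event sequence); the sketch elides this detail.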
Slide 9: Outline (Part 3: Design Factors)
Slide 10: Design Factors: State Equivalence
1. Cosine: feature vectors of the UI widgets are compared by cosine similarity, with a threshold of 0.95. Used by DECAF & PUMA.
2. ActivityID: the activity name returned by UIAutomator's API getCurrentActivityName() is compared as a string. Used by A3E.
3. UI Hierarchy: a widget tree structure represents each state; two states are equivalent when their widget trees are the same. Used by SwiftHand.
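As a concrete illustration of the Cosine definition, here is a small Java sketch, under our own assumption (not necessarily the papers' exact feature set) that a state is abstracted into counts of widget types:

    import java.util.HashMap;
    import java.util.HashSet;
    import java.util.Map;
    import java.util.Set;

    // Sketch of cosine-similarity state equivalence in the style of DECAF/PUMA.
    // The feature vector is a widget-type -> occurrence-count map (an assumption).
    final class CosineStateEquivalence {
        private static final double THRESHOLD = 0.95; // threshold from the slide

        static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
            Set<String> keys = new HashSet<>(a.keySet());
            keys.addAll(b.keySet());
            double dot = 0, normA = 0, normB = 0;
            for (String k : keys) {
                double x = a.getOrDefault(k, 0);
                double y = b.getOrDefault(k, 0);
                dot += x * y;
                normA += x * x;
                normB += y * y;
            }
            if (normA == 0 || normB == 0) return 0;
            return dot / (Math.sqrt(normA) * Math.sqrt(normB));
        }

        static boolean equivalent(Map<String, Integer> a, Map<String, Integer> b) {
            return cosine(a, b) >= THRESHOLD;
        }

        public static void main(String[] args) {
            Map<String, Integer> s1 = new HashMap<>();
            s1.put("Button", 3); s1.put("TextView", 5);
            Map<String, Integer> s2 = new HashMap<>();
            s2.put("Button", 3); s2.put("TextView", 4); s2.put("ImageView", 1);
            System.out.println(equivalent(s1, s2)); // true when similarity >= 0.95
        }
    }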
Slide 11: Design Factors: Search Strategy
Strategy | Used by
Random   | Monkey
BFS      | PUMA
DFS      | A3E
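All three strategies can be expressed as one frontier structure that differs only in its pick rule. A minimal sketch with names of our own choosing follows (Monkey's purely random event injection is approximated here by a random pick from the frontier):

    import java.util.ArrayList;
    import java.util.List;
    import java.util.Random;

    // Sketch: the three search strategies differ only in which
    // unexplored state is picked next from the frontier.
    final class SimpleFrontier<S> {
        enum Strategy { BFS, DFS, RANDOM }

        private final List<S> states = new ArrayList<>();
        private final Strategy strategy;
        private final Random rng = new Random();

        SimpleFrontier(Strategy strategy) { this.strategy = strategy; }

        void add(S state) { states.add(state); }

        boolean isEmpty() { return states.isEmpty(); }

        S next() {
            int i;
            switch (strategy) {
                case DFS:    i = states.size() - 1; break;       // stack: newest first
                case RANDOM: i = rng.nextInt(states.size()); break; // Monkey-like random pick
                default:     i = 0; break;                       // BFS queue: oldest first
            }
            return states.remove(i);
        }
    }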
Slide 12: Design Factors: Waiting Time (between Two Events)
Strategy    | Used by
waitForIdle | PUMA
wait200ms   | Monkey (as used by Choudhary et al. in ASE 2015)
wait3000ms  | ACTEve
wait5000ms  | SwiftHand
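The four waiting-time levels fit behind one tiny interface. A sketch follows (WaitPolicy is our name; the waitForIdle level would delegate to UiAutomator's UiDevice.waitForIdle(), which blocks until the UI thread has been quiet, rather than sleep):

    // Sketch of the waiting-time factor as a pluggable policy (our abstraction).
    interface WaitPolicy {
        void await() throws InterruptedException;

        // Fixed-interval levels: fixed(200), fixed(3000), fixed(5000).
        static WaitPolicy fixed(long millis) {
            return () -> Thread.sleep(millis);
        }
        // The waitForIdle level would instead call UiAutomator's
        // UiDevice.waitForIdle() before capturing the next GUI state.
    }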
Slide 13: Design Factors: Factor Levels
Three Factors and Their Levels
Level | Factor 1: State Equivalence | Factor 2: Search Strategy | Factor 3: Waiting Time
0     | Cosine                      | BFS                       | waitForIdle
1     | UI Hierarchy                | DFS                       | wait200ms
2     | ActivityID                  | Random                    | wait3000ms
3     | -                           | -                         | wait5000ms
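Crossing these levels yields the full-factorial design of the experiment. A quick illustrative sketch that enumerates all 3 * 3 * 4 = 36 treatments:

    // Sketch: enumerating the 36 treatments of the full-factorial design.
    public class FactorLevels {
        public static void main(String[] args) {
            String[] equivalence = {"Cosine", "UI Hierarchy", "ActivityID"};
            String[] search      = {"BFS", "DFS", "Random"};
            String[] waiting     = {"waitForIdle", "wait200ms", "wait3000ms", "wait5000ms"};
            int n = 0;
            for (String e : equivalence)
                for (String s : search)
                    for (String w : waiting)
                        System.out.printf("%2d: <%s, %s, %s>%n", ++n, e, s, w);
            // prints 3 * 3 * 4 = 36 treatment combinations
        }
    }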
Slide 14: Outline (Part 4: Controlled Experiment)
Slide 15: Controlled Experiment: Benchmarks and Experimental Setup
1. 33 real-world open-source mobile apps from Dynodroid, A3E, ACTEve, and SwiftHand.
2. We implemented all factor levels in the PUMA framework.
3. Two virtual machines installed with the Ubuntu 14.04 operating system.
Slide 16: Experimental Procedure
1. 36 (i.e., 3 * 3 * 4) combinations of factor levels for the three factors.
2. Took 1,188 testing hours in total on 2 virtual machines.
3. One-way ANOVAs (ANalyses Of VAriance).
4. Multiple comparison.
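As an illustration of how such an analysis can be run (this is not the authors' actual analysis script), here is a one-way ANOVA with Apache Commons Math; the coverage numbers below are made-up placeholders, not the paper's measurements:

    import java.util.Arrays;
    import java.util.List;
    import org.apache.commons.math3.stat.inference.OneWayAnova;

    // Illustrative one-way ANOVA over three groups
    // (e.g., code coverage under the three state-equivalence levels).
    public class AnovaSketch {
        public static void main(String[] args) {
            double[] cosine      = {0.61, 0.58, 0.64, 0.60}; // placeholder data
            double[] uiHierarchy = {0.55, 0.57, 0.53, 0.56};
            double[] activityId  = {0.41, 0.45, 0.39, 0.44};

            List<double[]> groups = Arrays.asList(cosine, uiHierarchy, activityId);
            OneWayAnova anova = new OneWayAnova();
            System.out.printf("F = %.3f, p = %.4f%n",
                    anova.anovaFValue(groups), anova.anovaPValue(groups));
            // p < 0.05 would indicate a statistically significant difference
            // among the factor levels; multiple comparison then locates
            // which pairs of levels differ.
        }
    }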
Slide 17: Results and Analysis: State Equivalence
[Figure: results]
Slide 18: Results and Analysis: State Equivalence
[Figure: results]
Slide 19: Results and Analysis: State Equivalence
Cosine Similarity > UI Hierarchy > ActivityID in:
1. Failure detection ability
2. Code coverage rate
Slide 20: Results and Analysis: Search Strategy
[Figure: results]
Slide 21: Results and Analysis: Search Strategy
[Figure: results]
Slide 22: Results and Analysis: Search Strategy
The randomized search strategy was statistically comparable to BFS and DFS in:
1. Failure detection ability
2. Code coverage rate
Slide 23: Results and Analysis: Waiting Time
[Figure: results]
Slide 24: Results and Analysis: Waiting Time
[Figure: results]
Slide 25: Results and Analysis: Waiting Time
Waiting until the GUI state is stable before sending the next input event is not statistically more effective than waiting for a fixed time interval, in:
1. Failure detection ability
2. Code coverage rate
Slide 26: Results and Analysis: Best Treatment (Failure Detection Rate)
[Figure: results]
Slide 27: Results and Analysis: Best Treatment (Code Coverage)
[Figure: results]
Slide 28: Results and Analysis: Best Treatment
Many combinations of factor levels attained the same high level of failure detection rate and the same high level of statement coverage.
There could be many good configurations for a state-traversal-based technique.
Slide 29: Outline (Part 5: Conclusion)
Slide 30: Conclusion
1. State equivalence: different state equivalence definitions significantly affect both the failure detection rates and the code coverage.
2. Search strategy: BFS and DFS are comparable to Random.
3. Waiting time: waiting for idle and waiting for a fixed time period show no significant difference.
4. Failure detection rate: <Cosine Similarity, BFS, wait5000ms> is the best treatment.
5. Code coverage: <Cosine Similarity, DFS, wait5000ms> is the best treatment.
Slide 31: THANKS
Q&A