/
S ushi – S ushi –

S ushi – - PowerPoint Presentation

giovanna-bartolotta
giovanna-bartolotta . @giovanna-bartolotta
Follow
371 views
Uploaded On 2017-06-23

S ushi – - PPT Presentation

An exquisite recipe for NGS data analysis Hubert Rehrauer amp Masaomi Hatakeyama S upporting U ser for SH ell script I ntegration What is your data analysis wishlist ID: 562401

job sushi import fgcz sushi job fgcz import data analysis script dataset application ruby apps uzh check meta set

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "S ushi –" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Sushi – An exquisite recipe for NGS data analysis

Hubert Rehrauer &

Masaomi Hatakeyama

S

upporting

U

ser for

SH

ell

-script

I

ntegrationSlide2

What is your data analysis wishlist?

We had in mind:analyze by clickingscriptable

manage my meta-informationdocument all analysis stepsorganize my workI can add analysis applications

c

onnects to my compute resources

k

eep everything in files on my disk

no painful file formats

The bioinformatician stays in the driver seatSlide3

The Sushi idea (I)

Start with a bunch of raw data files on your disk:Slide4

The Sushi idea (II)

Add the magic seasoning:

Meta informationSlide5

Meta-information turns mere data files into a data set

One row per sample

associated files

everything else noteworthy about the files and the samplesSlide6

Sushi offers a choice of analysis apps for your data set

The meta-information

columns drive the available applicationsSlide7

Sushi lets you control all parameters

as selectors

or as free textSlide8

The processing jobs generate the data files … Slide9

Sushi adds all ingredients to make them a new, documented data set …Slide10

… and Sushi let’s you move to the next analysisSlide11

The Sushi Data Analysis Process

Step 1

Generate Job script(s)

Step 2

Submit the Job script(s)Slide12

Sushi Modules

Sushi UI

Ruby on Rails

GUI

Sushi Application

Single Ruby file

CLI

Workflow

Manager

Ruby gem library

Job ControlSlide13

Sushi Components at FGCZSlide14

Sushi RankSlide15

Why choose Sushi?It has never been easier to import meta-information

It has never been easier to add new data analysis applicationsSushi does not impose constraints on your data analysis

 Your applications define the semanticsYou never have to export your data again it’s already exported!You never have to document your analysis again

the result is fully self-contained and documented by the time the analysis is done

Sushi keeps your work organized even if you work on 10 different projects with thousands of samplesSlide16

Acknowledgements

FGCZ Genome Informatics TeamGiancarlo RussoLennart Opitz

Weihong QiSlavica DimitrievaSlide17

Sushi Takeaways

S

U

S

H

I

SUSHI

S

uper

Easy Pipeline System

U

ltra

Fast Development

S

urprisingly

Flexible Ruby code

H

ighly

Independent Modules

I

ntermediary

between biologist and

informatician

Slide18
Slide19

Sushi Demo 10 minutes

Installation

1 minData import/Job submission 2

mins

New application

import

3

mins

Case Study

4

mins

RNAseq

DEG analysisSlide20

Demo Environment

Mac OS X 10.9.4 Ruby 1.9.3 Ruby on Rails 3.2.9Slide21

Installation

Downloading Ruby on Rails packagehttp://fgcz-sushi.uzh.ch/

sushi_20140908.tgzInstall librariesbundle installSetup DB

b

undle exec rake

db:migrateSlide22

Documents

f

gcz-sushi.uzh.chSlide23

Download

fgcz-sushi.uzh.ch/download.htmlSlide24

InstallationDownload, Extraction, Library installation, DB setup

$

wget http://fgcz-sushi.uzh.ch/sushi_20140908.tgz $ tar zxvf

sushi_20140908.tgz

$ cd sushi

$ bundle install

$ bundle exec rake

db:migrateSlide25

Sushi run, workflow_manager run

$ rails server

$ workflow_managerSlide26

Sushi Access

l

ocalhost:3000Slide27

Data import / Job submission

Prepare your data setImport dataset.tsv

Check samplesSelect an applicationWordCountAppSet parametersSubmit a jobCheck job statusCheck job script/logp

ublic/projects

Check resultSlide28

DataSet ImportSlide29

DataSet ImportSlide30

New Application Import

fgcz-sushi.uzh.ch/download.htmlSlide31

New Application Import

$ wget http://fgcz-sushi.uzh.ch

/fgcz_sushi_apps.tgz$ tar xvf fgcz_sushi_apps.tgz

$

cp

fgcz_sushi_apps

/

FastqcApp.rb

sushi/

lib

/

$

cp

-r

fgcz_sushi_apps

/R_scripts sushi/lib/ Slide32

A Case Study –RNAseq DEG analysis-

RNAseq AnalysisQuality Control

FastQCMappingSTARCountingHTSeqDifferential Gene ExpressionEdgeRSlide33

Summary

Shell script auto-generating

Application Framework

Support all languages

Help bio-

logist

/-

informatician

Implemented

in

Ruby

Meta-Information

DataSet

Interface: GUI / CLI

S

A

S

H

I

M

ISlide34

Thank you for your attention!!

http://

fgcz-sushi.uzh.chSlide35
Slide36

Data import / Job submission

Prepare your data setImport dataset.tsv

Check samplesSelect an applicationWordCountAppSet parametersSubmit a jobCheck job statusCheck job script/logp

ublic/projects

Check resultSlide37

DataSet ImportSlide38

DataSet ImportSlide39

DataSet ImportSlide40

DataSetSlide41

DataSetSlide42

Sushi Application RunSlide43

Parameter SettingSlide44

Job SubmissionSlide45

Job StatusSlide46

New DataSetSlide47

Log, Job ScriptSlide48

Log, Job ScriptSlide49

Job ScriptSlide50

Job ScriptSlide51

New Application Import

fgcz-sushi.uzh.ch/download.htmlSlide52

New Application Import

$ wget http://fgcz-sushi.uzh.ch

/fgcz_sushi_apps.tar$ tar xvf fgcz_sushi_apps.tar

$

cp

fgcz_sushi_apps

/

FastqcApp.rb

sushi/

lib

/

$

cp

-r

fgcz_sushi_apps

/R_scripts sushi/lib/ Slide53

New Application ImportSlide54

FastQC resultSlide55

A Case Study –RNAseq DEG analysis-

RNAseq AnalysisQuality Control

FastQCMappingSTARCountingHTSeqDifferential Gene ExpressionEdgeRSlide56

A Case StudySlide57

Summary

Shell script auto-generating

Application Framework

Support all languages

Help bio-

logist

/-

informatician

Implemented on Ruby

Meta-Information

DataSet

Interface: GUI / CLI

S

A

S

H

I

M

ISlide58

Thank you for your attention!!

http://

fgcz-sushi.uzh.chSlide59

AppendixSlide60

Sushi Run Style

GUI

CLISlide61

Application Mode

SAMPLE

modeJob per Sample, e.g. Tophat

DATASET

mode

Job per

DataSet

, e.g.

FastQCSlide62

Import New Application2 ways: prepare

Shell script

Ruby scriptSave it in Sushi repositorylib directoryNo rebootTest on CLI$ sushi_fabric --class

WordCountApp

--

dataset_id

1

–run

Test on GUISlide63

How to add a new application

Inherit SushiApp class

Write Ruby codeTemplate Method Design PatternPossible to tune-up detailsDelegate to SushiWrap

class

Write

Shell script

Ruby

Metaprogramming

Quick importSlide64

Template Method Pattern

Write Ruby code directlySlide65

Meta-programming

Ruby code auto generation

from shell script codeSlide66

A Sushi App – WordCountApp.rbSlide67

A Sushi App – WordCount.shSlide68

The R Sushi Apps – FastqcApp.rbSlide69

Structure of a Job Script

Generated by

Sushi

Generated by

Sushi

Generated

by

AppSlide70

Sushi gem

https://

rubygems.org/gems/sushi_fabric

Related Contents


Next Show more