By: Raul Rodriguez

Uploaded On 2016-02-21


Presentation Transcript

Slide1

By: Raul Rodriguez

Walter Checefsky (Added later)

http://orange.biolab.si/

Slide2

What is Orange?

A Python-based tool for data mining, developed by the Bioinformatics Laboratory of the Faculty of Computer and Information Science at the University of Ljubljana in Slovenia.

Slide3

Why does Bioinformatics need this?

Learn about the interaction of different genes

Discover different methods of gene expression

Learn the structure of proteins

Find probable regions of protein encoding

Slide4

What’s it do?

Mainly well known for its Graphical User Interface (GUI)

You can script in Python too

Slide5

Which algorithms can it use?

Decision trees (ID3, C4.5, CART)

Naïve Bayes

Instance Based Learning (kNN, ML-kNN)

Function Based Learning (regression analysis (log, lin, lasso, PLS, trees, mean), ANN, SVM (libSVM, liblinear))

Ensemble Learning (bagging, AdaBoost, random forest)

Hierarchical clustering (linkage-based)

Partition Based Clustering (k-means, partition around medoids, fuzzy c-means)

ANN based clustering (self-organizing)

Association Rules (Apriori (sparse, attr.-value), Apriori-SD)

Slide6

Type of input?

Tab-delimited file

Top row: feature names

Second row: type of data

Third row: meta information to describe the features

Remaining rows: data

Slide7
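The tab-delimited layout listed under "Type of input?" can be sketched in plain Python. This is only an illustration, not Orange's own loader: the sample columns, the `continuous`/`discrete` type names, the `class` flag, the `?` missing-value marker, and the helper `read_tab_table` are all assumptions made for the example.

```python
import csv
import io

# Hypothetical sample: feature names, then data types, then meta
# information, then the data rows. All values are made up.
SAMPLE = (
    "sepal_length\tpetal_length\tspecies\n"
    "continuous\tcontinuous\tdiscrete\n"
    "\t\tclass\n"
    "5.1\t1.4\tsetosa\n"
    "7.0\t4.7\tversicolor\n"
)


def read_tab_table(text):
    """Split a three-header-row tab-delimited table into columns and records."""
    rows = list(csv.reader(io.StringIO(text), delimiter="\t"))
    if len(rows) < 3:
        raise ValueError("expected three header rows before the data")
    names, types, flags = rows[0], rows[1], rows[2]
    # The meta row may be shorter than the name row; pad it with blanks.
    flags = flags + [""] * (len(names) - len(flags))
    columns = [
        {"name": n, "type": t, "flag": f}
        for n, t, f in zip(names, types, flags)
    ]
    records = []
    for row in rows[3:]:
        if not row:  # skip blank lines
            continue
        record = {}
        for col, raw in zip(columns, row):
            if raw in ("", "?"):  # assumed missing-value markers
                record[col["name"]] = None
            elif col["type"] == "continuous":
                record[col["name"]] = float(raw)
            else:
                record[col["name"]] = raw
        records.append(record)
    return columns, records


columns, records = read_tab_table(SAMPLE)
for record in records:
    print(record)
```

Keeping the three header rows separate from the data rows means each value can be converted according to its declared type before any learning algorithm sees it.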

Example Time!

Slide8

Why isn’t it perfect?

No Spatial Data Analysis

No Time Series Analysis

No Parallelization

Only Naïve Bayes in the Bayes family

Fewer algorithm options than other frameworks

Locks you into Python