Slide1
By: Raul Rodriguez
Walter Checefsky (added later)
http://orange.biolab.si/
Slide2
What is Orange?
A Python-based data mining tool developed by the Bioinformatics Laboratory of the Faculty of Computer and Information Science at the University of Ljubljana, Slovenia.
Slide3
Why does bioinformatics need this?
Learn about the interaction of different genes
Discover different methods of gene expression
Learn the structure of proteins
Find probable regions of protein encoding
Slide4
What does it do?
Best known for its graphical user interface (GUI)
Can also be scripted in Python
Slide5
Which algorithms can it use?
Decision trees (ID3, C4.5, CART)
Naïve Bayes
Instance-based learning (kNN, ML-kNN)
Function-based learning (regression analysis (logistic, linear, lasso, PLS, trees, mean), ANN, SVM (libSVM, liblinear))
Ensemble learning (bagging, AdaBoost, random forest)
Hierarchical clustering (linkage-based)
Partition-based clustering (k-means, partitioning around medoids, fuzzy c-means)
ANN-based clustering (self-organizing maps)
Association rules (Apriori (sparse, attr.-value), Apriori-SD)
Slide6
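To make one entry from the algorithm list above concrete, here is a minimal sketch of instance-based learning (kNN) in plain Python. It is not Orange's implementation or API; the function name `knn_predict` and the toy data are hypothetical and exist only for illustration.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (features, label) pairs; distance is Euclidean.
    """
    # Order the training examples from nearest to farthest from the query.
    by_distance = sorted(train, key=lambda example: math.dist(example[0], query))
    # Count the labels of the k nearest neighbours and return the most common.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated groups labelled "A" and "B".
train = [
    ((1.0, 1.0), "A"),
    ((1.2, 0.8), "A"),
    ((0.9, 1.1), "A"),
    ((5.0, 5.0), "B"),
    ((5.2, 4.9), "B"),
    ((4.8, 5.1), "B"),
]

prediction = knn_predict(train, (1.1, 1.0))
```

The query point sits inside the first group, so its three nearest neighbours all carry the same label.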
Type of input?
Tab-delimited file
Top row: features
Second row: type of data
Third row: meta information describing the features
Remaining rows: data
Slide7
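The tab-delimited layout described above (feature names, then type of data, then meta information, then the data itself) can be sketched with the Python standard library alone. This is only an illustration of the file layout, not Orange's own loader. The sample columns, the type words ("continuous", "discrete") and the "class" flag are assumptions made for the example, and `read_three_row_header` is a hypothetical helper.

```python
import csv
import io

# Hypothetical sample file: three header rows followed by data rows.
# The type words and the "class" flag are assumptions for illustration.
SAMPLE = (
    "gene_length\tgc_content\tcoding\n"
    "continuous\tcontinuous\tdiscrete\n"
    "\t\tclass\n"
    "1200\t0.52\tyes\n"
    "300\t0.41\tno\n"
)


def read_three_row_header(text):
    """Split tab-delimited text into column descriptions and data records."""
    rows = list(csv.reader(io.StringIO(text), delimiter="\t"))
    names, types, flags = rows[0], rows[1], rows[2]
    # One description per column: its name, declared type, and meta flag.
    columns = [
        {"name": name, "type": kind, "flag": flag}
        for name, kind, flag in zip(names, types, flags)
    ]
    # Every remaining row is a data record keyed by feature name.
    records = [dict(zip(names, row)) for row in rows[3:]]
    return columns, records


columns, records = read_three_row_header(SAMPLE)
```

Values are kept as strings here; converting them according to the declared type is left out to keep the sketch short.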
Example Time!
Slide8
Why isn’t it perfect?
No spatial data analysis
No time series analysis
No parallelization
Only Naïve Bayes in the Bayes family
Fewer algorithm options than other frameworks
Locks you into Python