Rikky Wenang Purbojati miRNA MicroRNA miRNA is a class of RNA which is believed to play important roles in gene regulation Its a short 21 to 23nt RNAs that bind to the 3 ID: 931245
Download Presentation The PPT/PDF document "Improving miRNA Target Genes Prediction" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Improving miRNA Target Genes Prediction
Rikky
Wenang
Purbojati
Slide2miRNA
MicroRNA
(miRNA) is a class of RNA which is
believed to play important roles in gene regulation.It’s a short (21- to 23-nt) RNAs that bind to the 3′ untranslated regions (3′ UTRs) of target genes.
Slide3miRNA Functions
miRNA plays a major role in RNA Induced Silencing Complex (RISC).
miRNAs
control the expression of large numbers of genes by:mRNA degradationTranslational repressionRecent studies indicates it plays a role in cancer development:
Surplus of miRNA might inhibit cell apoptosis process
Deficit of miRNA might cause excess of certain
oncogenes
Slide4RNA Induced Silencing Complex
mRNA degradation
Breaks the structural integrity of a mRNA.
Translational repressionPrevent the mRNA from being translated.
Slide5Characteristics of miRNA
Short (22-25nts)
Transcripted
from a miRNA geneIntragenic: miRNA gene is located inside a host gene (usually intron region)Intergenic
: miRNA gene is located outside gene bodies
A consistent 5’ and 3’ boundary:
Transcription Start Site
5’ Cap
Poly(A) tail
Slide6Development of miRNA
Slide7miRNA General Research Question
Much attention has been directed in miRNA processing and targeting.
Computational-wise, one basic challenge of miRNA:
Given a miRNA sequence, what are its target genes?
Slide8miRNA sequence target prediction
Predict target genes by matching the complement of miRNA sequence.
Two types of complement:
Perfect complementImperfect complement
Find perfect match for
seed (2-8nt)
Slide9miRNA sequence target prediction
Several requirements for matching:
Strong Watson-Crick base pairing of the 5’ seed (2-8
nts)Conservation of the miRNA binding site across speciesAnother approach: thermodynamic rule
Local miRNA-mRNA interaction with positive balance of minimum free energy
Problems and Opportunities
Problem:
Pure computational target genes prediction produces a lot of candidates
No unifying theory for target gene prediction yetMost of them are not validated yetCommon assumption is that most of them are false positives
Can
we shorten the list to include only the strong candidates ?
Slide11Problems and Opportunities
Opportunity:
Lots of publicly available experimental dataset i.e.
cDNA microarray, miRNA microarray, etc. Use the dataset to computationally validate some of the target genes
Current Research:
Preliminary research tries to utilizes the abundance of publicly available microarray data.
Slide12Assumptions
miRNA works by silencing target genes, thus miRNA gene and target genes should be anti-correlated
Intragenic
miRNA are expressed along with the host gene.a host gene should be anti-correlated with a target geneIntergenic miRNA does not have a host gene, but we might be able to use available composite (miRNA microarray +
cDNA
microarray) dataset
If a miRNA is up-regulated in miRNA microarray, then its target genes should be down-regulated in
cDNA
microarray
Slide13Current Work
There have been some works related to this idea (i.e.
HOCTAR)
However, we can improve it by:Using a stricter criteria across the microarray dataUsing a more diverse dataWe expect we will get a much better specifity
than the previous method
Slide14Hoctar Method
Get a list of target genes from 3 different tools (
pictar
, TargetScan,miranda)Uses Pearson correlation to determine the correlation coefficient between 2 genesInclude target genes which have correlation below some threshold
(-)
Only works for intragenic miRNA
Slide15Hoctar Method
Slide16Shortcomings of Hoctar
Uses all probes data even though they are not consistent
Uses
only one target gene prediction algorithm approachDepends on Pearson Correlation, which is sensitive to outliers
Slide17Improvement Idea (1)
Use only subset of data which probes are all
consistent
Treat each probes as different experiments
Slide18Improvement Idea (2)
Pearson correlation is very sensitive to outliers, alternative solutions:
Uses
Rank correlation coefficients instead of Pearson correlation coefficientsNormalize the dataset to normal distributionIgnore outliers
Slide19Improvement Idea (3)
In addition to probes consistency and rank correlation, we might use entropy rule in eliminating candidate target genes
Assumption:
Transcript level can be approximated from expression level dataOne miRNA transcript can only degrade one mRNA transcript
Thus miRNA expression changes should not be much different from mRNA expression changes
Slide20Improvement Idea (4)
Uses a larger amount of microarray data
We might be able to include miRNA microarray to further refine target genes list for several miRNA
Slide21Preliminary Result
GSE9234 dataset (
hipoxia
/normoxia)Using only consistency criteria
miRNA
Host Gene
Known
Target Gene
HOCTAR
Refined
miR-103-2
PANK3
GPD1
YES
YES
miR-103-2
PANK3
FBW1B
NO
YES
miR-140
WWP2
HDAC4
YES
YES
miR-224
GABRE
API5
NO
NO
Slide22Refining Intergenic miRNA prediction
Refining intergenic miRNA prediction using microarray dataset is not a trivial task
Microarray can only be used to measure the expression of target genes, but not the miRNA gene
Might have to rely on additional data: Proxy measurement miRNA microarray
Slide23Intergenic miRNA proxy measurement
Putative target gene
approximation
use the expression level of a known target genes for that specific intergenic miRNAIf its target genes are consistently down-regulated, then we can assume that the expression level of the intergenic miRNA gene is
up-regulated
Cluster miRNA
approximation
Some intergenic
miRNAs
are clustered with each other; according to (
Saini
et al. 2007) most of these clusters use the same
pri-mirNA
transcript
Use method 1 for neighboring miRNA to get the intergenic miRNA expression approximation
Slide24Further Work
Implementation and evaluation
Standardizing
composite dataset repository