BAYESIAN HIERARCHICAL MODEL FOR ESTIMATING GENE ASSOCIATION NETWORK FROM MICROARRAY DATA Dongxiao Zhu,a and Alfred O Hero b a

Bioinformatics Program,b Depatments of EECS, Biomedical Engineering and Statistics University of Michigan,Ann Arbor, MI 48105 1. INTRODUCTION

Estimating gene association networks from gene microarray data is the key to decipher complicated web of functional relationship between genes [1]. However, the process remains to be challenging due to the relatively few independent samples and the large amount of correlation parameters [2]. In a gene association network, vertices represent genes, and edges represent biological association between genes. The network edges are declared to be present if the corresponding correlation parameters are significantly different from a non-zero threshold [3]. The approach has been very useful in inferring gene association networks, and facilitating network based discovery [3]. However, as a Frequentist approach, it often suffers from the “overfitting” problem especially for analyzing small sample size data. Approaches that are able to globally estimate the correlation parameters with variance regularization followed by the seamless correlation thresholding are highly desirable. The desirable approaches fall naturally into the framework of Bayesian hierarchical models [4]. We assume the correlation parameters are exchangeable meaning that the joint distribution (Eq. 1) is invariant to permutations of the indexes. Biologically, this represents a lack of knowledge that could differentiate one pair of biological associations from the others. We then regularize variances of the correlation estimations by specifying a parent normal distribution from which marginal correlation parameters are sampled (Fig. 1). The posterior distributions of correlation parameters provide a seamless combination of the correlation estimation and strength thresholding. 2. METHODS We use ρ to denote the true strength of association between a pair of gene expression profiles. For G gene expression ¡ ¢ profiles in a microarray data set, there are Λ = G 2 correlation parameters ρ that need to be estimated, denoted as This work was partially supported by grants from the National Institute of Health (EY01115), The Foundation Fighting Blindness, Sramek Foundation and Research to Prevent Blindness.

ρλ , λ = 1, . . . , Λ. We define ρˆλ as the λth sample correlaˆ λ as the hyperbolic arc-tangent transtion coefficient, and Γ formation of ρˆλ . Then the transformed sample correlation ˆ λ = atanh(ˆ coefficients Γ ρλ ) are asymptotically Gaussian distributed with means of Γλ and stabilized variance approximations of σλ2 = 1/(N − 3). Here N is the sample size. Simulation studies show that the variance approximation works reasonably well even at a relatively small sample size, e.g. N ≤ 10. We assume known variances to reduce computational complexity. Furthermore, we don’t have a prior information about these Γ0 s, and assuming independency between them in marginal correlation approaches cause the “overfitting” problem [2]. In the Bayesian hierarchical model, we assume that these parameters are exchangeable, and are drawn from a normal distribution with unknown hyperparameters (α, β) (Fig. 1): p(Γ1 , . . . , ΓΛ |α, β) =

Λ Y

N (Γλ |α, β 2 )

(1)

λ=1

In order to generate conditional posterior distributions p(Γλ |α, β, y) for each parameter Γλ , λ = 1, . . . , Λ, where y represents the microarray data, we performed simulation steps as follows [4]: 1. Assign prior distribution for β, i.e. uniform prior distribution p(β) ∝ 1. 2. Draw β from posterior distribution p(β|y).

p(β|y) ∝

p(β)



b λ |ˆ N (Γ α, σλ2 + β 2 ) N (ˆ α|ˆ α, Vα )

λ=1

∝ p(β)Vα1/2

Λ Y

−1/2

(σλ2 + β 2 )

λ=1

where α ˆ and Vα are defined as: PΛ 1

exp(−

bλ − α (Γ b)2 (3) ), 2 2(σλ + β 2 )

ˆ

2 +β 2 Γλ λ=1 σλ

α ˆ = PΛ

(2)

1 2 +β 2 λ=1 σλ

,

(4)

and Vα−1 =

Λ X

σ2 λ=1 λ

1 . + β2

(5)

3. Draw α from p(α|β, y). Combining the data with the uniform prior density p(α|β) yields, p(α|β, y) ∼ N (ˆ α, Vα ).

(6)

ˆ where α ˆ is a precision-weighted average of the Γ’s and Vα is the total precision. Note, we define precision as inverse of variance. 4. Draw Γλ from p(Γλ |α, β, y)

Fig. 1. Bayesian hierarchical model structure.

ˆ λ , Vλ ), p(Γλ |α, β, y) ∼ N (Θ

(7) Estimation Comparison: Bayesian vs. Simple MSE:upper, Variance:lower

and Vλ =

1 2 σλ

1 +

1 β2

(8)

.

Bayesian N=20

Simple N=20

Bayesian N=4

Simple N=4

Bayesian N=20

Simple N=20

Bayesian N=4

Simple N=4

(9)

ˆ λ , i.e. 5. Take hyperbolic tangent transformation of Γ ˆ λ ), ρˆλ = tanh(Γ

0.05

1 ˆ 1 2 Γλ + β 2 α σλ , 1 1 2 + β2 σλ

(10)

and the sampling distribution ρˆλ is the desired posterior distribution. 3. SIMULATIONS We evaluated the performance of the full Bayesian estimation by comparing with the marginal estimation in terms of Mean Squared Error (MSE) and variance. Fig. 2 plots MSE’s and variances of the Bayesian correlation estimation and the marginal correlation estimation over 500 simulations. It is evident in Fig. 2 that the MSE of Bayesian estimation is about three-fold smaller than that of the marginal estimation. A much dramatic contrast was observed at the small sample size that clearly shows the advantages of the Bayesian estimations for the small sample size problem. Comparison of variances follows the same trend (Fig. 2). 4. ESTIMATING CO-EXPRESSION NETWORK FROM GALACTOSE METABOLISM DATA We also demonstrated the application of our Bayesian approach and compared it with the previous Frequentist approach [3] using a yeast galactose metabolism two-color microarray data [5]. Following the procedure in method section, we simulated the empirical posterior distribution for

0.00 0.15 0.30

ˆλ = Θ

0.20

ˆ λ , Vλ are defined as: where Θ

Fig. 2. Mean Squared Errors (MSE’s) and Variances of the Bayesian estimations versus the simple estimations over 500 runs of simulations.

each correlation parameter Γ. Similar to the previous analysis, we used 0.6 as the correlation cutoff value, and declared the statistical association to be biologically relevant when their (1 − q) × 100% posterior confidence intervals do not intersect with [-0.6, 0.6] at the significant level q. Comparison of the networks inferred from Bayesian hierarchical model and from the previous approach in terms of top hub nodes shows much agreement with certain discrepancies. 5. REFERENCES [1] Butte,A., Tamayo,P. Slonim,D., Golub,T.R. and Kohane,I.S. (2000) Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proc Natl Acad Sci USA, 97, 12182-6. [2] Schfer, J. and Strimmer, K. (2005) An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics, 21(6), 754-764. [3] Zhu, D., Hero, A.O., Qin, Z.S., Swaroop, A. (2005) High throughput screening of co-expressed gene pairs with controlled False Discovery Rate (FDR) and Minimum Acceptable Strength (MAS). J Comput Biol, 12(7), 1029-1045. [4] Gelman, A., Carlin, J.B., Stern, H.S. and Rubin, D.B. (2004) Bayesian Data Analysis. Chapmann & Hall/CRC, Boca Raton, FL, USA. [5] Ideker,T., Thorsson,V. et al. (2000) Testing for differentially-expressed genes by maximum-likelihood analysis of microarray data. J Comput Biol, 7(6): 805-17.

BAYESIAN HIERARCHICAL MODEL FOR ...

NETWORK FROM MICROARRAY DATA ... pecially for analyzing small sample size data. ... correlation parameters are exchangeable meaning that the.

123KB Sizes 1 Downloads 319 Views

Recommend Documents

Nonparametric Hierarchical Bayesian Model for ...
employed in fMRI data analysis, particularly in modeling ... To distinguish these functionally-defined clusters ... The next layer of this hierarchical model defines.

Nonparametric Hierarchical Bayesian Model for ...
results of alternative data-driven methods in capturing the category structure in the ..... free energy function F[q] = E[log q(h)] − E[log p(y, h)]. Here, and in the ...

A nonparametric hierarchical Bayesian model for group ...
categories (animals, bodies, cars, faces, scenes, shoes, tools, trees, and vases) in the .... vide an ordering of the profiles for their visualization. In tensorial.

A Bayesian hierarchical model of Antarctic fur seal ...
Mar 30, 2012 - transmitter (Advanced Telemetry Systems, Isanti, Min- nesota, USA), while 211 females were instrumented with only a radio transmitter, and 10 ...

A Bayesian hierarchical model with spatial variable ...
towns which rely on a properly dimensioned sewage system to collect water run-off. Fig. ... As weather predictions are considered reliable up to 1 week ahead, we ..... (Available from http://www.abi.org.uk/Display/File/Child/552/Financial-Risks-.

Bayesian Hierarchical Curve Registration
The analysis often proceeds by synchronization of the data through curve registration. In this article we propose a Bayesian hierarchical model for curve ...

Bayesian Hierarchical Curve Registration
c ) and ai ∼ N(a0;σ2 a ) ×. I{ai > 0}, thus defining curve-specific random linear transfor- mations. ..... http://www.uwm.edu/%7Egervini/programs.html) for the.

A Hierarchical Model for Value Estimation in Sponsored ...
Also, our hierarchical model achieves the best fit for utilities that are close to .... free equilibrium, while Varian [22] showed how it could be applied to derive bounds on ...... the 18th International World Wide Web Conference, pages 511–520 ..

Dynamic Model Selection for Hierarchical Deep ... - Research at Google
Figure 2: An illustration of the equivalence between single layers ... assignments as Bernoulli random variables and draw a dif- ..... lowed by 50% Dropout.

a hierarchical model for device placement - Research at Google
We introduce a hierarchical model for efficient placement of computational graphs onto hardware devices, especially in heterogeneous environments with a mixture of. CPUs, GPUs, and other computational devices. Our method learns to assign graph operat

A Hierarchical Conditional Random Field Model for Labeling and ...
the building block for the hierarchical CRF model to be in- troduced in .... In the following, we will call this CRF model the ... cluster images in a semantically meaningful way, which ..... the 2004 IEEE Computer Society Conference on Computer.

Hierarchical Method for Foreground Detection Using Codebook Model
Jun 3, 2011 - important issue to extract foreground object for further analysis, such as human motion analysis. .... this object shall become a part of the background model. For this, the short-term information is employed ...... processing-speed-ori

Geo-level Bayesian Hierarchical Media Mix Modeling Services
Priors are needed for the hyperparameters τ,β,γ and standard deviations κ,η,ξ. ... If non-negativity is desired for β, we could put a gamma prior ...... R-project.org/.

Bayesian Model Averaging for Spatial Econometric ...
Aug 11, 2005 - represents a cross-section of regions located in space, for example, counties, states, or countries. y ¼ rWy ю ... If the sample data are to determine the posterior model probabilities, the prior probabilities ..... averaged estimate

A Weakly Supervised Bayesian Model for Violence ...
Social media and in particular Twitter has proven to ..... deriving word priors from social media, which is ..... ics T ∈ {1,5,10,15,20,25,30} and our significance.

Anatomically Informed Bayesian Model Selection for fMRI Group Data ...
A new approach for fMRI group data analysis is introduced .... j )∈R×R+ p(Y |ηj,σ2 j. )π(ηj,σ2 j. )d(ηj,σ2 j. ) is the marginal likelihood in the model where region j ...

Geo-level Bayesian Hierarchical Media Mix ... - Research at Google
shape effect, media spend should be standardized to the amount per capita. In some use cases, .... For simplicity of illustration, we will use geometric adstock x∗.

Bayesian Language Model Interpolation for ... - Research at Google
used for a variety of recognition tasks on the Google Android platform. The goal ..... Equation (10) shows that the Bayesian interpolated LM repre- sents p(w); this ...

A Collective Bayesian Poisson Factorization Model for ...
Request permissions from [email protected]. KDD'15, August 10-13, 2015, Sydney, NSW, Australia. ... Event-Based Social Networks, Cold-start Recommendation. 1. INTRODUCTION ... attract more users to register on their websites. Although ... mented in

Bayesian Model Averaging for Spatial Econometric ...
Aug 11, 2005 - There is a great deal of literature on Bayesian model comparison for nonspatial .... structure of the explanatory variables in X into account. ...... Further computational savings can be achieved by noting that the grid can be.

A Hierarchical Bayesian Approach to Improve Media Mix Models ...
Apr 7, 2017 - Even if data is available for a longer duration, e.g., more than 10 years, it is ..... impact for a media channel over multiple years and numerous campaigns is quite rare. ..... row of Figure 3) are estimated with narrow uncertainty and

The Bayesian Draughtsman: A Model for Visuomotor ...
by eye-tracking human subjects, we formulate a Bayesian model of draw- ..... PhD dissertation, Berkeley, University of California, Computer Science Division.

Bayesian Model Averaging for Spatial Econometric ...
11 Aug 2005 - We extend the literature on Bayesian model comparison for ordinary least-squares regression models ...... with 95 models having posterior model probabilities 40.1%, accounting for. 83.02% probability ...... choices. 2 MATLAB version 7 s