Gene Expression Data Analysis of Different Brain Areas Based on Non-Negative Matrix Factorization
1 Information Engineering College, Shanghai Maritime University, Shanghai 201306, China
2 Department of Chemistry and Biochemistry, Rowan University, NJ 08028, USA
摘要基因之间存在多种多样的表达调控活动,一般认为这些调控关系隐含在基因表达谱中。针对阿尔茨海默症(AD) 起病隐匿、诊断难、发病机理复杂以及基因信号传导通路和调控关系难以重建等特征,利用非平滑非负矩阵分解(nsNMF)方法提取AD致病基因,聚类过程中利用共表型相关性系数(CCC)选取聚类数 k 的值,得到最优的聚类数目。针对基因表达数据噪声高、信息变量隐藏难分析的困难,考虑AD的发生发展与许多大脑功能区域密切相关的特性,提出将nsNMF分别应用于AD患者的大脑海马区、内嗅区皮质、颞中回及视觉皮层区的基因表达数据中,共提取3 800个显著基因,其中包括确定与AD致病机理有关联的10个致病基因,并进行了生物学分析,得到了AD相关的细胞凋亡、代谢及炎症反应等生物过程,显示nsNMF方法及大脑多区域数据集的联合分析能更全面地探寻AD信号传导关系及基因调控方式。
Abstract:It is accounted that various regulatory activities between genes contain in the gene expression datasets. Alzheimers disease (AD) are characterized by its hidden onset, complex pathological mechanism, hard diagnosis and it is difficult to reconstruct the genes signal pathways and its regulatory network. In this work, we improved nonsmooth nonnegative matrix factorization (nsNMF) to identify significant genes of Alzheimers disease (AD) using cophenetic correlation coefficient to confirm the clustering number k. Since gene expression dataset has high noise, and the underlying information is hard to analyze according to the function of brain areas, we applied nsNMF to AD samples of hippocampus (HIP), entorhinal cortex (EC), media temporal gyrus (MTG) and primary visual cortex (VCX) which have close relationship of human learning and memory. After that, 3800 of significant genes were extracted including 10 known pathogenic genes. By biological analysis, many AD related biological process like apoptosis, metabolize and inflammation were obtained, and it is demonstrated that the improved nsNMF and the conjoint analysis method can deeply explore the pathways and gene regulatory ways of AD.
孔薇1* 陶伟杰1 牟晓阳2. 基于非负矩阵分解的大脑不同区域基因表达数据分析[J]. 中国生物医学工程学报, 2012, 31(6): 875-881.
KONG Wei 1* TAO Wei Jie 1 MOU Xiao Yang2. Gene Expression Data Analysis of Different Brain Areas Based on Non-Negative Matrix Factorization. journal1, 2012, 31(6): 875-881.
[1]Liu Chang, Zhou Jiliu, Lang Fangnian, et al. Generalized discriminant orthogonal nonnegative matrix factorization and its applications [J]. Systems Engineering and Electronics, 2011, 33(10): 2327-2330.
[2]Zhu Yanli, Chen Jun, Qu Peixin. A novel discriminant nonnegative matrix factorization and its application to facial expression recognition [J]. Advanced Materials Research, 2011, 143-144: 129-133.
[3]Zhao Weizhong, Ma Huifang, Fu Yanxiang. An effective nonnegative Matrix Factorization algorithm and its application in document clustering [J]. Advanced Materials Research, 2011, 268-270: 1508-1513.
[4]Shahnaz FB, Michael W, Pauca VP, et al. Document clustering using nonnegative matrix factorization [J]. Information Processing and Management March, 2006, 42 (2): 373-386.
[5]Woolfe F, Gerdes M, Bello M, et al. Autofluorescence removal by nonnegative matrix factorization [J]. IEEE Transactions on Image Processing, 2011, 20(4): 1085-1093.
[6]Devarajan K, Ebrahimi N. Class discovery via nonnegative matrix factorization [J]. American Journal of Mathematical and Management Sciences, 2008, 28(3-4): 457-467.
[7]Gao Yuan, George C. Improving molecular cancer class discovery through sparse nonnegative matrix factorization [J]. Bioinformatics, 2005, 21(21): 3970-3975.
[8]Lee JH, Seongbuk G. Investigation of spectrally coherent restingstate networks using nonnegative matrix factorization for functional MRI data [J]. International Journal of Imaging Systems and Technology, 2011, 21(2): 211-222.
[9]Lee DD, Seung HS. Learning the parts of objects by nonnegative matrix factorization [J]. Nature, 1999, 401: 788-793.
[10]Sajda P, Du S, Brown TR, et al. Nonnegative matrix factorization for rapid recovery of constituent spectra in magnetic resonance chemical shift imaging of the brain [J]. IEEE Trans Med Imaging. 2004, 23(12): 1453-1465.
[11]Montcuquet AS, Hervé L, Navarro F, et al. In vivo fluorescence spectra unmixing and autofluorescence removal by sparse nonnegative matrix factorization [J]. IEEE Trans Biomed Eng, 2011, 58(9): 2554-2565.
[12]Kopriva I. Singleframe multichannel blind deconvolution by nonnegative matrix factorization with sparseness constraints [J]. Opt Letter, 2005, 30(23): 3135-3137.
[13]Türkan M, Guillemot C. Image prediction based on neighborembedding methods [J]. IEEE Trans Image Process, 2012, 21(4):1885-1898.
[14]PascualMontano A, Carazo JM, Kochi K, et al. Nonsmooth nonnegative matrix factorization (nsNMF) [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28: 403-415.