Two Screening Methods for Genetic Association Study with Application to Psoriasis Microarray Data Sets
Refereed conference paper presented and published in conference proceedings


摘要Feature selection in genome data faces the challenge of high dimensionality of variables. When the goal of analytics is to identify susceptible loci for complex disease, interaction effects need to be considered, and the actual number of variables to be screened is even larger than the original number of variables due to variable combination. Previous methods of feature selection for interactions either exhaustively calculate pair-wise combination across genome or adopt a pre-screening step by marginal effect of genetic markers. However, these methods might still result in a considerably large number of candidate markers that demand further selection, some genes that have moderate main effect but are important for forming subsets of strong interaction might be filtered out. In this article, we introduce two alternative screening methods: one uses the variable appearance frequency (VAF) to select features, the other uses a non-overlapping criteria to reduce the candidate pool. The methods are applied to two real gene-expression datasets for psoriasis, and their advantages are discussed.
著者Wang M.H., Tsoi K., Lai X., Chong M., Zee B., Zheng T., Lo S.-H., Hu I.
會議名稱4th IEEE International Congress on Big Data, BigData Congress 2015
會議地點New York City
詳細描述organized by IEEE Computer Society, 27 June – 2 July 2015,
頁次324 - 326
關鍵詞feature selection, GWAS, screening

上次更新時間 2021-19-02 於 00:44