Weighted K-means Clustering with Observation Weight for Single-cell Epigenomic Data
Chapter in an edited book (author)


摘要The recent advances in single-cell technologies have enabled us to profile genomic features at unprecedented resolution. Nowadays, we can measure multiple types of genomic features at single-cell resolution, including gene expression, protein-binding, methylation, and chromatin accessibility. One major goal in single-cell genomics is to identify and characterize novel cell types, and clustering methods are essential for this goal. The distinct characteristics in single-cell genomic datasets pose challenges for methodology development. In this work, we propose a weighted K-means algorithm. Through down-weighting cells with low sequencing depth, we show that the proposed algorithm can lead to improved detection of rare cell types in analyzing single-cell chromatin accessibility data. The weight of noisy cells is tuned adaptively. In addition, we incorporate sparsity constraints in our proposed method for simultaneous clustering and feature selection. We also evaluated our proposed methods through simulation studies.
著者Wenyu Zhang, Jiaxuan Wangwu, Zhixiang Lin
編輯Yichuan Zhao, Ding-Geng Chen
書名Statistical Modeling in Biomedical Research: Contemporary Topics and Voices in the Field
系列標題Emerging Topics in Statistics and Biostatistics
頁次37 - 64

上次更新時間 2021-21-01 於 00:38