Modified Free Energy Model to improve RNA secondary structure prediction with pseudoknots
Refereed conference paper presented and published in conference proceedings


全文

引用次數

其它資訊
摘要The free energy (evaluation) models used in RNA secondary structure prediction are one of the most important reasons that makes the prediction a challenging computational problem in Bioinformatics. These models are the key factor determining the accuracy of the prediction algorithms. Previously we have developed a method called GAknot that has obtained good performance on predicting RNA secondary structures with pseudoknots. In this paper, we propose a new free energy model. We first select a number of RNA sequences from a database which contains known RNA secondary structures as a training dataset for learning this new model. From the training dataset, we then extract base pairs patterns in subsequences of pairs of k-mers from the stems of each sequence in the training data and use the patterns to formulate penalty factors. We modify the energy model by adding these penalty factors. Combined with the new modified energy model, the prediction performance of GAknot has been improved significantly. GAknot with the new modified energy model is shown to be the best method in comparison with two state-of-the-art algorithms using a commonly used testing dataset. The penalty factors of the new energy model and dataset can be downloaded at http: //appsrv.cse.cuhk.edu.hk/(similar to)kktong/NewModel
著者Tong KK, Cheung KY, Lee KH, Leung KS
會議名稱IEEE 13th International Conference on Bioinformatics and Bioengineering (BIBE)
會議開始日10.11.2013
會議完結日13.11.2013
會議地點Chania
會議國家/地區希臘
出版年份2013
月份1
日期1
出版社IEEE
電子國際標準書號978-1-4799-3163-7
國際標準期刊號2471-7819
語言英式英語
Web of Science 學科類別Engineering; Engineering, Biomedical; Medical Informatics

上次更新時間 2020-27-10 於 01:11