ScholarMate
客服热线:400-1616-289

LncReader: identification of dual functional long noncoding RNAs using a multi-head self-attention mechanism

Liu, Tianyuan; Zou, Bohao; He, Manman; Hu, Yongfei; Dou, Yiying; Cui, Tianyu; Tan, Puwen; Li, Shaobin; Rao, Shuan; Huang, Yan; Liu, Sixi*; Cai, Kaican*; Wang, Dong*
Science Citation Index Expanded
南方医科大学; 中国医学科学院; 中国医学科学院北京协和医院

摘要

Long noncoding ribonucleic acids (RNAs; LncRNAs) endowed with both protein-coding and noncoding functions are referred to as 'dual functional lncRNAs'. Recently, dual functional lncRNAs have been intensively studied and identified as involved in various fundamental cellular processes. However, apart from time-consuming and cell-type-specific experiments, there is virtually no in silico method for predicting the identity of dual functional lncRNAs. Here, we developed a deep-learning model with a multi-head self-attention mechanism, LncReader, to identify dual functional lncRNAs. Our data demonstrated that LncReader showed multiple advantages compared to various classical machine learning methods using benchmark datasets from our previously reported cncRNAdb project. Moreover, to obtain independent in-house datasets for robust testing, mass spectrometry proteomics combined with RNA-seq and Ribo-seq were applied in four leukaemia cell lines, which further confirmed that LncReader achieved the best performance compared to other tools. Therefore, LncReader provides an accurate and practical tool that enables fast dual functional lncRNA identification.

关键词

noncoding RNA system biology deep learning