Journal: Algorithms, Vol. 16, No. 6 (2023)
ARTICLE
TITLE

DrugFinder: Druggable Protein Identification Model Based on Pre-Trained Models and Evolutionary Information

Mu Zhang, Fengqiang Wan and Taigang Liu

Abstract

Identifying druggable proteins is central to drug development. Traditional structure-based identification methods are time-consuming and costly, so a growing number of researchers have turned to sequence-based methods. We propose a sequence-based druggable protein identification model called DrugFinder. The model extracts features from the embedding output of the pre-trained protein model Prot_T5_Xl_Uniref50 (T5) and from the evolutionary information in the position-specific scoring matrix (PSSM). To remove redundant features and improve model performance, we used the random forest (RF) method to select features, then trained and tested several machine learning classifiers on the selected features: support vector machines (SVM), RF, naive Bayes (NB), extreme gradient boosting (XGB), and k-nearest neighbors (KNN). Among these classifiers, the XGB model achieved the best results. DrugFinder reached an accuracy of 94.98%, sensitivity of 96.33% and specificity of 96.83% on the independent test set, which is much better than the results from existing identification methods. Our model also performed well on an additional tumor-related test set, achieving an accuracy of 88.71% and precision of 93.72%. This further demonstrates the strong generalization capability of the model.
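
The abstract reports accuracy, sensitivity, specificity and precision without defining them. The following is a minimal sketch of how such figures are usually computed, assuming the standard binary-classification definitions with "druggable" as the positive class. The names `ConfusionCounts`, `count_outcomes` and `binary_metrics`, and the toy labels, are hypothetical illustrations and are not taken from the paper or its code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    tp: int  # druggable proteins predicted as druggable
    fn: int  # druggable proteins predicted as non-druggable
    tn: int  # non-druggable proteins predicted as non-druggable
    fp: int  # non-druggable proteins predicted as druggable


def count_outcomes(y_true, y_pred):
    """Tally the confusion matrix from 0/1 labels (assumption: 1 = druggable)."""
    pairs = list(zip(y_true, y_pred))
    return ConfusionCounts(
        tp=sum(1 for t, p in pairs if t == 1 and p == 1),
        fn=sum(1 for t, p in pairs if t == 1 and p == 0),
        tn=sum(1 for t, p in pairs if t == 0 and p == 0),
        fp=sum(1 for t, p in pairs if t == 0 and p == 1),
    )


def _ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def binary_metrics(c):
    """Standard metric definitions for a binary classifier."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "accuracy": _ratio(c.tp + c.tn, total),   # all correct / all samples
        "sensitivity": _ratio(c.tp, c.tp + c.fn),  # recall on the positive class
        "specificity": _ratio(c.tn, c.tn + c.fp),  # recall on the negative class
        "precision": _ratio(c.tp, c.tp + c.fp),    # correct among predicted positives
    }


# Hypothetical toy labels, not data from the paper.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

counts = count_outcomes(y_true, y_pred)
metrics = binary_metrics(counts)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

In a real evaluation, `y_true` and `y_pred` would come from the independent test set and the trained classifier; the toy lists here only exercise the formulas.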

Similar articles

Yang Liu, Qiang Zhang, Longjin Wang, Shun An, Yan He, Zhimin Fan and Fang Deng
This paper investigates the problem of real-time parameter identification for ship maneuvering parameters and wave peak frequency in an ocean environment. Based on the idea of Euler discretion, a combined model of ship maneuvering and wave peak frequency...

 
Min Hu, Fan Zhang and Huiming Wu
Various abnormal scenarios might occur during the shield tunneling process, which have an impact on construction efficiency and safety. Existing research on shield tunneling construction anomaly detection typically designs models based on the characteris...
Journal: Applied Sciences

 
Murat Millidere, Ferhat Akgül, Kemal Leblebicioglu and James F. Whidborne
For developing high-fidelity flight simulations, an accurate and complete representation of the aerodynamic characteristics of the aircraft is necessary. To obtain a realistic aerodynamic database, system identification methods can be used to describe th...
Journal: Aerospace

 
Nirmal Acharya, Padmaja Kar, Mustafa Ally and Jeffrey Soar
Significant clinical overlap exists between mental health and substance use disorders, especially among women. The purpose of this research is to leverage an AutoML (Automated Machine Learning) interface to predict and distinguish co-occurring mental hea...
Journal: Applied Sciences

 
Hao Gu, Ming Chen and Dongmei Gan
The identification of gender in Chinese mitten crab juveniles is a critical prerequisite for the automatic classification of these crab juveniles. Aiming at the problem that crab juveniles are of different sizes and relatively small, with unclear male an...
Journal: Applied Sciences