Prediction and optimization of NaV1.7 inhibitors based on machine learning methods

120 0 0.0 ( 0 )

تحميل البحث استخدام كمرجع

نشر من قبل Weikaixin Kong

تاريخ النشر 2019

مجال البحث علم الأحياء الهندسة المعلوماتية

والبحث باللغة English

تأليف Weikaixin Kong - Xinyu Tu - Zhengwei Xie

الأساليب الكمية التعلم الآلي الجزيئات الحيوية

قم بزيارة صفحتنا على فيسبوك

‎Shamra Academia - شمرا أكاديميا‎

اسأل ChatGPT حول البحث

الملخص بالعربية الملخص بالإنكليزية

We used machine learning methods to predict NaV1.7 inhibitors and found the model RF-CDK that performed best on the imbalanced dataset. Using the RF-CDK model for screening drugs, we got effective compounds K1. We use the cell patch clamp method to verify K1. However, because the model evaluation method in this article is not comprehensive enough, there is still a lot of research work to be performed, such as comparison with other existing methods. The target protein has multiple active sites and requires our further research. We need more detailed models to consider this biological process and compare it with the current results, which is an error in this article. So we want to withdraw this article.

قيم البحث

113 - Brian L. Hie , Kevin K. Yang 2021

Machine-learning models that learn from data to predict how protein sequence encodes function are emerging as a useful protein engineering tool. However, when using these models to suggest new protein designs, one must deal with the vast combinatoria l complexity of protein sequences. Here, we review how to use a sequence-to-function machine-learning surrogate model to select sequences for experimental measurement. First, we discuss how to select sequences through a single round of machine-learning optimization. Then, we discuss sequential optimization, where the goal is to discover optimized sequences and improve the model across multiple rounds of training, optimization, and experimental measurement.

الأساليب الكمية التعلم الآلي الجزيئات الحيوية

Mid-price Prediction Based on Machine Learning Methods with Technical and Quantitative Indicators

356 - Adamantios Ntakaris , Juho Kanniainen , Moncef Gabbouj 2019

Stock price prediction is a challenging task, but machine learning methods have recently been used successfully for this purpose. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical and quantitative analysis and t ested their validity on short-term mid-price movement prediction. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also build a new quantitative feature based on adaptive logistic regression for online learning, which is constantly selected first among the majority of the proposed feature selection methods. This study examines the best combination of features using high frequency limit order book data from Nasdaq Nordic. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best performance with a combination of only very few advanced hand-crafted features.

التمويل الإحصائي التعلم الآلي التعلم الالي

Prediction of 5-hydroxytryptamine Transporter Inhibitor based on Machine Learning

133 - Weikaixin Kong , Wenyu Wang , Jinbing An 2019

In patients with depression, the use of 5-HT reuptake inhibitors can improve the condition. Topological fingerprints, ECFP4, and molecular descriptors were used. Some SERT and small molecules combined prediction models were established by using 5 mac hine learning methods. We selected the higher accuracy models(RF, SVM, LR) in five-fold cross-validation of training set to establish an integrated model (VOL_CLF). The training set is from Chembl database and oversampled by SMOTE algorithm to eliminate data imbalance. The unbalanced data from same sources (Chembl) was used as Test set 1; the unbalanced data with different sources(Drugbank) was used as Test set 2 . The prediction accuracy of SERT inhibitors in Test set 1 was 90.7%~93.3%(VOL_CLF method was the highest); the inhibitory recall rate was 84.6%-90.1%(RF method was the highest); the non-inhibitor prediction accuracy rate was 76.1%~80.2%(RF method is the highest); the non-inhibitor predictive recall rate is 81.2%~87.5% (SVM and VOL_CLF methods were the highest) The RF model in Test Set 2 performed better than the other models. The SERT inhibitor predicted accuracy rate, recall rate, non-inhibitor predicted accuracy rate, recall rate were 42.9%, 85.7%, 95.7%, 73.3%.This study demonstrates that machine learning methods effectively predict inhibitors of serotonin transporters and accelerate drug screening.

الأساليب الكمية

Review of Machine-Learning Methods for RNA Secondary Structure Prediction

113 - Qi Zhao , Zheng Zhao , Xiaoya Fan 2020

Secondary structure plays an important role in determining the function of non-coding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary stru cture. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine-learning technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on machine-learning technologies and a tabularized summary of the most important methods in this field. The current pending issues in the field of RNA secondary structure prediction and future trends are also discussed.

الجزيئات الحيوية التعلم الآلي التعلم الالي

adabmDCA: Adaptive Boltzmann machine learning for biological sequences

115 - Anna Paola Muntoni , Andrea Pagnani , Martin Weigt 2021

Boltzmann machines are energy-based models that have been shown to provide an accurate statistical description of domains of evolutionary-related protein and RNA families. They are parametrized in terms of local biases accounting for residue conserva tion, and pairwise terms to model epistatic coevolution between residues. From the model parameters, it is possible to extract an accurate prediction of the three-dimensional contact map of the target domain. More recently, the accuracy of these models has been also assessed in terms of their ability in predicting mutational effects and generating in silico functional sequences. Our adaptive implementation of Boltzmann machine learning, adabmDCA, can be generally applied to both protein and RNA families and accomplishes several learning set-ups, depending on the complexity of the input data and on the user requirements. The code is fully available at https://github.com/anna-pa-m/adabmDCA. As an example, we have performed the learning of three Boltzmann machines modeling the Kunitz and Beta-lactamase2 protein domains and TPP-riboswitch RNA domain. The models learned by adabmDCA are comparable to those obtained by state-of-the-art techniques for this task, in terms of the quality of the inferred contact map as well as of the synthetically generated sequences. In addition, the code implements both equilibrium and out-of-equilibrium learning, which allows for an accurate and lossless training when the equilibrium one is prohibitive in terms of computational time, and allows for pruning irrelevant parameters using an information-based criterion.

الأساليب الكمية الأنظمة المضطربة والشبكات العصبية الجزيئات الحيوية