Ajou University repository

Computational prediction of protein folding rate using structural parameters and network centrality measures
  • SARASWATHY NITHIYANANDAM

DC Field Value Language
dc.contributor.advisor Gwang Lee -
dc.contributor.author SARASWATHY NITHIYANANDAM -
dc.date.issued 2024-02 -
dc.identifier.other 33532 -
dc.identifier.uri https://aurora.ajou.ac.kr/handle/2018.oak/38858 -
dc.description Thesis (Ph.D.)--Department of Molecular Science and Technology, 2024. 2 -
dc.description.abstract Protein folding is a complex physicochemical process in which a polymer of amino acids samples multiple conformations in its unfolded state before settling into a distinct native three-dimensional (3D) structure. To explain this process, several theoretical studies have taken a collection of 3D structures, computed various structural characteristics, and examined their correlations with the natural logarithm of the protein folding rate (ln(kf)). Unfortunately, these structural features apply only to a limited group of proteins and cannot reliably predict ln(kf) for both two-state (TS) and non-two-state (NTS) proteins. A few machine learning (ML)-based models, trained on smaller datasets, have been presented in an attempt to overcome the shortcomings of the statistical methods. Although all of these techniques are promising, none of them provides an effective folding mechanism. In this study, using newly created datasets, we assessed the predictive power of 10 distinct ML algorithms with five network centrality measures and eight structural characteristics as features. Support vector machine proved the most suitable regressor for predicting ln(kf), outperforming the other nine regressors on each of three different datasets. In addition, combining structural characteristics with network centrality measures enhances prediction performance, suggesting that more than one factor contributes to folding. This thesis aims to advance our understanding of the relationship between protein structure and folding rates, providing valuable insights for both computational biology and experimental studies. The integration of ML techniques with structural and network parameters offers a promising avenue for predicting protein folding rates and contributes to the broader field of bioinformatics. -
dc.description.tableofcontents
1. Introduction
2. Overview of protein folding and kinetics
  2.1. Protein folding
  2.2. Structural class of a protein
  2.3. Protein folding kinetics
  2.4. Protein misfolding and aggregation
3. Overview of machine learning
  3.1. Steps involved in machine learning
  3.2. Machine learning algorithms
4. Material and methods
  4.1. Dataset description and acquisition
  4.2. Structural parameter selection
    4.2.1. Relative contact order
    4.2.2. Absolute contact order
    4.2.3. Total contact distance
    4.2.4. Chain topology parameter
    4.2.5. Fraction of local contact
    4.2.6. Long-range order
    4.2.7. Long-range contact order
  4.3. Network centrality measures
  4.4. Evaluation metrics
5. Results and discussion
  5.1. Structural parameters and their relationship with ln(kf)
  5.2. Network centrality measures and their relationship with ln(kf) of TS and NTS
  5.3. Large scale machine learning regression models
  5.4. Comparison of SVM-based single model with the ensemble models
  5.5. Model interpretation
  5.6. Comparison of SVM-based models with the statistical parameters
  5.7. Supplementary information
6. Conclusion and future work
7. Bibliography -
dc.language.iso eng -
dc.publisher The Graduate School, Ajou University -
dc.rights Ajou University theses are protected by copyright. -
dc.title Computational prediction of protein folding rate using structural parameters and network centrality measures -
dc.type Thesis -
dc.contributor.affiliation Ajou University Graduate School -
dc.contributor.department Graduate School, Department of Molecular Science and Technology -
dc.date.awarded 2024-02 -
dc.description.degree Doctor -
dc.identifier.url https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000033532 -
dc.subject.keyword machine learning -
dc.subject.keyword non-two-state protein -
dc.subject.keyword protein folding rate -
dc.subject.keyword support vector machine -
dc.subject.keyword two-state protein -
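
The record lists relative and absolute contact order among the thesis's structural parameters but does not define them. As a rough illustration only, the sketch below computes both from one coordinate per residue, following the commonly cited definition (average sequence separation between contacting residues, divided by chain length for the relative form). The function name, the single-point-per-residue representation, the 8.0 Å cutoff, and the minimum separation of 1 are all assumptions made for this example; the parameters actually used in the thesis are not stated here.

```python
import math
from typing import List, Sequence, Tuple


def contact_orders(
    coords: Sequence[Tuple[float, float, float]],
    cutoff: float = 8.0,   # assumed contact distance in angstroms
    min_sep: int = 1,      # assumed minimum sequence separation
) -> Tuple[float, float]:
    """Return (relative, absolute) contact order for one chain.

    coords holds one representative point per residue, in sequence order.
    A pair (i, j) counts as a contact when j - i >= min_sep and the two
    points lie within `cutoff` of each other.
    """
    length = len(coords)
    separations: List[int] = []
    for i in range(length):
        for j in range(i + min_sep, length):
            if math.dist(coords[i], coords[j]) <= cutoff:
                separations.append(j - i)
    if not separations:
        return 0.0, 0.0
    # Absolute contact order: mean sequence separation over all contacts.
    absolute = sum(separations) / len(separations)
    # Relative contact order: the same mean, normalised by chain length.
    relative = absolute / length
    return relative, absolute


if __name__ == "__main__":
    # Four hypothetical residues on a straight line, 3.8 angstroms apart.
    toy_chain = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
    print(contact_orders(toy_chain))
```

Values like these would form one column of a feature table; a regressor such as a support vector machine could then be fitted against ln(kf), which is the kind of pipeline the abstract describes.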

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
