Unsupervised Learning for Molecular Structure Discoveries

dc.contributor.advisorShehu, Amarda
dc.creatorKabir, Kazi Lutful
dc.date.accessioned2023-03-17T19:05:40Z
dc.date.available2023-03-17T19:05:40Z
dc.date.issued2022
dc.description.abstractWe have long known that form determines function. This is particularly true of biological molecules, which utilize their three-dimensional structures to interface with one another and propagate chemical reactions in the living cell. We also now better understand how vast and rich the structure space available to a molecule is and how little we know about what information to extract from this space to better characterize the structure(s)-function(s) relationship in biological molecules. This dissertation puts forth computational concepts and techniques to support this goal. Particularly, we develop algorithms to organize the structure space of a molecule and reveal one or more important structural states of small molecules, macromolecules, and complexated molecules. The algorithms proposed here fall under the umbrella of unsupervised learning but leverage explicit or implicit embeddings of molecular structures in discrete data-structures, such as graphs, to better utilize proximity in structure space for capturing structural states. The proposed algorithms employ diverse formalizations and show the power of those formalizations in addressing increasingly complex problems and application settings. Rigorous evaluation on hallmark problems in computational structural biology suggests that the leveraged formalizations and proposed algorithms advance research on unsupervised learning of the organization of molecular structure spaces.
dc.format.extent143 pages
dc.format.mediumdoctoral dissertations
dc.identifier.urihttps://hdl.handle.net/1920/13170
dc.language.isoen
dc.rightsCopyright 2022 Kazi Lutful Kabir
dc.rights.urihttps://rightsstatements.org/vocab/InC/1.0
dc.subjectClustering Algorithms
dc.subjectMatrix Factorization
dc.subjectMolecular Dynamics
dc.subjectProtein Structure
dc.subjectTensor Factorization
dc.subjectUnsupervised Learning
dc.subject.keywordsBioinformatics
dc.titleUnsupervised Learning for Molecular Structure Discoveries
dc.typeText
thesis.degree.disciplineComputer Science
thesis.degree.grantorGeorge Mason University
thesis.degree.levelDoctoral
thesis.degree.namePh.D. in Computer Science

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kabir_gmu_0883E_12878.pdf
Size:
21.36 MB
Format:
Adobe Portable Document Format