Learning the language of molecules to predict their properties

 


Discovering new materials and drugs typically involves a manual, trial-and-error process that can take decades and cost millions of dollars. To streamline this process, scientists often use machine learning to predict molecular properties and narrow down the molecules they need to synthesize and test in the lab.

Researchers from MIT and the MIT-IBM Watson AI Lab have developed a new, unified framework that can simultaneously predict molecular properties and generate new molecules much more efficiently than these popular deep-learning approaches.

To teach a machine-learning model to predict a molecule’s biological or mechanical properties, researchers must show it millions of labeled molecular structures — a process known as training. Due to the expense of discovering molecules and the challenges of hand-labeling millions of structures, large training datasets are often hard to come by, which limits the effectiveness of machine-learning approaches.

By contrast, the system created by the MIT researchers can effectively predict molecular properties using only a small amount of data. Their system has an underlying understanding of the rules that dictate how building blocks combine to produce valid molecules. These rules capture the similarities between molecular structures, which helps the system generate new molecules and predict their properties in a data-efficient manner.

This method outperformed other machine-learning approaches on both small and large datasets, and was able to accurately predict molecular properties and generate viable molecules when given a dataset with fewer than 100 samples.

“Our goal with this project is to use some data-driven methods to speed up the discovery of new molecules, so you can train a model to do the prediction without all of these cost-heavy experiments,” says lead author Minghao Guo, a computer science and electrical engineering (EECS) graduate student.

Guo’s co-authors include MIT-IBM Watson AI Lab research staff members Veronika Thost, Payel Das, and Jie Chen; recent MIT graduates Samuel Song ’23 and Adithya Balachandran ’23; and senior author Wojciech Matusik, a professor of electrical engineering and computer science and a member of the MIT-IBM Watson AI Lab, who leads the Computational Design and Fabrication Group within the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). The research will be presented at the International Conference for Machine Learning.

Event Name : International Molecular Biologist Awards

Website Link: molecularbiologist.org

Contact Mail ID : support@molecularbiologist.org

Nomination Link  : https://molecularbiologist.org/award-nomination/?ecategory=Awards&rcategory=Awardee


Follow On:

Twitterhttps://x.com/Camilla532645                                                                   

Blogger https://molecularconference.blogspot.com/ 

Youtube https://www.youtube.com/channel/UCehrwFGWKbQa0mKDDNJCwvA                  

Pinterest https://in.pinterest.com/molecularbiologistawards/                   

Linkedin https://www.linkedin.com/feed/?trk=onboarding-landing               

Instagram https://www.instagram.com/molecularawards

facebook https://www.facebook.com/share/v/1EPhJhbUDg/

#MolecularLanguage #Chemoinformatics #MoleculePrediction #ChemicalProperties #MolecularModeling #MolecularDesign #MolecularScience #MolecularEngineering #MolecularInsights #PredictiveChemistry #MolecularAnalysis #ComputationalChemistry #MolecularInnovation #MolecularResearch #MolecularData

Comments

Popular posts from this blog

Pausing” Cell Death Could Be the Key to Longevity

Record-Shattering Molecule Stores Data at “Dark Side of the Moon” Temperatures

Does cellular senescence hold secrets for healthier aging