The Simons Center for Computational Physical Chemistry at NYU regularly hosts visiting scholars to discuss their work. Join us on November 25th for a presentation by Joshua Schrier of Fordham University:
Large Language models for Solid-State Synthesis Predictions and Explanations (and in the classroom)
Abstract:
In this talk, I will describe recent progress on the use of large language models (LLMs) to predict and explain the synthesizability (can it be made?) and selecting precursors (how to make it?) for solid-state inorganic compounds. In our initial work, we examined the ability to make predictions given only the chemical formula of the target compound, and benchmarked pre-trained and fine-tuned LLMs against recent (traditional) machine-learning approaches based on convolutional graph neural networks. Surprisingly, fine-tuned LLMs can solve these problems at levels that are comparable to the best traditional machine learning approaches. The relative ease, speed, and quality of this LLM-based approach suggests both its broader adoption in chemical discovery and use of methods like these as a general baseline for when reporting the performance of more traditional chemical space prediction methods. More recently, we have extended this approach to the prediction of specific polymorphs, in which the structure is represented by a plain text description.. While fine-tuned LLM are competitive with bespoke machine-learning methods, we find that better results can be achieved by training a model on the embedding vector of the text description. We also demonstrate how to use an LLM-based workflow to generate human-readable explanations for the types of factors governing synthesizability, extract the underlying physical rules, and assess the veracity of those rules. These text-based models can be adapted to specialized cases where less data exists by transfer learning, which we demonstrate for the case of oxide perovskites. Finally, I will close with a preliminary report on ways of incorporating this type of methodology into the undergraduate curriculum.
All Simons Center seminars are held in Waverly 540. Refreshments will be served at 10:45, and the seminar begins promptly at 11:00 AM ET.
Or join via Zoom: https://nyu.zoom.us/j/99318701420?pwd=eGVvSzlKWFRlV0ZldnJJbjhYVUtEQT09