TY - UNPB
T1 - Fine-tuning GPT-3 for machine learning electronic and functional properties of organic molecules
AU - Xie, Zikai
AU - Evangelopoulos, Xenophon
AU - Omar, Ömer
AU - Troisi, Alessandro
AU - Cooper, Andrew I.
AU - Chen, Linjiang
PY - 2023/8/23
Y1 - 2023/8/23
N2 - We evaluate the effectiveness of fine-tuning GPT-3 for the prediction of electronic and functional properties of organic molecules. Our findings show that fine-tuned GPT-3 can successfully identify and distinguish between chemically meaningful patterns, and discern subtle differences among them, exhibiting robust predictive performance for the prediction of molecular properties. We focus on assessing the fine-tuned models' resilience to information loss, resulting from the absence of atoms or chemical groups, and to noise that we introduce via random alterations in atomic identities. We discuss the challenges and limitations inherent to the use of GPT-3 in molecular machine-learning tasks and suggest potential directions for future research and improvements to address these issues.
AB - We evaluate the effectiveness of fine-tuning GPT-3 for the prediction of electronic and functional properties of organic molecules. Our findings show that fine-tuned GPT-3 can successfully identify and distinguish between chemically meaningful patterns, and discern subtle differences among them, exhibiting robust predictive performance for the prediction of molecular properties. We focus on assessing the fine-tuned models' resilience to information loss, resulting from the absence of atoms or chemical groups, and to noise that we introduce via random alterations in atomic identities. We discuss the challenges and limitations inherent to the use of GPT-3 in molecular machine-learning tasks and suggest potential directions for future research and improvements to address these issues.
U2 - 10.26434/chemrxiv-2023-h02j4
DO - 10.26434/chemrxiv-2023-h02j4
M3 - Preprint
BT - Fine-tuning GPT-3 for machine learning electronic and functional properties of organic molecules
PB - ChemRxiv
ER -