Műegyetemi Digitális Archívum

Investigating the natural product subspace within the Transformer-VAE foundation model’s drug-like molecule space

Date

Type

könyvfejezet

Language

en

Reading access rights:

Open Access

Rights Holder

Budapest University of Technology and Economics, Department of Measurement and Information Systems

Conference Date

2024.02.05-2024.02.06.

Conference Place

Budapest, Hungary

Conference Title

31th Minisymposium of the Department of Measurement and Information Systems

ISBN, e-ISBN

978-963-421-951-4

Container Title

Proceedings of the 31th Minisymposium

Department

Department of Measurement and Information Systems

Version

Kiadói változat

Faculty

Faculty of Electrical Engineering and Informatics

First Page

95

Subject (OSZKAR)

Drug discovery
Machine Learning
Transformer-VAE
Natural Products

Gender

Konferenciacikk

University

Budapest University of Technology and Economics

OOC works

Abstract

We explore the intricate world of natural product chemistry through the lens of computational modelling. Utilising a Transformer-VAE foundation model originally pre-trained on the GuacaMol dataset, known for its comprehensive collection of small drug-like molecules. We explore the COCONUT dataset’s natural products within this model’s latent space. This approach allows us to investigate these complex natural compounds’ structural organisation and relationships in a latent space tailored to smaller, drug-like molecules. Our findings provide insightful revelations about the similarities and divergences between these two distinct molecular realms. We uncover new perspectives on molecular similarity and potential bioactivity by examining how natural products, with their diverse and often complex structures, are represented and structured in a latent space initially trained on more simplistic molecules. This research sheds light on the capabilities and adaptability of pre-trained models in chemical informatics. It could help pave the way for innovative approaches in discovering and analysing natural products for pharmaceutical applications.

Description

Keywords