idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Grafik: idw-Logo

idw - Informationsdienst
Wissenschaft

Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
23.10.2024 17:00

A “chemical ChatGPT” for new medications

Katrin Piecha Dezernat 8 - Hochschulkommunikation
Rheinische Friedrich-Wilhelms-Universität Bonn

    Researchers from the University of Bonn have trained an AI process to predict potential active ingredients with special properties. Therefore, they derived a chemical language model – a kind of ChatGPT for molecules. Following a training phase, the AI was able to exactly reproduce the chemical structures of compounds with known dual-target activity that may be particularly effective medications. The study has now been published in Cell Reports Physical Science. Emabrgo: Do not publish before Wednesday, October 23rd, 5:00 pm CEST!

    Anyone who wants to delight their granny with a poem on her 90th birthday doesn’t need to be a poet nowadays: A short prompt in ChatGPT is all it takes, and within a few seconds the AI spits out a long list of words that rhyme with the birthday girl’s name. It can even produce a sonnet to go with it if you like.

    Researchers at the University of Bonn have implemented a similar model in their study – known as a chemical language model. This does not, however, produce rhymes. Instead, the AI displays the structural formulas of chemical compounds that may have a particularly desirable property: They are able to bind to two different target proteins. In the organism, this means, for example, they can inhibit two enzymes at once.

    Wanted: Active ingredients with a double effect

    “In pharmaceutical research, these types of active compounds are highly desirable due to their polypharmacology,” explains Prof. Dr. Jürgen Bajorath. The computational chemistry expert heads the AI in Life Sciences area at the Lamarr Institute for Machine Learning and Artificial Intelligence and the Life Science Informatics program at b-it (Bonn-Aachen International Center for Information Technology) at Uni Bonn. “Because compounds with desirable multi-target activity influence several intracellular processes and signaling pathways at the same time, they are often particularly effective – such as in the fight against cancer.” In principle, this effect can also be achieved by co-administration of different drugs. However, there is a risk of unwanted drug-drug interactions and different compounds are also often broken down at different rates in the body, making it difficult to administer them together.

    Finding a molecule that specifically influences the effect of a single target protein is no easy task. Designing compounds that have a predefined double effect is even more complicated. Chemical language models may help here in the future. ChatGPT is trained with billions of pages of written text and learns to formulate sentences itself. Chemical language models work in a similar way, but only have comparably very small amounts of data available for learning. However, in principle, they are also fed with texts, such as what are known as SMILES strings, which show organic molecules and their structure as a sequence of letters and symbols. “We have now trained our chemical language model with pairs of strings,” says Sanjana Srinivasan from Bajorath’s research group. “One of the strings described a molecule that we know only acts against one target protein. The other represented a compound that, in addition to this protein, also influences a second target protein.”

    AI learns chemical connections

    The model was fed with more than 70,000 of these pairs. This allowed it to acquire an implicit knowledge of how the normal active compounds differed from those with the double effect. “When we then fed it with a compound against a target protein, it suggested molecules on this basis that would act not only against this protein but also against another,” explains Bajorath.

    The training compounds with the double effect often target proteins that are similar and thus perform a similar function in the body. In pharmaceutical research, however, people are also looking for active ingredients that influence completely different classes of enzymes or receptors. To prepare the AI for this task, fine-tuning took place after the general learning phase. The researchers used several dozen special training pairs to teach the algorithm which different classes of proteins the suggested compounds should target. This is a bit like instructing ChatGPT not to create a sonnet this time, but instead a limerick.

    After the fine-tuning, the model actually spat out molecules that have already been shown to act against the desired combinations of target proteins. “This shows that the process works,” says Bajorath. In his opinion, however, the strength of the approach is not that new compounds exceeding the effect of available pharmaceuticals can immediately be found. “It is more interesting, from my point of view, that the AI often suggests chemical structures that most chemists would not even think of right away,” he explains. “To a certain extent, it triggers ‘out of the box’ ideas and comes up with original solutions that can lead to new design hypotheses and approaches.”

    Participating institutions and funding:
    The study was conducted at the University of Bonn at the Lamarr Institute and b-it.


    Wissenschaftliche Ansprechpartner:

    Prof. Dr. Jürgen Bajorath
    Life Science Informatics
    Tel. +49 (0)228/73-69100
    E-mail: bajorath@bit.uni-bonn.de


    Originalpublikation:

    Sanjana Srinivasan and Jürgen Bajorath: Generation of Dual-Target Compounds Using a Transformer Chemical Language Model; Cell Reports Physical Science; DOI: 10.1016/j.xcrp.2024.102255; https://doi.org/10.1016/j.xcrp.2024.102255


    Bilder

    Three-dimensional structures of two target proteins, histone deacetylase 6 (blue) and tyrosine-protein kinase JAK2 (red), together with a selective inhibitor of each enzyme. The dual inhibitor in the center is active against both targets.
    Three-dimensional structures of two target proteins, histone deacetylase 6 (blue) and tyrosine-prote ...

    Figure: Sanjana Srinivasan & Jürgen Bajorath


    Merkmale dieser Pressemitteilung:
    Journalisten
    Chemie, Informationstechnik
    überregional
    Forschungsergebnisse
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).