idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Grafik: idw-Logo

idw - Informationsdienst
Wissenschaft

Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instance:
Share on: 
07/24/2023 11:14

AI predicts the work rate of enzymes

Dr.rer.nat. Arne Claussen Stabsstelle Presse und Kommunikation
Heinrich-Heine-Universität Düsseldorf

    Bioinformatics: Publication in Nature Communications

    Enzymes play a key role in cellular metabolic processes. To enable the quantitative assessment of these processes, researchers need to know the so-called “turnover number” (for short: kcat) of the enzymes. In the scientific journal Nature Communications, a team of bioinformaticians from Heinrich Heine University Düsseldorf (HHU) now describes a tool for predicting this parameter for various enzymes using AI methods.

    Enzymes are important biocatalysts in all living cells. They are normally large proteins, which bind smaller molecules – so-called substrates – and then convert them into other molecules, the “products”. Without enzymes, the reaction that converts the substrates into the products could not take place, or could only do so at a very low rate. Most organisms possess thousands of different enzymes. Enzymes have many applications in a wide range of biotechnological processes and in everyday life – from the proving of bread dough to detergents.

    The maximum speed at which a specific enzyme can convert its substrates into products is determined by the so-called turnover number kcat. It is an important parameter for quantitative research on enzyme activities and plays a key role in understanding cellular metabolism.

    However, it is time-consuming and expensive to determine kcat turnover numbers in experiments, which is why they are not known for the vast majority of reactions. The Computational Cell Biology research group at HHU headed by Professor Dr Martin Lercher has now developed a new tool called TurNuP to predict the kcat turnover numbers of enzymes using AI methods.

    To train a kcat prediction model, information about the enzymes and catalysed reactions was converted into numerical vectors using deep learning models. These numerical vectors served as the input for a machine learning model – a so-called gradient boosting model – which predicts the kcat turnover numbers.

    Lead author Alexander Kroll: “TurNuP outperforms previous models and can even be used successfully for enzymes that have only a low similarity to those in the training dataset.” Previous models have not been able to make any meaningful predictions unless at least 40% of the enzyme sequence is identical to at least one enzyme in the training set. By contrast, TurNuP can already make meaningful predictions for enzymes with a maximum sequence identity of 0 – 40%.

    Professor Lercher adds: “In our study, we show that the predictions made by TurNuP can be used to predict the concentrations of enzymes in living cells much more accurately than has been the case to date.”

    In order to make the prediction model easily accessible to as many users as possible, the HHU team has developed a user-friendly web server, which other researchers can use to predict the kcat turnover numbers of enzymes.

    Background: Machine learning and deep learning

    Deep learning models comprise multi-layered artificial neural networks which can recognise and process patterns in the input data. Using large training datasets is the optimum way to train a deep learning model to process numerical inputs.

    Gradient boosting models are a machine learning method, which produces large numbers of decision trees. The results of all decision trees for a specific input are used to make predictions. Similar to deep learning, training data are used to refine the model, i.e. to produce the decision trees.

    Detailed caption:
    Schematic diagram of the prediction process for the turnover numbers of enzymatic reactions: Enzymes are amino acid sequences; these sequences are converted into numerical vectors, depicted as grey squares, which are then transformed into a single vector by a deep learning model (top left). Information about the catalysed reactions is also converted into numerical vectors (top right). Experimentally determined turnover numbers (bottom left) are used to train a gradient boosting model in order to predict the kcat turnover number (bottom right). Gradient boosting models are an ensemble of multiple decision trees, depicted in different green tones. (Fig.: HHU/Alexander Kroll)


    Original publication:

    Kroll, A., Rousset, Y., Hu, XP., Liebrand, N. & Lercher, M.J.; Turnover number predictions for kinetically uncharacterized enzymes using machine and deep learning; Nature Communications 14, 4139 (2023).

    DOI: 10.1038/s41467-023-39840-4


    More information:

    https://turnup.cs.hhu.de/


    Images

    Schematic diagram of the prediction process for the turnover numbers of enzymatic reactions. Detailed caption at the end of text.
    Schematic diagram of the prediction process for the turnover numbers of enzymatic reactions. Detaile ...

    HHU/Alexander Kroll


    Criteria of this press release:
    Journalists, Scientists and scholars
    Biology, Information technology
    transregional, national
    Research results, Scientific Publications
    English


     

    Help

    Search / advanced search of the idw archives
    Combination of search terms

    You can combine search terms with and, or and/or not, e.g. Philo not logy.

    Brackets

    You can use brackets to separate combinations from each other, e.g. (Philo not logy) or (Psycho and logy).

    Phrases

    Coherent groups of words will be located as complete phrases if you put them into quotation marks, e.g. “Federal Republic of Germany”.

    Selection criteria

    You can also use the advanced search without entering search terms. It will then follow the criteria you have selected (e.g. country or subject area).

    If you have not selected any criteria in a given category, the entire category will be searched (e.g. all subject areas or all countries).