idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instance:
Share on: 
08/12/2024 09:55

Independent, complex thinking not (yet) possible after all: Study led by TU shows limitations of ChatGPT & co.

Silke Paradowski Science Communication Centre - Abteilung Kommunikation
Technische Universität Darmstadt

    Darmstadt, August 12, 2024. According to a new study led by TU Darmstadt, AI models such as ChatGPT are apparently less capable of learning independently than previously assumed. According to the study, there is no evidence that what are known as large language models (LLMs) are beginning to develop a general "intelligent" behaviour that would enable them to proceed in a planned or intuitive manner or to think in a complex way. The study will be presented in August at the annual conference of the renowned Association for Computational Linguistics (ACL) in Bangkok, the largest international conference on automatic language processing.

    The research focuses on unforeseen and sudden leaps in the performance of language models, which are referred to as "emergent abilities". After the models were introduced, scientists found that they became more powerful with increasing size and the growing amount of data with which they were trained (scaling). As the tools were scaled up, they were able to solve a larger number of language-based tasks – for example, recognising fake news or drawing logical conclusions. On the one hand, this raised hopes that further scaling would make the models even better. On the other hand, there was also concern that these abilities could become dangerous, as the LLMs could become independent and possibly escape human control. In response, AI laws were introduced worldwide, including in the European Union and the USA.
    However, the authors of the current study have now come to the conclusion that there is no evidence for the presumed development of differentiated thinking in the models. Instead, the LLMs acquired the superficial skill of following relatively simple instructions, as the researchers showed. The systems are still a long way from what humans are capable of. The study was led by TU computer science professor Iryna Gurevych and her colleague Dr Harish Tayyar Madabushi from the University of Bath in the UK.
    "However, our results do not mean that AI is not a threat at all," emphasised Gurevych. "Rather, we show that the purported emergence of complex thinking skills associated with specific threats is not supported by evidence and that we can control the learning process of LLMs very well after all. Future research should therefore focus on other risks posed by the models, such as their potential to be used to generate fake news."
    And what do the results mean for users of AI systems such as ChatGPT? "It is probably a mistake to rely on an AI model to interpret and execute complex tasks without help," explains Gurevych, who heads the Ubiquitous Knowledge Processing (UKP) Lab at the Computer Science Department of TU Darmstadt. "Instead, users should explicitly state what the systems should do and, if possible, give examples. The important thing is: The tendency of these models to produce plausible-sounding but false results – known as confabulation – is likely to persist, even if the quality of the models has improved dramatically in recent times."

    About TU Darmstadt
    TU Darmstadt is one of Germany’s leading technical universities and a synonym for excellent, relevant research. We are crucially shaping global transformations – from the energy transition via Industry 4.0 to artificial intelligence – with outstanding insights and forward-looking study opportunities. TU Darmstadt pools its cutting-edge research in three fields: Energy and Environment, Information and Intelligence, Matter and Materials. Our problem-based interdisciplinarity as well as our productive interaction with society, business and politics generate progress towards sustainable development worldwide. Since we were founded in 1877, we have been one of Germany’s most international universities; as a European technical university, we are developing a trans-European campus in the network, Unite! With our partners in the alliance of Rhine-Main universities – Goethe University Frankfurt and Johannes Gutenberg University Mainz – we further the development of the metropolitan region Frankfurt-Rhine-Main as a globally attractive science location.

    MI No. 36e/2024, mih


    Contact for scientific information:

    Prof Dr Iryna Gurevych
    Hochschulstraße 10, S2|02 B110, 64289 Darmstadt
    Tel.: +49 (0) 6151 16 - 25290; e-mail: iryna.gurevych@tu-darmstadt.de


    Original publication:

    Sheng Lu, Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych: Are Emergent Abilities in Large Language Models just In-Context Learning?
    https://arxiv.org/abs/2309.01809
    https://doi.org/10.48550/arXiv.2309.01809


    Images

    Criteria of this press release:
    Journalists, Scientists and scholars
    Information technology, Language / literature, Social studies
    transregional, national
    Research results, Scientific Publications
    English


     

    Help

    Search / advanced search of the idw archives
    Combination of search terms

    You can combine search terms with and, or and/or not, e.g. Philo not logy.

    Brackets

    You can use brackets to separate combinations from each other, e.g. (Philo not logy) or (Psycho and logy).

    Phrases

    Coherent groups of words will be located as complete phrases if you put them into quotation marks, e.g. “Federal Republic of Germany”.

    Selection criteria

    You can also use the advanced search without entering search terms. It will then follow the criteria you have selected (e.g. country or subject area).

    If you have not selected any criteria in a given category, the entire category will be searched (e.g. all subject areas or all countries).