idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
12.08.2024 09:55

Independent, complex thinking not (yet) possible after all: Study led by TU shows limitations of ChatGPT & co.

Silke Paradowski Science Communication Centre - Abteilung Kommunikation
Technische Universität Darmstadt

    Darmstadt, August 12, 2024. According to a new study led by TU Darmstadt, AI models such as ChatGPT are apparently less capable of learning independently than previously assumed. According to the study, there is no evidence that what are known as large language models (LLMs) are beginning to develop a general "intelligent" behaviour that would enable them to proceed in a planned or intuitive manner or to think in a complex way. The study will be presented in August at the annual conference of the renowned Association for Computational Linguistics (ACL) in Bangkok, the largest international conference on automatic language processing.

    The research focuses on unforeseen and sudden leaps in the performance of language models, which are referred to as "emergent abilities". After the models were introduced, scientists found that they became more powerful with increasing size and the growing amount of data with which they were trained (scaling). As the tools were scaled up, they were able to solve a larger number of language-based tasks – for example, recognising fake news or drawing logical conclusions. On the one hand, this raised hopes that further scaling would make the models even better. On the other hand, there was also concern that these abilities could become dangerous, as the LLMs could become independent and possibly escape human control. In response, AI laws were introduced worldwide, including in the European Union and the USA.
    However, the authors of the current study have now come to the conclusion that there is no evidence for the presumed development of differentiated thinking in the models. Instead, the LLMs acquired the superficial skill of following relatively simple instructions, as the researchers showed. The systems are still a long way from what humans are capable of. The study was led by TU computer science professor Iryna Gurevych and her colleague Dr Harish Tayyar Madabushi from the University of Bath in the UK.
    "However, our results do not mean that AI is not a threat at all," emphasised Gurevych. "Rather, we show that the purported emergence of complex thinking skills associated with specific threats is not supported by evidence and that we can control the learning process of LLMs very well after all. Future research should therefore focus on other risks posed by the models, such as their potential to be used to generate fake news."
    And what do the results mean for users of AI systems such as ChatGPT? "It is probably a mistake to rely on an AI model to interpret and execute complex tasks without help," explains Gurevych, who heads the Ubiquitous Knowledge Processing (UKP) Lab at the Computer Science Department of TU Darmstadt. "Instead, users should explicitly state what the systems should do and, if possible, give examples. The important thing is: The tendency of these models to produce plausible-sounding but false results – known as confabulation – is likely to persist, even if the quality of the models has improved dramatically in recent times."

    About TU Darmstadt
    TU Darmstadt is one of Germany’s leading technical universities and a synonym for excellent, relevant research. We are crucially shaping global transformations – from the energy transition via Industry 4.0 to artificial intelligence – with outstanding insights and forward-looking study opportunities. TU Darmstadt pools its cutting-edge research in three fields: Energy and Environment, Information and Intelligence, Matter and Materials. Our problem-based interdisciplinarity as well as our productive interaction with society, business and politics generate progress towards sustainable development worldwide. Since we were founded in 1877, we have been one of Germany’s most international universities; as a European technical university, we are developing a trans-European campus in the network, Unite! With our partners in the alliance of Rhine-Main universities – Goethe University Frankfurt and Johannes Gutenberg University Mainz – we further the development of the metropolitan region Frankfurt-Rhine-Main as a globally attractive science location.

    MI No. 36e/2024, mih


    Wissenschaftliche Ansprechpartner:

    Prof Dr Iryna Gurevych
    Hochschulstraße 10, S2|02 B110, 64289 Darmstadt
    Tel.: +49 (0) 6151 16 - 25290; e-mail: iryna.gurevych@tu-darmstadt.de


    Originalpublikation:

    Sheng Lu, Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych: Are Emergent Abilities in Large Language Models just In-Context Learning?
    https://arxiv.org/abs/2309.01809
    https://doi.org/10.48550/arXiv.2309.01809


    Bilder

    Merkmale dieser Pressemitteilung:
    Journalisten, Wissenschaftler
    Gesellschaft, Informationstechnik, Sprache / Literatur
    überregional
    Forschungsergebnisse, Wissenschaftliche Publikationen
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).