idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Grafik: idw-Logo

idw - Informationsdienst
Wissenschaft

idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
30.06.2026 11:37

New DFG project researches moral hallucinations in artificial intelligence

Kathrin Haimerl Abteilung Kommunikation
Universität Passau

    An interdisciplinary team led by ethicist Professor Karoline Reinhardt and computational linguist Professor Annette Hautli-Janisz from the University of Passau is investigating a phenomenon in AI hallucinations that has received little attention to date: when large language models produce moral fallacies that deviate significantly from human values.

    More and more people are using large language models (LLMs) such as GPT, PaLM or Llama – not just for simple information queries, but also for advice on practical matters. It is not uncommon for so-called hallucinations to occur when using such systems. This means that the LLMs provide answers that, whilst appearing realistic, are factually incorrect or misleading.

    An interdisciplinary team led by ethicist Professor Karoline Reinhardt and computational linguist Professor Annette Hautli-Janisz is conducting research into a previously little-noticed type of such fabricated AI responses as part of the DFG project ‘Moral Hallucinations in Large Language Models’. The project focuses on moral hallucinations that can arise when people use LLMs for ethically relevant questions. For example, when they ask the machine whether it is always a moral duty to keep a promise.

    “Moral hallucinations thus go beyond conventional AI hallucinations. They involve not only erroneous or unreliable information, but also distortions in moral argumentation structures,” explains Professor Reinhardt, holder of the Chair of Applied Ethics at the University of Passau. “And this often occurs in situations where people are vulnerable, seeking advice, or at a loss.”

    The structure and consequences of moral AI hallucinations

    This interdisciplinary research project at the University of Passau combines conceptual and empirical methods from applied AI ethics and computational linguistics with the aim of theoretically analysing, ethically evaluating and automatically identifying moral hallucinations generated by large language models (LLMs). To this end, the research team is investigating questions concerning both the structure of moral hallucinations and their ethical consequences. For example: What are the key characteristics of moral hallucinations? What ethical consequences arise if moral claims made by LLMs are not merely biased but are, in fact, hallucinations? What are the argumentative structures of moral hallucinations, and how do they relate to moral judgement in philosophical theories? What are the ethical consequences of using LLMs as moral advisors? How could LLMs be designed to flag moral hallucinations?

    “There is an urgent need to expand existing benchmarks and evaluate models in greater detail, particularly in the field of moral reasoning,” says Professor Hautli-Janisz, holder of the Junior Professorship in Computational Rhetoric and Natural Language Processing. She adds that this is also an area that has received little attention in computational linguistics to date.

    The project is funded by the German Research Foundation (DFG) for a period of three years as part of the DFG Priority Programme “Robust Assessment & Safe Applicability of Language Modelling: Foundations for a New Field of Language Science & Technology (LaSTing)” (SPP 2556). This is a funding programme launched in 2026 that brings together interdisciplinary research at the interface of linguistics, computer science, AI and philosophy, amongst other fields.

    This text was machine-translated from German.


    Wissenschaftliche Ansprechpartner:

    Professor Karoline Reinhardt
    Professorship of Applied Ethics, University of Passau
    E-Mail: Karoline.Reinhardt@uni-passau.de

    Professor Annette Hautli-Janisz
    Junior Professorship of Computational Rhetoric and Natural Language Processing, University of Passau
    E-Mail: Annette.Hautli-Janisz@uni-passau.de


    Bilder

    Symbolic image: A seamless transition between a human hand and a machine.
    Symbolic image: A seamless transition between a human hand and a machine.
    Quelle: Adobe Stock 1877298676
    Copyright: Adobe Stock


    Merkmale dieser Pressemitteilung:
    Journalisten, Lehrer/Schüler, Studierende, Wirtschaftsvertreter, Wissenschaftler, jedermann
    Informationstechnik, Philosophie / Ethik, Sprache / Literatur
    überregional
    Forschungsprojekte
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).