idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instance:
Share on: 
06/06/2025 13:00

Making ChatGPT & Co usable for security domain applications

Michael Lindner Presse
Agentur für Innovation in der Cybersicherheit GmbH

    Making ChatGPT & Co usable for security domain applications

    HEGEMON tackles the evaluation and adaptation of generative foundation models for security-critical applications

    On 4 June 2025, the Agentur für Innovation in der Cybersicherheit GmbH (Cyberagentur) launched the call for HEGEMON - a new research programme aimed at the co-development of holistic benchmarks and AI model adaptations for safety-critical applications. In a competitive "everyone vs. everyone" setting, for the first time generative foundation models are to be adapted and implemented as prototypes for complex geoinformation tasks while being repeatedly evaluated with newly developed holistic measurements.

    A common dilemma: You want to generate an image or summarise a long text. Taking the quick route, you task a generative AI service based on a foundation model (often a large multimodal language model such as GPT-4o), which is supposed to generate the desired image or summary by means of text prompting. Unfortunately, the first results can be frequently unsatisfactory or even incorrect.

    The broad application possibilities of foundation models and associated process acceleration potentials are also of interest to German security authorities. However, if the example of text summarisation is transferred to the security context - e.g. a soldier using such a tool to summarise a long command - the potential source of error takes on a much broader meaning. To date, no comprehensive tests (so-called benchmarks) exist specifically for the German security context in order to evaluate the universal and application-specific impacts of pre-trained foundation models.

    Against this background, the Cyberagentur's HEGEMON research programme ("Holistic Evaluation of Generative Foundation Models in the Security Context") was launched on 4 June 2025. Universities, colleges, research institutions, companies and start-ups are invited to make an offer with their innovative ideas. The programme is designed as a multi-year research competition with several phases and high disruptive potential.

    "With HEGEMON, we are creating a unique experimentation environment to systematically enable the comprehensive evaluation of generative AI models in the security sector - beyond previous benchmark routines," says Dr Daniel Gille, project manager of the programme and Head of Artificial Intelligence at the Cyberagentur. "We invite all research-intensive players to participate in this challenge."

    The research programme addresses a key gap in current AI development: Foundation models such as GPT-4, Midjourney or Claude that are developed internationally - mainly by private companies, mostly in the USA and China - are often opaque in their training data and model architectures and are increasingly dominating security-critical areas. In the German and European context in particular, there is a lack of opportunities to evaluate these models comprehensively and comparatively - especially with regard to complex, multimodal tasks and applications with high demands on safety and facticity.

    Aim of HEGEMON is the development of domain-specific, holistic benchmark sets (consisting of tasks, metrics and test data sets) as well as customised AI models for defined use cases. This combination of benchmark and model development will be tested in a competitive scenario in which all participants evaluate each other's solutions - an "everyone versus everyone" structure with integrated red/blue teaming for robustness testing.

    "We are rethinking the evaluation of large models," continues project manager Dr Daniel Gille. "The desired benchmarks should not only measure what is technically possible, but also what is safe, explainable and relevant to the application."

    The focus is on three use cases from the geoinformation sector:
    • The generation of comprehensible text summaries on country-specific topics,
    • the conversion of remote sensing data into vector data,
    • and a map chatbot with intelligent text output (e.g. "Are there medical facilities on this map? Please share the coordinates if they exist.").

    Cyberagentur uses the tried-and-tested pre-commercial procurement procedure (PCP) for implementation. It allows for risky but fully financed research outside of regular procurement law. PCP is ideal for disruptive projects where no mature market products yet exist - i.e. exactly for the research subject of HEGEMON. Participants retain their usage and exploitation rights, which further strengthens their innovative capacity.

    The call for proposals is the first step in a multi-phase research process that extends over a total of three years. Following an evaluation phase, three interaction points will be defined at which benchmarks and models will be systematically compared, improved and re-evaluated.

    The tender was published in the Supplement to the Official Journal of the European Union with the contract notice number TED 358520-2025 (https://ted.europa.eu/de/notice/-/detail/358520-2025). The deadline for submitting the short concept is 31 July 2025, 10:00 am. Participation is possible both alone and in a consortium.

    Further information can be found at
    https://www.cyberagentur.de/en/programs/hegemon/

    Contact:

    Agentur für Innovation in der Cybersicherheit GmbH
    Große Steinstraße 19
    06108 Halle (Saale)

    Michael Lindner
    Press Officer

    Phone: +49 151 44150 645
    E-Mail: presse@cyberagentur.de

    Background: Cyberagentur

    The Agentur für Innovation in der Cybersicherheit GmbH (Cyberagentur) was founded by the German Federal Government in 2020 as a fully in-house company of the Federal Government under the joint leadership of the German Federal Ministry of Defence and the German Federal Ministry of the Interior and Community with the aim of adopting an application-strategy-related and cross-departmental view of internal and external security in the field of cybersecurity. Against this background, the work of the Cyberagentur is primarily aimed at the institutionalised implementation of highly innovative projects that are associated with a high risk with regard to the achievement of objectives, but at the same time can have a very high potential for disruption if successful.

    The Cyberagentur is part of the National Security Strategy of the Federal Republic of Germany.

    The Cyberagentur is headed by Prof Dr Christian Hummert as Scientific Director and Managing Director and Daniel Mayer as Commercial Director.


    Contact for scientific information:

    Dr Daniel Gille, project manager of the programme and Head of Artificial Intelligence at the Cyberagentur


    Original publication:

    https://www.cyberagentur.de/en/press/hegemon-erforscht-bewertung-und-anpassung-g...


    More information:

    https://www.cyberagentur.de/en/programs/hegemon/


    Images

    Launch of the call for proposals for the HEGEMON research programme to evaluate generative AI models in a security context.
    Launch of the call for proposals for the HEGEMON research programme to evaluate generative AI models ...
    Nancy Glor
    Cyberagentur


    Criteria of this press release:
    Business and commerce, Journalists, Scientists and scholars, Students
    Economics / business administration, Geosciences, Information technology, Mathematics, Physics / astronomy
    transregional, national
    Contests / awards, Research projects
    English


     

    Launch of the call for proposals for the HEGEMON research programme to evaluate generative AI models in a security context.


    For download

    x

    Help

    Search / advanced search of the idw archives
    Combination of search terms

    You can combine search terms with and, or and/or not, e.g. Philo not logy.

    Brackets

    You can use brackets to separate combinations from each other, e.g. (Philo not logy) or (Psycho and logy).

    Phrases

    Coherent groups of words will be located as complete phrases if you put them into quotation marks, e.g. “Federal Republic of Germany”.

    Selection criteria

    You can also use the advanced search without entering search terms. It will then follow the criteria you have selected (e.g. country or subject area).

    If you have not selected any criteria in a given category, the entire category will be searched (e.g. all subject areas or all countries).