idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Grafik: idw-Logo

idw - Informationsdienst
Wissenschaft

Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
13.06.2023 17:47

Punishments and rewards teach AI agents to make the right decisions

Press contact at Linnéuniversitetet: Ulrika Bergström Phone: 0480-49 70 55 Mobile phone: 070-259 36 29 Kommunikationsavdelningen / Communications Department
Schwedischer Forschungsrat - The Swedish Research Council

    In a new dissertation in mathematics, Björn Lindenberg shows how reinforcement learning in AI can be used to create effective strategies for autonomous decision-making in various environments. Reward systems can be developed to reinforce correct behaviour, such as finding optimal pricing strategies for financial instruments or controlling robots and network traffic.

    Reinforcement learning is a part of AI where a digital decision-maker, known as an agent, learns to make decisions by interacting with its environment and receiving rewards or punishments depending on how well it performs its actions.

    The agent receives rewards and punishments in the learning process by acting in an environment and receiving feedback based on its actions. By maximising rewards and minimising punishments, AI gradually learns to perform desirable actions and improve its performance in the given task.

    “My research focuses on reinforcement learning where an agent is placed in an environment. The agent observes the state of the environment at each step, similar to how we humans perceive our surroundings. This could, for example, be the chessboard position, incoming video footage, industrial data, or sensor data from a robot”, says Björn Lindenberg, PhD in mathematics at the Department of Mathematics at Linnaeus University.

    Reinforcement learning trains AI in autonomous decision making. The goal is to develop algorithms and models that help the agent make the best decisions. This is achieved through learning algorithms that take into account the agent’s previous experiences and improve its performance over time.

    There are many applications for reinforcement learning, such as game theory, robotics, financial analysis, and control of industrial processes.

    “The agent makes decisions by choosing an action from a list of options, such as moving a chess piece or controlling a robot movement. These choices can then affect the environment and create a new game situation in chess or provide new sensor values for a robot”, says Björn Lindenberg.

    New Mathematical Model Enhances Reliability in the Learning Process

    In his dissertation, Lindenberg has developed a model for deep reinforcement learning with multiple concurrent agents, which can enhance the learning process and make it more robust and effective. He has also investigated the number of iterations, i.e., repeated attempts, required for a system to become stable and perform well.

    “Deep reinforcement learning is advancing at the same pace as other AI technologies, that is, very rapidly. This is largely due to exponentially increasing hardware capacity, meaning that computers are becoming more and more powerful, along with new insights into network architectures”, Lindenberg continues.

    The more complex the applications become, the more advanced mathematics and deep learning is needed in reinforcement learning. This need is evident in promoting the understanding of existing problems and discovering new algorithms.

    “The methods presented in the dissertation can be incorporated into a variety of decision-making AI applications that, whether we realise it or not, are becoming an increasingly prevalent part of our daily lives,” Lindenberg concludes.


    Wissenschaftliche Ansprechpartner:

    Björn Lindenberg, PhD, +46 73-819 56 19, bjorn.lindenberg@lnu.se


    Originalpublikation:

    Dissertation:
    Reinforcement Learning and Dynamical Systems https://doi.org/10.15626/LUD.494.2023


    Weitere Informationen:

    https://lnu.se/en/meet-linnaeus-university/current/news/2023/new-dissertation-de... Read the full news item at lnu.se
    https://lnu.diva-portal.org/smash/record.jsf?dswid=6097&pid=diva2%3A1756782 Link to the dissertation: Reinforcement Learning and Dynamical Systems(at lnu.se)


    Bilder

    Merkmale dieser Pressemitteilung:
    Journalisten
    Mathematik
    überregional
    Forschungsergebnisse
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).