idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
24.07.2014 16:27

Software Provides a Clear Overview in Long Documents

Dr. Norbert Aschenbrenner Corporate Communications, Corporate Technology
Siemens AG

    In the future, a software will help users better analyze long texts such as the documents for calls for bids, which are often more than one thousand pages long. Experts at Siemens' global research unit Corporate Technology have developed a search function that enables users to simultaneously look for key words and sections of text in all of the documents of a call for bids, for example, without having to actually open any of the files. This makes the search very fast so that it only takes a few milliseconds before users can read the search results in the documents. The experts also developed a component that checks to see how requirements have changed compared to previous versions of a specific text. As reported in the current issue of "Pictures of the Future" magazine, the ultimate goal is to create a semantic software that recognizes interrelationships in order to find relevant information.

    Corporate Technology originally developed the software as part of a feasibility study regarding the digitization of all land registers in Germany. A system was required that could record automated information regarding owners, property sizes, outstanding mortgages, and other matters from the land registers of the past 50 years (around 500 million pages of PDF files). The software had to be able to extract the required information with the help of the respective document structure. The software also had to be able to handle scans of poorly copied typewritten pages or repeatedly corrected documents.

    To develop the software for calls for bids in industry, the researchers at CT are cooperating closely with colleagues from the corresponding Siemens businesses. The researchers are using this as a basis for developing characteristic search algorithms that enable users to find all of the information that a document contains about certain topics such as safety or pollution control.

    Because calls for bids are repeatedly adjusted during a project, the software then identifies and displays any changes compared to previous versions of the document. In the third step, the software looks for analogies to previous, similar calls for bids so that users can see how certain requirements were evaluated in the past. The automatic semantic evaluation of large documents for a bid saves time, prevents mistakes, and makes it easier for users to integrate and analyze changes that were made at short notice

    Press Picture: http://www.siemens.com/press/en/presspicture/innovationnews/2014/in20140704-01.h...


    Weitere Informationen:

    http://www.siemens.com/innovationnews


    Bilder

    Merkmale dieser Pressemitteilung:
    Journalisten
    Informationstechnik
    überregional
    Forschungs- / Wissenstransfer
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).