idw – Informationsdienst Wissenschaft

Nachrichten, Termine, Experten

Grafik: idw-Logo
Grafik: idw-Logo

idw - Informationsdienst
Wissenschaft

Science Video Project
idw-Abo

idw-News App:

AppStore

Google Play Store



Instanz:
Teilen: 
24.01.2007 13:23

For the Europe of 23 languages: the computer learns to translate

Saar - Uni - Presseteam Presse- und Informationszentrum
Universität des Saarlandes

    The new European project EuroMatrix develops automatic translation systems for all official languages of the EU

    The number and scope of translations in business, administration, politics and culture is increasing yearly. The principle clients for translations are export-oriented economic sectors, international organisations including the European Union and the United Nations, as well as multinational corporations. The exploding demand for text translations is one of the direct consequences of open trade practices, globalisation, European integration and international cooperation. The legion of translators is growing. However, with the general trend towards specialisation, the demand on the competence of specialised translators also increases. This leads to a general increase in the cost of a good quality translation. The European Commission cannot, for example, send technical and legal translations among the languages of the European Union to the Philippines, India or other low-wage countries.

    Translators working for large companies and organisations naturally have been using specialised computer programs for years, which are able to fish appropriate suggestions for translations from the body of previous translations. However, this method cannot render a sentence that is not available in the program's memory.

    This is made possible only by automatic translation, which is often represented as being the only realistic solution to managing the communication problems of a multilingual world society. Automatic translation is increasingly viewed by experts as a critical precondition for the enduring preservation of the manifold languages and cultures of our planet.

    Although experts agree among themselves on the immense demand for automatic translation, the opinions regarding the possibilities of such systems with regard to complex texts differ widely. While many experts still doubt that machines can deliver adequate translations, there are corporations which are already reducing costs through the use of computers.

    The truth of the matter lies not quite in the middle, although both sides are right in their own way. Even outdated and low-quality translation systems, which are currently only of very limited use in business and administration, bring costs down. The real breakthrough in automatic translation, however, still lies before us. And this at a point at which the business sector has more or less lost hope in automatic translation.

    For decades, unrealistic expectations were consistently disappointed. From 1975 to 1995, the state of research was largely stagnant. Since 1995, a noteworthy new trend has emerged, breathing new life into the current research: the development of statistical translation techniques. Rather than using complex handmade systems of rules requiring decades of concentrated work to build and refine, one can instead use the body of already translated work through the application of automatic learning processes. The result is computer programs which have taught themselves to translate.

    The human translator must have perfect command of the target language because it is more difficult to compose stylistically excellent language than it is to simply understand content. The statistical translation programs can be similarly perfected in the language into which they translate. This is done via automatic learning, for which large supplies of monolingual text, found in abundance in the internet, are necessary. These autodidacts of translation systems are still far from perfect, but they are improving year by year. An investigation of existing technology shows that both statistical and rule-based translation methods make many mistakes-but they do not make the same mistakes!

    Technological breakthroughs in automatic translation are to be the aim of a new international research program which is being financed under the 6th framework program of the European Union. New combinations of the best currently available methods for rule-based and statistical translation are to be tested. Additionally, the question of how statistical translation systems can be improved by linguistic methods will be investigated. The ambitious aim is to develop and test translation systems for the 23 official languages of the European Union. An important part of the project is also the organization of competitions among the best translation systems, to which all research centers and firms from Europe and other parts of the world are to be invited to participate. It will naturally not be possible to focus equally on each language pair, but a practical result of the project is to be a continuously updated assessment of the technological state for each language. This overview in the form of a large tabular matrix has given the project its name.

    The project is coordinated by Professor Hans Uszkoreit at Saarland University. Dr. Philipp Koehn from the University of Edinburgh is serving as technical/scientific coordinator of the project. Other partners include internationally renowned research groups at Charles University in Prague lead by Prof Jan Hajic and the Evaluation Center CELCT in Trento under the direction of Prof. Amadeus Cappelli.

    The partnership with representatives of industry is important for the success of the project: the German firm GroupWare AG is bringing their LOGOS system into the project, LOGOS being one of the largest and most important commercial translation systems, which was made available the year before last in an Open-Source version. The firm Morphologic in Budapest has developed digital dictionaries for many European languages and has its own translation technology.

    The project has recently kicked off and is planned for 30 months at a cost of 2.5 million Euros.

    A printable picture of Prof. Dr. Hans Uszkoreit is available at
    http://www.uni-saarland.de/mediadb/presse/ProfHansUszkoreit.jpg
    Copyright: Universität des Saarlandes / das bilderwerk

    For further questions:
    Prof. Dr. Hans Uszkoreit
    Phone: +49 (0)681 / 302-5282
    E-Mail: uszkoreit@coli.uni-saarland.de

    Kompetenzzentrum Informatik
    Friederike Meyer zu Tittingdorf
    Phone: +49 (0)681/302-58099
    E-Mail: meyer@cs.uni-sb.de


    Bilder

    Professor Dr. Hans Uszkoreit
    Professor Dr. Hans Uszkoreit
    Quelle: Universität des Saarlandes / das bilderwerk


    Merkmale dieser Pressemitteilung:
    Informationstechnik, Medien- und Kommunikationswissenschaften, Sprache / Literatur
    überregional
    Forschungsprojekte
    Englisch


     

    Hilfe

    Die Suche / Erweiterte Suche im idw-Archiv
    Verknüpfungen

    Sie können Suchbegriffe mit und, oder und / oder nicht verknüpfen, z. B. Philo nicht logie.

    Klammern

    Verknüpfungen können Sie mit Klammern voneinander trennen, z. B. (Philo nicht logie) oder (Psycho und logie).

    Wortgruppen

    Zusammenhängende Worte werden als Wortgruppe gesucht, wenn Sie sie in Anführungsstriche setzen, z. B. „Bundesrepublik Deutschland“.

    Auswahlkriterien

    Die Erweiterte Suche können Sie auch nutzen, ohne Suchbegriffe einzugeben. Sie orientiert sich dann an den Kriterien, die Sie ausgewählt haben (z. B. nach dem Land oder dem Sachgebiet).

    Haben Sie in einer Kategorie kein Kriterium ausgewählt, wird die gesamte Kategorie durchsucht (z.B. alle Sachgebiete oder alle Länder).