Terminology-aware machine translation in Trados 2024

Dear Community!

We are currently analyzing the upgrade of current 2022 licenses to 2024, therefore studying all the novelties offered by this new version.

According to the information available, MT results can be modified by the terminology available in the Multiterm dictionary, isn´t it?

So as far as I can understand, the working principle is the same as with the DeepL glossaries (which I personally prefer to call "MT correction lists", cause they are 100% personalized to the type of text translated and provide almost no information for a better understanding of a specific term). Or are there other benefits that I don´t see?

However, the "handicap" I see is how the MT provider will "choose" the term from the list of terms available in the Multiterm database?

Practical example 1: 

PLANO (Spanish term): in our corporate MDB we have three entries for "plano": 1) wheel tread defect 2) plane (vertical, horizontal...) and 3 ) drawing. 

In this particular case, which term would the AI Assistant 2024 be aware???? 

Practical case 2:

Bocina de agudos: this entry has 3 synonym terms in English: high-pitch horn, high-pitched horn and whistle.

Which synonym of the three would the AI Assistant use in this case? 

DeepL glossaries are more specific, i.e. they allow to correct a specific word with another specific word, only one. 

However, if the Multiterm glossary is applied in Trados 2024 following the same principle...how will the algorythm  work? And then, can Multiterm termonology be used on the MT provided by DeepL? Or is is mandatory to use other MT resources (Azure, Open Ai, etc?)

Thanks in advance!

emoji
Parents
  •  

    the working principle is the same as with the DeepL glossaries

    Well, "jein", as we might say in German. Yes and no. At this point, the AI is just given the recognized terms, without any metadata. Insofar yes, your termbase is in a way reduced to a glossary. I hope this will change and there will be a way to choose other fields from the termbase to be handed to the AI.

    If you prompt the AI to give you several suggestions, you will see that it varies which of the synonyms it uses. And you can instruct it to use a fitting synonym. Insofar, no, it's not like a DeepL glossary. That will always slot in the first fitting term it comes across, even if it makes little sense. With the AI, you can instruct it how to select a synonym. You might have to custom tailor your prompt for a particular translation to tell AI what the context is, but even a general instruction works. See screenshots below, I created a termbase with two translations for the English word "stain", and gave it two sentences. I also instructed AI to explain its choice.

    In the first sentence, MT got the correct term, which is accepted by AI. In the second sentence, MT got the wrong term, and AI corrects it:

    Screenshot of MultiTerm translation software showing two English to German translation results with term recognition for the word 'stain'. AI Assistant provides an explanation for choosing 'Fleck' as the translation.

    Screenshot of MultiTerm translation software displaying English to German translation results. The AI Assistant explains the choice of 'Beize' for the word 'stain' in the context of wood staining.

    This is just a quick test. I am sure this could be refined substantially.

    emoji


    Generated Image Alt-Text
    [edited by: RWS Community AI at 8:18 PM (GMT 1) on 16 Oct 2024]
  • Daniel, hello!

    In the first place, thank you for the answer! As far as I can understand, this is the interface of the AI Assistant in Trados 2024... And once, the MT result is corrected I have to click on it to insert it in the segment, right?

    ANd then the prompts...they are written for each term?? Or a kind of a list of prompts can be created to "train" AI???

    emoji
  •  

    a) Yes, that is the AI Assistant in Trados 2024.

    b) The way I set it up, you have to click to insert it. But you could automate this and use it as a batch task. As Paul pointed out, the current AIs are fuzzy – that's their glory and their weakness. So you will want to look at what it's doing, at the very least check the results.

    c) One prompt, hastily written in this case. It makes sense to refine prompts. I could imagine customizing a prompt for a project, like “The text you are working on is about cleaning textiles.” If you can give the LLM relevant information, by all means do.

    Screenshot of a terminology test section with a paragraph explaining the role of an editor and terminology expert. A 'True' checkbox is ticked, and there are icons for editing and deleting.

    The one serious weakness I see is how forbidden terms are handled. They, too, are just passed on to AI as relevant terms. If your termbase has a lot of those, you might have to create a separate termbase to get rid of those. In the above example, if I had "Klecks" as forbidden term, I doubt AI could reliably select "Fleck". All it get is a list like: "stain-Fleck, stain-Klecks, stain-Beize". Across the board, handling forbidden terms is a topic that urgently needs attention IMHO.

    But as for your main question, AI's terminology-aware translations are seriously above a DeepL-style glossary if you prompt it well. As I said in other posts before, I believe this is a big game-changer technology for this industry, just like the ascent of MT was ten years ago.

    Would be worth checking whether a plain terminology-redacting task requires GPT-4o, or whether it can run on a simpler, faster and cheaper model like 3.5turbo. There is so much that could be done!

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: RWS Community AI at 7:34 AM (GMT 1) on 17 Oct 2024]
  • Dear Daniel, 

    sorry for the late reply. Very grateful for the explanations, got it: we have to "train" the AI Assistant to make it apply out Multiterm terminology properly. Until I have the version and test it, I will not see how it actually works. 

    However, what I see on your screenshots is that the MT modification is performed in a separate window below, not directly in the segment. That could be less "comfortable and more time consuming. However, we have to give it a try. 

    Best regards, Daniel!

    emoji
Reply Children
No Data