Converting text to tags in TM

Hello folks,

I have an XML file which I processed with an embedded content rule to have links appear as tags in the editor.

However, my TM (from which I'd like to retrieve many 100% matches) was built up without this rule and simply contains "raw" text, therefore I'm not getting any 100% matches. 

Is there something I can do to have my TM give me 100% matches (by converting text into tags/embedded content in the TM perhaps?)?

Screenshot of Trados Studio interface showing a segment with embedded content tags. Source text includes tags and target text shows a 97% match without tags.

Many thanks for your precious help,



Generated Image Alt-Text
[edited by: RWS Community AI at 6:41 PM (GMT 0) on 14 Nov 2024]
emoji
Parents
  •   

    Oh no! I see the problem but we dont have a TM tag mapping to help address scenario's like this.
    Would be a good app - which I have noted so watch this space.

    What you can do is change the TM penalties to 0, which will give you 100% match

    Trados Studio project settings showing penalties section with a red arrow pointing to 'Different formatting penalty' set to 1.

    But you will still have to move over the source tags to target tags, so while your analysis will be correct your effort will not be saved.

    Depending how big your TM is and depending how big a project this is, I would be tempted to see if you can export your TM to TMX.
    Goal would be to amend how the TM data is being displayed, to update new TM that has correct/improved tag formatting.
    You would need to download the TMX file type from the private app store as well as Data Protection Suite 

    I dont have your files, so here is a bit of an experiment based on my files.

    See TM content with <> raw tags. Export as TMX
    Translation Memory search window in Trados Studio displaying source and target text with raw HTML tags. 
    See TMX file in Editor View, using TMX file Type

    Editor view of TMX file in Trados Studio showing 100% match with source and target segments containing raw HTML tags.

    I created a rule using Data Projection Suite, stating that content between <> should be tagged
    Data Protection Suite rule setup in Trados Studio with a regex pattern for content between angle brackets.

    Once complete, I then have this

    Editor view of TMX file in Trados Studio with segments highlighted in purple, indicating tagged content as per Data Protection Suite rules.

    By no means is my example perfect as my regex rule was too inclusive and some actual translated content got tagged.
    Mostly likely I would need to specify each <tag name> individually to get a more accurate results.
    But conceptually I hope it gives you an idea of how TMX with Data Projection Suite can be used to review your "*.sdlTM" data

    Then you would updated to a new TM and hopefully achieve not only better analysis results but also reduced tagging effort. 

    I hope this gives you something to explore if you feel it necessary
    Have a good day

    Lyds 

    Lydia Simplicio | RWS Group

    _______
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 8:04 AM (GMT 0) on 29 Feb 2024]
Reply
  •   

    Oh no! I see the problem but we dont have a TM tag mapping to help address scenario's like this.
    Would be a good app - which I have noted so watch this space.

    What you can do is change the TM penalties to 0, which will give you 100% match

    Trados Studio project settings showing penalties section with a red arrow pointing to 'Different formatting penalty' set to 1.

    But you will still have to move over the source tags to target tags, so while your analysis will be correct your effort will not be saved.

    Depending how big your TM is and depending how big a project this is, I would be tempted to see if you can export your TM to TMX.
    Goal would be to amend how the TM data is being displayed, to update new TM that has correct/improved tag formatting.
    You would need to download the TMX file type from the private app store as well as Data Protection Suite 

    I dont have your files, so here is a bit of an experiment based on my files.

    See TM content with <> raw tags. Export as TMX
    Translation Memory search window in Trados Studio displaying source and target text with raw HTML tags. 
    See TMX file in Editor View, using TMX file Type

    Editor view of TMX file in Trados Studio showing 100% match with source and target segments containing raw HTML tags.

    I created a rule using Data Projection Suite, stating that content between <> should be tagged
    Data Protection Suite rule setup in Trados Studio with a regex pattern for content between angle brackets.

    Once complete, I then have this

    Editor view of TMX file in Trados Studio with segments highlighted in purple, indicating tagged content as per Data Protection Suite rules.

    By no means is my example perfect as my regex rule was too inclusive and some actual translated content got tagged.
    Mostly likely I would need to specify each <tag name> individually to get a more accurate results.
    But conceptually I hope it gives you an idea of how TMX with Data Projection Suite can be used to review your "*.sdlTM" data

    Then you would updated to a new TM and hopefully achieve not only better analysis results but also reduced tagging effort. 

    I hope this gives you something to explore if you feel it necessary
    Have a good day

    Lyds 

    Lydia Simplicio | RWS Group

    _______
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 8:04 AM (GMT 0) on 29 Feb 2024]
Children