Segmentation

Hello!

I think my question is quite basic, but I cannot find a solution so I'm asking here.

I want to set segmentation of the source file in such a way as to avoid the break when certain abbreviations with a full stop are found, e.g. Art. (of a Law), but I do not know how to do it. Can someone help me?

Thank you in advance

Valeria

emoji
Parents Reply Children
  • Here is a screenshot where you can see that the segments are separated after "nr." and "art.", which are listed in the abbreviations not to be considered for segmentation

    and also after "S.p.A.", which is not listed, but it is not the end of the sentence, because the sentence continues, and this is not recognised.

    Can you tell me why this happens?

    My TMs are all file-based, I do not work online, so I will try to study the "workaround"...

    Screenshot of Trados Studio showing incorrect segmentation after 'S.p.A.' despite it not being the end of the sentence, with personal information redacted.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 6:24 AM (GMT 0) on 29 Feb 2024]
  • I can’t help you with this example. If I recreate it I won’t have a problem. Please provide a sample file, even a couple of sentences, where you can reproduce this problem.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Paul, I don't understand: what do you mean by "providing a sample file"?

    The source file is a normal Word where there is no interruption after "art." and the other abbreviations.

    What do you need exactly?

    emoji
  • Is this ok?

    emoji
  • Thank you.  I asked for the document example in case there were hidden tags in there, or maybe even a document format that forced the segmentation.  But based on what you have provided I can achieve this:

    Screenshot of a Trados Studio document with Italian text, highlighting the use of abbreviations like 'nr.', 'S.r.l.', 'dall'art.', and 'S.p.A.' within the text.

    To do this I just added these to the Italian abbreviation list:

    nr.
    S.r.l.
    dall'art.
    S.p.A.

    I couldn't just add art. because there is no space between the dall' and art. so it's seen as a word in its own right.

    So I guess the next things to check given the more obvious ones didn't work for you either is what TM are you applying this to, what language in your TM are you applying this to and when are you adding them?

    • it must be the TM you are actually using in your project and if you have multiple TMs it must be the one at the top of the list.
    • it must be the Italian language as this is your source:
      Screenshot of Trados Studio settings showing the language options for English (United Kingdom) and Italian (Italy), with a note indicating that the Italian abbreviations list has been customized with new entries.
    • I made these changes to my TM before creating my project so they could take effect when the sdlxliff segmentation is effected.

    I have also attached my TM so you can try it.  Given this 100% works for me a failure to work for you might give us some indication where the problem lies:

    it(IT) - en(GB).zip

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 6:24 AM (GMT 0) on 29 Feb 2024]
  • Paul, you are really precious! Heart eyes
    I did all the operations you suggested. I see that the abbreviations "art." and "nr." (not case sensitive, right?) are included by default in all TMs. However, I added "dall'art." and "dell'art.", "S.p.A." and "S.r.l.", for completeness. Then I re-upgraded the TMs, as suggested by Studio, and reprocessed the files.

    I even re-created a new project and reprocessed the file, but it did not work. I keep getting the separated segments... Sob

    Can there be any other setting to check?

    emoji
  • I think the only other explanation for this is that you are simply doing one or more of these two things:

    1. you are not using the TM you think you are and have applied the settings to a different TM. Hover over your TM in your project settings and see what you used.  Make sure it's what you think you're using:
      Trados Studio project settings showing two translation memories, one with a red exclamation mark indicating an issue.

      OR

    2. you simply don't have your TM activate when creating your project:
      A tooltip in Trados Studio advising to 'Make sure your TM is enabled' pointing to a translation memory with a red exclamation mark.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 6:24 AM (GMT 0) on 29 Feb 2024]
  • No, Paul... this was easy Blush  I have been working with Trados for almost 30 years and, although I am not an expert in technicalities, I know that my TMs are enabled and activated.

    For this specific job I was using 2 TMs (1 general TM for financial statements and 1 specific TM for this customer), both updated after upgrading

    to Studio 2021 (and upgraded again yesterday after I checked and added the abbreviations as you taught me) and, in fact, they were both being

    populated and regularly searchable while I was working... so all this is really strange.

    Anyway, if there is no other setting to check, I will try and try with these ones. At least now I know where to look. Maybe one day this will be resolved.

    THANK YOU!!!

    emoji
  • For this specific job I was using 2 TMs

    Did you add the abbreviations to both TMs or did you make sure the one you added was at the top of your list before creating your project?

    Maybe one day this will be resolved.

    Well.. I don't think we have anything to fix here.  Maybe, if you have some time tomorrow morning we can take a look remotely on a call as I'm really curious to know what the problem is here?  If this could work for you ca you send me an email? - pfilkin@rws.com

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Have had similar issues - and discovered as mentioned that it wass essential to ensure that the first file-based TM in the list also contains the customised abbreviations. I had set up a language resource for DE-EN (my pair) and was wondering why it wasn't having the desired effect.

    emoji