Segmentation Rule

Dears,

 

I would like to ask the below questions.

  1. I have created TM with a segmentation rule full stop and translate the file against it.Can i export the TM as TMX and re-create new TM with the exported TMX but with a Paragraph mark segmentation rule ? if yes, How can I see the changes? as i can't see any change and the segments are still segmented with a full stop.
  2. Can we change the segmentation of a translated bilingual file from full stop to paragraph mark?

 

 

Best Regards,

Samar

Parents Reply Children
  • It could be that the source format played a big role in my case... it was MadCap Flare XML/HTML, so the segmentation is pretty much defined by the file type, rather than the TM-defined rules.

    Of course I cannot be expected to know how Studio works internally... all I know is that I:

    - created new empty TM where I changed the segmentation to Paragraph based for both source- and target language

    - used this TM for running the alignment

    That's all. I don't (and can't) know exactly which "magic" (or coincidence) made it to align just as one would expect ;-). Perhaps is the internal "reversed" TM created by reversing the actual TM (similarly to what AnyTM does)? It would quite make sense...

    I didn't explore it any deeper as we ended up not going further with paragraph-based segmentation and went the harder way of sentence-based  segmentation.

  • Hi  

    I would never of thought of doing that as I usually select an existing TM and it's too late at this point.  But you are absolutely right... and I'm really happy to see this:

    Thank you for sharing this information.... something we should definitely document somewhere as I'm sure it will be useful to many users.  Or maybe I was the only one who didn't know this!!

    Thanks

    Paul

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • I believe that not many users actually know this... because the TM creation/settings GUI hides the fact that there is more languages than just the source one from the user, and even discovering the dropdown content does not make immediately clear to the user what are the consequences of it.
    I'm not quite sure this is intentional... though one may presume that 'not making it too complex for user' might have been the driver, but in that case I would assume "synchronizing" some elementary settings (at least the segmentation type for sure) between the source and target language automatically in the background.
  • Hi ,

    I spent some time testing this tonight and I have to say I think we were lucky with the paragraph segmentation rule. Try it with any other kind of segmentation rules and the effect is mindblowing... at least I'm really struggling to see any logic in how this works. I certainly think my original assumption in every other case I tested this evening was correct. I don't think you can effect the target segmentation rules in the way described at all. I also looked at retrofit and this seems to do something else again.

    I can only conclude that whilst it seemed to make perfect sense, and I really wanted that to work, it does not. At least not in every case. I'm going to try and get to the bottom of this so I can understand what's going on.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Unknown said:
    Try it with any other kind of segmentation rules and the effect is mindblowing... at least I'm really struggling to see any logic in how this works.

    Hmmmm... just out of curiosity, you are testing it with the latest version, I suppose... the one with god-knows-how broken segmentation rules...
    Would you mind doing same tests with "last (kind-of)sensibly-behaving version", i.e. 2017 CU5 (last pre-SR1) and 2015 SR3 (vanilla, w/o CUs)?
    I feel that it may behave differently...

  • Hi Evzen,

    I have actually found a few more interesting things... albeit embarrassing!

    1. I was aligning a completely different target file (same name, different location) and didn't notice
    2. Studio 2017, current version, actually handles the custom rules exactly as you said and works as expected
    3. Studio 2015 completely ignores the use of custom rules so actually 2017 SR1 CU9 works correctly for me. It's an improvement.
    4. Retrofit alignment works differently and won't apply any custom rules

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub