Segmentation Rule

Dears,

 

I would like to ask the below questions.

  1. I have created TM with a segmentation rule full stop and translate the file against it.Can i export the TM as TMX and re-create new TM with the exported TMX but with a Paragraph mark segmentation rule ? if yes, How can I see the changes? as i can't see any change and the segments are still segmented with a full stop.
  2. Can we change the segmentation of a translated bilingual file from full stop to paragraph mark?

 

 

Best Regards,

Samar

Parents
  • Hi  

    A TMX file doesn't hold segmentation rules.

    If you want to use paragraph segmentation for all future files then creating a new TM with this option will work for all future files. But if you import your TMX into this new paragraph based TM the segments will still only be sentence based as these are already defined in the TMX. I'm not aware of any tools that can go from sentence to paragraph... only a few that go the other way around.  Part of the problem I guess is that a TM is not a true reflection of the original documents so making sure the paragraphs were really correct would be tricky if not impossible and technically the TMX puts all segments, whether sentence based or paragraph based into a single TU.  So there is nothing in the TMX to tell any tool whether the TUs were part of a larger entity or not.

    What may be useful if is that if you do this the fragment matching feature can pick out the TUs. So whilst you won't get proper pretranslation leverage at least you would still be able to leverage the work interactively:

    Once you have converted your bilingual file that's it.  You can't change the segmentation at this point, you need the source file for that.  Perhaps a potential solution would be to align the source and target files with a TM set up for paragraph based segmentation instead of trying to change the bilingual files... although I wouldn't hold my breath!

    If there is a solution for this out there I'd also be interested to learn.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Thank you for your replay.

    I would like to ask you a question. While trying to create SDL Project with a new Paragraph based TM, The new created file is not segmented with Paragraph as per the below is a screenshots.

    I have expected that each highlighted paragraph will presented  in only one segment in studio but this didn't happen.and the text is also segmented by full stop.

     

     

    Best Regards,

    Samar

  • Looks like you did not add the TM to the project (AND turned it on) BEFORE performing the 'Convert to translatable format' batch task.
    This task (and eventually also the 'Copy to target languages' task) actually does the segmentation.
Reply Children
No Data