Soft break segmentation rule almost working…

Hi all!

I set up a soft break segmentation rule in the main (and only) TM associated with a project, following the instructions found here (community.sdl.com/.../that-manual-line-break-soft-return-segmentation-rule) and here (noradiaz.blogspot.com/.../adding-soft-return-segmentation-rule-to.html). (See pictures 1 and 2.)

[Picture 1: Trados Studio Translation Memory Settings window showing segmentation rules with 'Other terminating punctuation' selected.]
[Picture 2: Trados Studio Edit Segmentation Rule window for 'Soft break' with a regular expression entered in the 'Before break' field.]

It catches about 98% of my soft breaks, rather than the 100% you’d expect. (See picture 3.) What am I missing here?

[Picture 3: Trados Studio segment view with segment numbers 366, 367, and 369, showing a soft break not caught by the rule in segment 367.]



[edited by: Trados AI at 3:27 AM (GMT 0) on 29 Feb 2024]
  • Thank you for your reply.

    I have prepared a test file (uploaded below) from the original file. I have also uploaded a screenshot of the segmentation I get.

    My soft break segmentation rule settings in the only TM attached to my project are “Anything” for both “Before break” and “After break.” When I hit “Advanced View,” “[\n]+” is shown under “Before break,” and “.” is shown under “After break.”
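For reference, here is a rough Python emulation of what that pattern pair does on plain text. It is only a sketch under assumptions: Studio applies its rules with its own engine, so Python's `re` module may not behave identically, and the function name `split_on_soft_breaks` and the sample string are made up for illustration.

```python
import re

# Patterns copied from the rule's Advanced View.
BEFORE_BREAK = r"[\n]+"   # one or more line feeds
AFTER_BREAK = r"."        # any single character except a line feed

def split_on_soft_breaks(text):
    """Split text wherever BEFORE_BREAK is immediately followed by AFTER_BREAK."""
    rule = re.compile(f"(?:{BEFORE_BREAK})(?={AFTER_BREAK})")
    segments, start = [], 0
    for match in rule.finditer(text):
        # The break falls right after the matched run of line feeds.
        segments.append(text[start:match.end()])
        start = match.end()
    segments.append(text[start:])
    # Drop the line feeds themselves so only the line content remains.
    return [segment.strip("\n") for segment in segments]

sample = "LINE.01\nLINE.02\n\nLINE.03"   # hypothetical cell content
print(split_on_soft_breaks(sample))
```

In this emulation, a lone carriage return (`\r`) is not treated as a break, because `[\n]+` only matches line feeds.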

    In the Excel file type, I have enabled “embedded content processing” and added the tag definition rule “\[.+?\]” so as to make all square brackets and their contents non-translatable.
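To illustrate what that tag definition rule matches, here is a small Python sketch. The sample strings and the helper name `find_bracket_tags` are hypothetical, and Python's `re` module is only standing in for whatever engine Studio uses.

```python
import re

# Tag definition rule from the Excel file type settings.
# ".+?" is lazy, so each match stops at the first closing bracket.
TAG_RULE = re.compile(r"\[.+?\]")

def find_bracket_tags(text):
    """Return every square-bracketed span the rule would match."""
    return TAG_RULE.findall(text)

print(find_bracket_tags("Press [OK] or [Cancel] to continue"))
```

Because `.` does not match a line feed by default, a bracketed span that contains a soft break is not matched at all in this emulation.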

    Ideally, I’d like to see no soft breaks at all (meaning I wouldn’t even see tags 0, 5, 7, 12, 15, 18, 19, 20, 22, 24, 28, and 29), with all “LINES” shown separately in their respective segments. That’s what I would have expected the behaviour of this soft break segmentation rule to be anyhow…

    I’m using Trados Studio 2021 SR1.

    Thanks again for taking the time to look at this!

    Attachment: TestFile.xlsx
    [Screenshot of Trados Studio showing segmented text with soft breaks and tags. Lines are labeled LINE.01 to LINE.11 with tags 0, 5, 7, 12, 15, 18, 19, 20, 22, 24, 28, and 29 visible.]

    [edited by: Trados AI at 3:28 AM (GMT 0) on 29 Feb 2024]