Hello, I started using Studio two weeks ago. How to I get Studio to do segment breaks after the closing quotation marks placed immediately after a period?

I started using/learning Studio two weeks ago, so I am a true beginner.. who's very willing to learn.

In the texts I translate, I have a fair amount of quotes... so lots of opening and closing double quotation marks. So far I have been busy mainly aligning pairs of files, from French to English. In those instances, I notice that Studio systematically creates a segment break after the full stop, just before the closing quotation marks. It means a lot of fixing/merging segments. From what I vaguely understand, there must be a way to add a segmentation rule, but I simply do not know how to do it, despite quite a bit of reading and watching video tutorials on the internet.

So, how to I get Studio to create segment breaks AFTER the closing quotation marks instead of BEFORE them?

Parents
  • Hi  

    That sounds strange, would you be able to post a screenshot of what you see? The default behavior is as below.

    In Word:

    Screenshot of text in Word showing two separate lines: 'This is some sample text.' and 'This is some more sample text.'

    In Studio:

    Screenshot of Trados Studio interface displaying two numbered segments: 1 'This is some sample text.' and 2 'This is some more sample text.'

    So, as you can see, with the default segmentation rules you should have the desired behavior.

    By the way, segmentation rules are stored in each TM. One place you can find them is by going to Project Settings - All Language Pairs - Translation Memory and Automated Translation - (Select the appropriate TM here), then select Settings - Language Resources - Segmentation Rules. 

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 1:51 PM (GMT 0) on 28 Feb 2024]
  • Hello Norah,

     

    Thanks for your reply.

    I've attached a screen capture. As you can see, I would like Studio to NOT break in between the full stop and the closing quotation mark.

    I would like to break to occur right AFTER the closing quotation mark so both the period and the quotation mark are the same segment... as they should.

    I hope the attached capture makes my problem clearer to you.

    Any suggestion?

    Screenshot of Trados Studio showing a segment with a full stop and closing quotation mark. A red squiggly line indicates a segment break error after the full stop.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 1:51 PM (GMT 0) on 28 Feb 2024]
  • Hi  

    The problem seems to be caused by having a space before the quotes, I can repro like this:

    Screenshot showing Trados Studio with two segments, one with a space after quotation marks and the other without, highlighting the segmentation issue.

    If you create an exception to the full stop rule in your segmentation rules like this:

    Screenshot of the 'Edit Rule Exception' dialog in Trados Studio with a description 'Quotes with space' and regular expression settings for segmentation rules.

    Then you should achieve this:

    Screenshot showing Trados Studio with corrected segmentation, no space after quotation marks, demonstrating the solution to the issue.

    So go to your TM settings, then Language Resources -> Segmentation Rules -> Add Exception

    See if that helps you?

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 1:52 PM (GMT 0) on 28 Feb 2024]
  • And just in case your sentences are written like this with no paragraph break after the quoted text:

    Space after quotation marks. « Contusion main gauche avec fracture 3 doight gauche, écrasement main gauche avec atteinte nerf cubital. » No space after quotation marks. «Contusion main gauche avec fracture 3 doight gauche, écrasement main gauche avec atteinte nerf cubital.»

    Just create a completely new rule to break after a » in addition to the exception to the fullstop rule:

    Trados Studio Edit Segmentation Rule dialog box showing a rule description 'FR quotes' with settings before break 'Anything', break characters ' ', and after break 'Whitespace (including spaces)'.

    It's tricky to guess what you actually have to deal with without seeing more of the actual text.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 1:52 PM (GMT 0) on 28 Feb 2024]
  • Paul, I have another similar problem/question.

    Say, in my L1 Word document, I have a numbered list like the following one:

    1. Fractures ouvertes

    2. Fractures en bois vert

    3. Fractures avec déplacement

    etc.

    Studio does the break right after the period following the number in the L1 segments,

    but doesn't in the L2 segments. It looks like this:

    L1 segments:                                                               L2 segments:

    1.                                                                                  1. Open fractures

    Fractures ouvertes                                                          2. Greenstick fractures

    2.                                                                                  3. Displaced fractures

    Fractures en bois vert

    3.

    Fractures avec déplacement

    etc.

    I'm aware some people are happy with that default segmentation, but I would prefer Studio to do the breaks

    so the L1 segments look just like they do in my Word doc.

    Do I have to create a rule or an exception to the rule?

    Also what expression do I need so as to get Studio to NOT do a segment break after the period following the number

    in a numbered list?

  • Hi  

    This has less to do with default segmentation and more to do with poorly prepared source files.  If these were really numbered lists in Word (and not just text) then you would not see the numbers at all as Studio would take care of them externally.

    Can you confirm exactly what you would like here because I'm not clear.  You want your segments to look like this in Studio with the numbers included in the segment?

    1. Fractures ouvertes

    2. Fractures en bois vert

    3. Fractures avec déplacement

    Can you provide a small sample file just of the text you provided so we can see what kind of white space has been used to create this source?

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Paul, I'm sorry, I just can't find that exact file, but you've opened my eyes on a different topic, i.e. poorly prepared source files.

    Now that I think about it, I do remember that it wasn't a "proper" numbered list. That's very likely what my problem was. Thank you for your help.

    My original question here dealth with an exception to the full stop segmentation with respect to closing quotation marks in French.

    I have encountered a similar problem, but this time with the question mark:

    By default Studio does the segmentation break AFTER the question mark and BEFORE the closing quotation marks.

    It looks like this:

    1 « Pourquoi est-il parti?

    2 »

    What do I have to do for the segmentation to look like this:

    1 « Pourquoi est-il parti? »

    2

    Do I have to create a new rule or an exception... to which rule exactly?

    What would be the expression?

    Thanks for all your help. It's not exactly an easy piece of software, but I will not give up.

Reply
  • Paul, I'm sorry, I just can't find that exact file, but you've opened my eyes on a different topic, i.e. poorly prepared source files.

    Now that I think about it, I do remember that it wasn't a "proper" numbered list. That's very likely what my problem was. Thank you for your help.

    My original question here dealth with an exception to the full stop segmentation with respect to closing quotation marks in French.

    I have encountered a similar problem, but this time with the question mark:

    By default Studio does the segmentation break AFTER the question mark and BEFORE the closing quotation marks.

    It looks like this:

    1 « Pourquoi est-il parti?

    2 »

    What do I have to do for the segmentation to look like this:

    1 « Pourquoi est-il parti? »

    2

    Do I have to create a new rule or an exception... to which rule exactly?

    What would be the expression?

    Thanks for all your help. It's not exactly an easy piece of software, but I will not give up.

Children
No Data