Segmentation - Ignore text in square brackets

Hi there,

I have an Excel file that looks like this:

[Do not translate]Translate, translate, translate.[Do not translate]Translate.[Do not translate]Translate, translate, translate!

Per default, Studio identifies that as one segment. However, I want Studio to segment it like this:


[Do not translate]

Translate, translate, translate.

[Do not translate]

Translate.

[Do not translate]

Translate, translate, translate!


I'd like to keep the text in square brackets visible in the file, because it contains context information. I can always lock these segments so they don't hinder my translation flow.

I can think of two ways of getting Studio to segment the file like this - either adjusting the segmentation rules of the TM, or adjusting the file-type definition.


Any pointers on which option is preferable and *how* to actually do it?

Parents
  • Use embedded content processing in the Excel file type.

    Enter \[.+?\] as placeholder:

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Thanks for taking the time to answer, Jerzy, and thanks for your suggestion. It's a step in the right direction. :)

    The text in square brackets now appears as a tag and is not counted as translateable. But the file isn't segmented at the tags. Instead, the tags still appear within the segments:

    Translate, translate, translate.[Tag]Translate, translate.[Tag]Translate.

    Do you have a suggestion how to make Studio segment the text so that each tag is a segment of its own?

    Translate, translate, translate.

    [Tag]

    Translate, translate.

    [Tag]

    Translate.

    (It doesn't have to be tags, I'd be fine with the text appearing as normal text, even as translateable text. I just need the segmentation like this.)

  • You need to edit the advanced settings of the Inline Rule described by Jerzy. There you need to enable "Is Word Stop", so that a punctuation mark directly in front of the tag in question is correctly recognized as segmentation boundary.
    Unfortunately the setting "Inline Tag Behaviour: Exclude" does not work for the Excel filter, but only for the RegEx Text filter.

  • Thanks, Frank. Another step in the right direction ... :)

    It's now segmented the way I want it, but the tags are no longer visible – I need to see them, however, as they contain vital context information.
Reply Children