Use of any character as break character for segmentation

Hi

Is it possible to use characters other than the ones suggested in Studio (full stop, colon, semicolon, exclamation/question mark, tab) as break characters?

I'd like to segment a text file in which "|" is used as a separator.

Thanks a lot for your help.

Kind regards

Marthe

  • Marthe

    Yes, you can achieve this by creating a new segmentation rule. You do this in the TM's properties, under "Language Resources - Segmentation Rules". You can either modify the existing segmentation rules or add new ones (you will need to be familiar with the regex syntax used for this).

    Please be aware that the amended segmentation rules will affect all users who use this TM in their project settings. If you need the rule only for a certain project or document, you should revert the change once you have opened your document in Studio.

    Walter
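
To make the regex side of this more concrete, here is a rough Python sketch of what a "break after |" rule amounts to. It only illustrates the pattern logic outside Studio. The names BEFORE_BREAK, AFTER_BREAK and segment are hypothetical, and the assumption that a rule is made of a "before break" and an "after break" pattern should be checked against the rule editor in your Studio version. The one thing that is certain is that "|" means "or" in regex, so a literal pipe has to be escaped as \|.

```python
import re

# Hypothetical stand-ins for the two halves of a segmentation rule.
BEFORE_BREAK = r"\|"  # text just before the break: a literal pipe (escaped)
AFTER_BREAK = r"."    # text just after the break: any character except a newline


def segment(text: str) -> list[str]:
    """Split text after each literal '|' that is followed by more text."""
    # A lookbehind/lookahead pair matches the empty position between the
    # two patterns, so the pipe itself stays at the end of its segment.
    boundary = re.compile(f"(?<={BEFORE_BREAK})(?={AFTER_BREAK})")
    parts = boundary.split(text)
    # Trim surrounding whitespace and drop segments that end up empty.
    return [p.strip() for p in parts if p.strip()]


if __name__ == "__main__":
    for seg in segment("First item|Second item|Third item"):
        print(seg)
```

In this sketch the separator stays attached to the preceding segment. Whether Studio keeps it there or treats it differently depends on how the rule is defined, so test the rule on a small sample file first.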
