segmentation problem with .xlf file

Hi,

A client wants to send .xlf files from now on. However, segmentation is problematic. Studio does not create segments based on sentences, even though that is the TM setting. It seems to consider all text within a CDATA tag as one single block.

The tag that seems to block correct segmentation is <![CDATA[herecomesthetext]]>
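
To illustrate what I think is happening, here is a minimal Python sketch. It uses a generic XML parser, not Studio, and the XLIFF fragment is a made-up example rather than the client's real file. From the parser's point of view, everything inside the CDATA section is one plain text string, and the <p> and <h2> markup inside it is just literal characters, not child elements:

```python
import xml.etree.ElementTree as ET

# Hypothetical XLIFF 1.2 fragment (an assumption for illustration only;
# the client's real file may be structured differently).
XLIFF = """<xliff version="1.2" xmlns="urn:oasis:names:tc:xliff:document:1.2">
  <file original="sample.html" source-language="en" target-language="de" datatype="html">
    <body>
      <trans-unit id="1">
        <source><![CDATA[<p>First sentence. Second sentence.</p><h2>A heading</h2>]]></source>
      </trans-unit>
    </body>
  </file>
</xliff>"""

NS = {"x": "urn:oasis:names:tc:xliff:document:1.2"}

root = ET.fromstring(XLIFF)
source = root.find(".//x:source", NS)

# The whole CDATA section comes back as a single text string ...
cdata_text = source.text
# ... and the <source> element has no child elements at all.
child_count = len(list(source))

print(repr(cdata_text))
print(child_count)
```

This only shows how a standard XML parser treats CDATA; whether Studio's XLIFF file type behaves the same way internally is my assumption.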

Does anyone have a solution for this? How can I make Studio apply sentence-based segmentation within those blocks as well?

Thanks in advance for your help.

Best regards,

Michael

Parents Reply
  • Hi Paul,

    I understand that the XLIFF file my client sent me is not structured the way it should be. I will definitely pass on your message. But chances are I will still have to deal with this kind of file.

    Your settings file is a major improvement, but it still doesn't segment correctly where the text contains <p> and <h2> tags. Is there a way around that? Can I create an extra rule somewhere to make Studio recognize the closing tags </p> and </h2> as the end of a segment?

    I tried to add two segmentation rules at the TM level (to make Studio treat </p> and </h2> as break characters), but that made the segmentation even worse than before ...
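
To make clear what result I am after, here is a rough Python sketch of the segmentation I would expect. It is not a Studio setting or a Studio API; the function name `segment` and both regular expressions are my own assumptions, and the sentence splitting is deliberately naive (it would break on abbreviations such as "e.g." and ignores tags with attributes):

```python
import re

# Assumption: only plain <p>...</p> and <h2>...</h2> blocks occur,
# without attributes and without nesting.
BLOCK = re.compile(r"<(p|h2)>(.*?)</\1>", re.DOTALL)

# Naive sentence boundary: whitespace that follows ., ! or ?
SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def segment(cdata_text):
    """Split CDATA content into segments: first per block, then per sentence."""
    segments = []
    for match in BLOCK.finditer(cdata_text):
        # A closing </p> or </h2> always ends the current segment.
        block_text = match.group(2).strip()
        for sentence in SENTENCE_END.split(block_text):
            if sentence:
                segments.append(sentence)
    return segments


example = "<p>First sentence. Second sentence.</p><h2>A heading</h2><p>Third one!</p>"
result = segment(example)
print(result)
```

For the example string, I would want four segments: the two sentences of the first paragraph, the heading, and the last paragraph.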

    Thanks again for your valuable help!

    [edited by: Trados AI at 2:49 PM (GMT 0) on 1 Mar 2024]