segmentation problem with .xlf file

Hi,

A client wants to send .xlf files from now on. However, segmentation is problematic. Studio does not create segments based on sentences, even though that is the TM setting. It seems to consider all text within a CDATA tag as one single block.

The tag that seems to block correct segmentation is <![CDATA[herecomesthetext]]>

Does anyone have a solution for this? How can I make Studio apply the sentence-based segmentation in thos blocks as well?

Thanks in advance for your help.

Best regards,

Michael

Parents Reply
  • I was hoping that it would work correctly like that, but the problem remains. Studio simply does not segment correctly...

    Then you are doing something wrong.  I tested and it works like this for me:

    Screenshot of Trados Studio showing incorrect segmentation with multiple sentences combined in one segment.

    As opposed this:

    Screenshot of Trados Studio displaying correct segmentation with each sentence in a separate segment.

    These are the files I used:

    <?xml version="1.0" encoding="utf-8"?>
    <xliff xmlns="urn:oasis:names:tc:xliff:document:2.0" srcLang="en-US" trgLang="fr-FR" version="2.0" >
      <file id="com.liferay.journal.model.JournalArticle:1130917">
        <unit id="title">
         <segment>
          <source><![CDATA[Sentence one. Sentence two. Sentence three.]]></source>
          <target><![CDATA[Sentence one. Sentence two. Sentence three.]]></target>
         </segment>
        </unit>
      </file>
    </xliff>

    <?xml version="1.0" encoding="utf-8"?>
    <xliff xmlns="urn:oasis:names:tc:xliff:document:2.0" srcLang="en-US" trgLang="fr-FR" version="2.0" >
      <file id="com.liferay.journal.model.JournalArticle:1130917">
        <unit id="title">
         <segment>
          <source><![CDATA[Sentence one. Sentence two. Sentence three.]]></source>
         </segment>
        </unit>
      </file>
    </xliff>

    Make sure you are creating the project from scratch with the adapted file.

    I also note that your XLIFF is actually XLIFF 2.0 where CDATA is allowed, but definitely frowned upon.  It's a lazy mans XLF and difficult to handle for localization.

    Is this something Studio cannot deal with then?

    Studio can deal with it if the file is prepared correctly.  No different to any other tool that's capable of handling XLIFF 2.0... and there are not many!  In fact you may find other tools wouldn't offer you the segmentation for a source only XLF either as they will treat each translation unit as a segment.

    Paul Filkin | RWS

    Design your own training!
    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub



    Generated Image Alt-Text
    [edited by: Trados AI at 11:43 PM (GMT 0) on 28 Feb 2024]
Children