How to open these XLIFF files?

Dear Team,

I tried using the default XLIFF filter, but I keep getting various errors, the latest are "value cannot be null. parameter name culture" and "There is an error in XML document (1, 2)"

Any idea how I could make this work? Attached two versions of the same file, XLIFF 1.2 and 2.0.

Thank you in advance for your help!

Best regards,

Giannis

4403.Lina-video-subtitles-4190_2.0.zip

emoji
Parents Reply Children
  • Hi ,

    Did you maybe take a look at this? I wasn't able to make the multilingual xml work for these XLIFFs.

    emoji
  •  

    Apologies for the delay... didn't have a lot of time for forum work in the last weeks.

    I took a look... the 2.0 works just fine with these settings:

    Trados Studio Language Mapping settings window showing Languages Root field with XPath query 'xlifffileunitsegment' and languages listed as English (United Kingdom) source and French (France) target.

    The languages root needs this:

    /xliff/file//unit/segment

    Note the double slash between file and unit which is needed because you have a bit of mess in this file with multiple levels of nesting for the group element.  This expression starts at the root (/xliff/file) and uses the // operator to search for any unit element at any level of nesting within group elements, and then selects the segment child element.

    I didn't test the other one.. I guess you only need one to work for you!

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 11:03 AM (GMT 0) on 29 Feb 2024]
  • Thank you,  

    I guess since the source has just <source> and <target>, I would need to change the target language every time, right?

    Also, I tried converting <div> <p> <br> etc. into tags. First, I disabled entity conversion, and created placeholder &lt;\w+&gt;

    However, this did not convert the content of &lt; and &gt; into tags, just them separately.

    Any thoughts? I thought the HTML embedded content would help with this, but it didn't.

    Screenshot of Trados Studio showing a comparison of source and target text with HTML tags visible. No errors or warnings are indicated.

    Screenshot of Trados Studio with highlighted differences between source and target text. The target text contains HTML tags and placeholders. No errors or warnings are shown.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 11:03 AM (GMT 0) on 29 Feb 2024]
  •  

    I guess since the source has just <source> and <target>, I would need to change the target language every time, right?

    Right.

    I thought the HTML embedded content would help with this, but it didn't.

    It should do... perhaps you didn't apply it correctly?  I created a quick video to explain so you can see it works and how I did it:

    I hope this helps.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Hi  

    Thank you so much for your detailed instructions! Actually, I was using the same settings (all embedded HTML), but it still didn't work in Studio 2021. Then I did the same process in Studio 2022, and it was finally fine!

    One issue still remains: the client had asked for whatever reason to preserve the elements below in the target, but my export has completely removed them.
    Is there a way to maintain them?

    Screenshot of Trados Studio showing a comparison of source and target text. The source text is highlighted with a red box indicating an error in the segment.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 11:04 AM (GMT 0) on 29 Feb 2024]
  •  

    It's not missing at all, it's just been converted to what it represents, a carriage return.  The HTML entity &#13; represents the ASCII control character Carriage Return (CR).  In ASCII, the Carriage Return character is defined as decimal 13 (or hexadecimal 0D), which is why &#13; is used to represent it in HTML.

    So you have this:

    Screenshot of Trados Studio showing HTML source code with carriage return entities represented as 
 followed by the number 10 and another carriage return.

    Which is two carriage returns, the number 10 and another carriage return.  Your target file shows this:

    Screenshot of Trados Studio displaying the target file output with visible line breaks and the number 10 between them.

    Looks remarkably like two carriable returns, the number 10 and a carriage return.

    If I open the file with the filetype I created in the video and look at that part of the file using the "All content" display filter I can also see them there:

    Screenshot of Trados Studio's 'All content' display filter showing two carriage returns, the number 10, and another carriage return in the file.

    Unfortunately I don't think you are going to be able to address this within the filetype because Studio automatically handles them and doesn't allow you to have any control over how the whitespace like this is managed.

    The multilingual filetype at least retains then in the source, compared to the xml filetype which will not.  I think the best solution is to ask your customer to simply use whitespace and then you can tell Studio to preserve the whitespace.  Using entity values is really problematic and most XML processors (especially .NET) will automatically encode the &#13; entities as carriage returns when writing XML.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 11:04 AM (GMT 0) on 29 Feb 2024]