Output file Error (XML parsing error)

Hi,

It's the second time that I'm facing this error. I sent a package to a translator (there was nothing wrong with the project so far). The translator did his job and send me back the return package. The problem is that the output file couldn't open in Microsoft Word (xml parsing error). So, after searching the net, I couldn't find anything, and made the project again and pretranslated it. The project had lots of segments needed to merge, and the translator merged all that segments (I think this problem is with merging the segments with tags). So, a lot of segments hadn't translation, and I had to put them manually into the word document. It's so frustrating, so please debug it or show me a workaround.

 

Thank you.

Parents
  • Hi ,

    Are you using Perfect Match to pretranslate the file again? That may help you with the missing segments.

    On the problem itself... if you can send me the package you sent to the translator and also the package you received back then I'd be happy to take a look. Do you know exactly what the translator did with this? Did he use Studio or some other tool for this, or did he use an older version of Studio than you.... or even a newer one? What version of Studio are you using?

    If you can send the packages to pfilkin@sdl.com I can take a look.

    Regards

    Paul

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • I mailed you the files. I think the problem is with merging tag-included segments. We both use Trados Studio 2017.
  • Hi ,

    Whilst we do say we support PDF files these are a pretty tricky format and in my opinion a little bit of work should be carried out before preparing the files to make them more likely to survive a translation round trip. In this case there has been no attempt to clean up a fairly large PDF for translation at all, so the translator has left out tags all over the place, got some tags the wrong way around and had to merge to deal with poor segmentation issues. Leaving tags out, at least formatting tags, probably won't cause the problem you have but having tags the wrong way around could and so could all the merging.

    I was going to have a go at fixing the word file but the underlying xml is so large I can't even open it! So I think your best bet with files of this nature is to convert them to Word first, and then clean them up:

    - remove necessary formatting tags
    - remove incorrect hard breaks

    Then prepare your project with the Word filetype instead. This is probably going to be a lot easier for the translator as this file was a little bit like a tag soup, so they'll thank you for that, and it's more likely to survive the sort of file manipulation this one underwent, although if you did this clean up it probably would not need it.

    In the meantime I don't have a better solution that the one you have applied. Just additional work to re-translate using Perfect Match and some TM matching.

    Regards

    Paul

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Hi Paul,
    Thank you so much. It was one of my first packages, so I didn't work on the source file. Recently, I edit the PDF file, then import it in Trados Studio. I will try your method to see which one is faster and more efficient.

    Kind regards,
    Alireza
Reply Children
No Data