Formatting problem

Good afternoon,

I am new in SDL Trados Studio and become frustrating about the formatting.

I have received the same document in Word and in Pdf and I need to translate it. The document is a manual and has a lot of pictures it means quite complicate formatting.

I uploaded the document in Word without problems and while I translate the text I wanted to see the "Preview". I noticed that in there is a big mess in formatting. Mostly the pictures were placed in the other pages or in other places on the right page.
So I tried to upload in Pdf and set it well for this type of document but still the same result - formatting was wrong.

I thought that if I upload the document in Word it will not be a problem to translate it into Word again without any change in the formatting of the document.

How can I resolve this?

Thank you very much in advance!

  • Hi ,

    The only way to resolve this is to clean up the word version after converting the PDF. PDF conversions are difficult and sometimes the quality of the conversion will be bad. So it’s always better to convert to Word first, clean up the word file and then translate the cleaned word file.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Hi , hi ,

    Zuzana already received a Word version from the client, as well as a PDF. She tried processing the PDF because the Word version didn't show with the images in the right place in 'Preview'.

    Zuzana, images not showing up in the correct position in Preview doesn't mean that the final Target document will also have the images incorrectly placed.

    The best way to test this is to create a Target file from the SDLXLIFF via File > Options > Save Target As.

    Then, if the Target Word file fails to produce or is not correctly formatted, you can go back to the Source Word file and see what needs to be smoothed out so that the a 'Target' file is successfully produced (obviously, still in the Source language if it has not been translated but the test is still valid).

    Always the first thing to do with a complex Word file, test that it will save Target As before translation. This way you can identify and deal with whatever might cause a failure before translation.

    All the best,

    Ali

  • Thank you, Ali, for your answer. I tried to save it to the Target file document as you said - create a Target file from the SDLXLIFF via File > Options > Save Target As. The error with the pictures is still there. Could you please give me more detail advice how to smooth out the Source Word file?
  • First of all: which view do you use in Word? The print layout or other view? You can only check the layout in the print view, all other views are not suitable for checking the layout.
    You have translated the document - so the amount of text has changed. This must move elements in Word. If your language is longer than the source, simply try this:
    Open in Word, press CTRL+A and then CTRL+D, go to the "Advanced" tab. Set the scaling to 98% and the spacing to -0,2 points. Then press OK. This will make your text visually "shorter".
    If that is not enough, replace the font size by a font size smaller by one point, ie. change 11 points to 10 points and so on. You can easily use search & replace in Word to do that. After that your format should be OK again.
    If your text is shorter, scale it to 101% and space to 0,1 point. Also replace font size making it bigger by 0,5 point (if not enough, take 1 point).

    But in the end of the day this is NOT Studio, what does influence the format here. A PDF converted document will - depending on how it has been converted - typically create formatting problems, so it is necessary to learn how to use Word formatting. This is however not the scope of this forum.

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Thank you, Jerzy.

    I understand your advice on the length of translation, actually, you are right, the target language is 'longer' than the source language. However, I tried just to upload the Word document to SDL Trados Studio, save it in the Studio format and after that just save it again as the Word document. Without putting a target word. Just to see if there will be any changes after this process. And it was there - the mess. Probably you are right and it is just badly formatted in the source Word document.

    Thank you for your reply.
  • Is your Word document stored as doc or docx? A doc Word will be first converted to DOCX before processing - and alone this conversion will change the formatting. If you are using Studio 2017 and the file type used is WordprocessingML, then the target format will be what Office 365 delivers, while the source format might not be Office 365 - this will also change the formatting.
    So in Word, make sure the file is DOCX and when opened does not show "compatibility mode". Store as DOCX from Word and make sure no format changing occurs. Then you can also check, if using the Word 2007-2013 file type instead of Word 2007-2016 does change the document less.
    In the end of the day the problem is related to the way the PDF file has been converted. From what you wrote here, you will find several text boxes and dozens of section changes in Word - which, when only touched, do change the appearance of the whole document. Most probably converting the PDF anew with a decent OCR software will be the best solution here, as the Word you have seems to be crap in all means...

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Hi 

    Sorry I didn't reply sooner, I am ill... Only sinusitis but I stayed in bed this morning. Fortunately I see that has given you an excellent explanation of how various properties of a Word file that can cause displacement of content.

    I'm not sure if he knows that the Word file you are working on wasn't made from a PDF file, that you were supplied with both by the client. However, everything he says is correct and valid.

    SAs he says, save the original Word file as .docx if it is currently .doc and that will prevent some of the problems.

    You could also create a brand new .docx file then highlight all of the original Word document content and paste it into the new .docx file. If it is a huge document, this will take a while but it might leave some properties behind that cause misalignment. (You may have to re-paste the translated content into the original document if it leaves content such as headers and footers behind).

    Then before converting to sdlxliff, go through the Word file to see if the formatting is still correctly aligned with the images, and that all the necessary content is present.

    There may be hidden content that is causing further problems. You can see some of this by hitting the paragraph icon in the Home tab:

    Other content can be present but totally hidden from the 'front end'. Usually it is necessary content (sometimes you will see these items displayed as tags in the SDLXLIFF). There is a trick I used to perform in a different format, Interleaf Quicksilver, that smoothed out all unnecessary 'tag' content. I would highlight the whole content of the document then make a change that affected nothing that needed to stay the same, so I would change the language for example, then repeat the process changing the language back to what it should be. This somehow removed any extra formatting information that was not required. 

    , is there a way that this can be performed in Word, do you think? It might help...

    It is common in Word for images or other content to 'move' because of the differences between the 'before' and the 'after' text. Also, some content won't move when it should because it has been 'anchored'. Also there is sometimes imported content such as spreadsheets or embedded graphics that is either inaccessible or has to be manually translated in the Target Word document. As says, this is down to how the Word document has been compiled and is not a fault in Studio.

    From all of this, you may not be surprised that many people complain about Word-based sdlxliffs that will not save as Target. The best advice is to always do as much as you can to make sure the Word file will save as Target but BEFORE translation.

    I hope I haven't confused things, there's a lot of information here! One final thing... some translation agencies tend to employ someone in-house to deal with all this kind of layout work after translation of any word processing or desktop publishing formats. Thus, your client may not be expecting a perfect Word document from you. Do they want the Word document back or the SDLXLIFF?

    All the best,

    Ali

     

  • Dear Ali

    , is there a way that this can be performed in Word, do you think? It might help...

    You mean "normalizing" of the document?

    Provided one single font has been used in Word and - which is quite lucky case - all the text is in black, you can first adapt the Standard or Normal style (the name depends on the language version of Word) to fit the "body text", for example if a simple paragraph of text is Times New Roman 12 point, then adapt the style to TNR 12 point and set the color there to automatic. Then press CTRL+A and CTRL+D. On the first tab select "Times New Roman" as font and chose the Automatic color. Press ok. Then press CTRL+A and CTRL+D again to make sure all document content has been selected, switch to the Advanced tab and set the scaling to either 100% (your translation is not longer) or something between 100 and 98%, if your translation is longer. You can also squeeze the text by changing the spacing to -0,01 points (nearly no one will notice) or -0,02 (some will notice). This can reduce the text so far, that the changes due to the text length will nearly disappear completely.

    However, in this very case, the problems seems to be caused by compatibility problems - as Zuzana pointed out, even an untranslated document, stored from Studio, does change the layout. So it seems, that the conversion into SDLXLIFF and the back to Word switches the format from one Word version to another. This can have a very big influence on the layout, especially if the original document is crappy formatted.

    @- if you wish, send me the document offline (jerzy at czopik dot com) for some investigation.

    Best regards, Jerzy

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.