PDF Issue: Layout of Translated File & Recognition of Source Text

Former Member
Former Member

Hi all,

I am trying to translate an original (editable) PDF on Trados from English into Greek. However, two major issues arise. First, many of the characters of the source text are not recognized properly in Trados when the file is imported. As a result, the source text on my Editor appears as gibberish in many segments. For example, the character x is replaced with - while w is replaced with x.

Screenshot of Trados Studio Editor showing text recognition errors, with characters like 'x' replaced with '-' and 'w' replaced with 'x'.

An example of bad text recognition

Second, when I generate the translation, the translated file is totally ruined in terms of layout. For example, much of the text of the 1st page has been moved to the 2nd page, the segmentation has been ruined, and the spacing is terrible (see images below).

Screenshot of the original PDF layout with proper text alignment and spacing on the first page.

The layout of the source text

Screenshot of the translated PDF layout on the first page with text misalignment and spacing issues.

The layout of the target file (1st page)

Screenshot of the translated PDF layout on the second page showing text overflow from the first page and disrupted segmentation.

The layout of the target file (2nd page)

In general, I'm always facing layout issues when it comes to translating PDF files. What would be your suggestion on preventing these issues, if possible?

Kind regards,

Christos



Generated Image Alt-Text
[edited by: Trados AI at 8:39 PM (GMT 0) on 28 Feb 2024]
emoji
Parents
  • The suggestion is simple - translate the source document from which the PDF was created, not the PDF.

    Unfortunately, clients are clueless and do not know that PDF is NOT "just another document format" which can be freely edited.
    And it's even more unfortunate that many (majority?) translators do not know this either...

    PDF was invented as consume-only (i.e. read-only, print-only) format and was NEVER intended to be editable or otherwise processable.

    Therefore, if client wants to have a PDF localized, the only correct process is to localize the original format used to create the PDF (which can be Word, InDesign, Quark, or whatever else), NOT the PDF.
    Period.

Reply
  • The suggestion is simple - translate the source document from which the PDF was created, not the PDF.

    Unfortunately, clients are clueless and do not know that PDF is NOT "just another document format" which can be freely edited.
    And it's even more unfortunate that many (majority?) translators do not know this either...

    PDF was invented as consume-only (i.e. read-only, print-only) format and was NEVER intended to be editable or otherwise processable.

    Therefore, if client wants to have a PDF localized, the only correct process is to localize the original format used to create the PDF (which can be Word, InDesign, Quark, or whatever else), NOT the PDF.
    Period.

Children