Translating PDF docs

Dear fellow Translators,

A tricky question: I have PDF docs to translate whose quality is just great (I had hard copies and scanned them myself). However, when I open one of them in Trados, the letters get all messed up. I do have IRIS, and yes, I uninstalled it and installed it again, but nothing changed.

Could anybody please help? 

Thank you,

Rosaria 

  • Hi,

    I'd like to mention that IRIS is not the panacea for all PDFs (multifarious.filkin.com/.../), even if it does quite a good job in general. PDFs are a problematic format because there is so much that can go wrong with them. You mentioned you scanned them yourself... I don't know how knowledgeable you are in this area, so forgive me for the basic questions, but:

    - did you scan as text or as an image? Iris will handle both but obviously one will be better than the other.
    - what software did you use for the scanning? If it was decent OCR software, why did you save to PDF when Word would probably have been a better choice?

    There are probably a lot of things to write about this but let's start with the two questions above.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub
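
The "text or image" question above can be checked roughly before a file ever reaches Trados. The following is only a minimal sketch, not anything IRIS or Studio does: it assumes that a PDF whose raw bytes mention image objects but no font objects is probably an image-only scan. The function name is made up for illustration, and the heuristic can misfire on PDFs that store their font dictionaries in compressed object streams.

```python
import re


def looks_like_image_only_pdf(data: bytes) -> bool:
    """Rough heuristic: True if the raw PDF bytes contain image objects but no fonts.

    Assumption (not taken from the thread): a PDF with a real text layer
    references at least one /Font object, while a pure scan only embeds
    /Subtype /Image objects.
    """
    has_font = re.search(rb"/Font\b", data) is not None
    has_image = re.search(rb"/Subtype\s*/Image\b", data) is not None
    return has_image and not has_font
```

To try it on a real file, read the bytes first, for example with `pathlib.Path("scan.pdf").read_bytes()` (a hypothetical file name). A result of `True` suggests the scan was saved as a picture and needs OCR before translation.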

  • Hello Paul,

    thank you for your reply. It took me some time to get back to you because I ran some tests with the Samsung technicians (my printer's support).

    - I do scan my docs as text
    - I use "Easy Document Creator"

    Together with the technician we tried changing between several formats, yet the result is always the same: the docs are scanned properly, yet when saved as PDF they cannot be read in Trados.

    Workaround: I convert them into Word and only then import them into Trados. Ergo, the PDFs themselves still won't be recognised by Trados.

    Would you have a solution?
  • Hi,

    Can I see one of your problematic PDF files? You can email it to pfilkin@sdl.com

    Paul Filkin | RWS Group


  • Before you send any files, just tell us which Studio version you are using: some older versions of Studio cannot convert "graphics" PDFs, and your PDFs are graphic files. That said, in my opinion the approach you are taking will never fully satisfy you, as converting PDFs - especially scanned PDFs like yours - is always a very tricky job. To achieve good results, I would use separate OCR software (such as Abbyy FineReader), which can give you better results than any automated conversion, provided you take the time and are willing to do some manual work in the software. In my own experience (since 1990) there never was, is not, and probably never will be any software that converts a PDF to Word automatically in such a way that no corrections are necessary. However, when you add the extra step of a manual PDF-to-Word conversion, the results with good-quality scanned PDFs will be great.

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Hello Jerzy, I have 2017, the latest version. Yes, "manual conversion", PDF2Word via Adobe PDF Convertor, that's what I'm doing...
    The thing is, I work for a client that can only give me hard copies, so the problem will persist.
  • The problem with scanned PDFs will persist unless you use a decent OCR solution with a manual process. An automated conversion will never give you the results you need for smooth translation. So instead of trusting any software (including Studio) to convert a PDF, do it with OCR software such as Abbyy FineReader. In my opinion this is one of the best, if not the best, OCR solutions available on the market.

    As far as I understand, what Adobe is doing is creating a Word file out of your scanned files without recognizing the text inside. So what you get is a Word file with images. Am I right? A Word file with images is even worse than a scanned PDF, as Studio treats it like any other Word file: the images are simply treated as images, with no translatable text in them.
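
The "Word with images" suspicion above can be tested without opening Word. This is a hedged sketch only: it relies on the fact that a .docx file is a ZIP archive whose main text lives in `word/document.xml` and whose embedded pictures sit under `word/media/`. The function name and the returned keys are invented for illustration, and the character count is approximate.

```python
import re
import zipfile


def summarize_docx(source) -> dict:
    """Count visible text characters and embedded media files in a .docx.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        xml = archive.read("word/document.xml").decode("utf-8")
        media = [n for n in archive.namelist() if n.startswith("word/media/")]
    # Text runs are stored in <w:t> elements; <w:tab/>, <w:tbl> etc. are not matched.
    runs = re.findall(r"<w:t(?:\s[^>]*)?>([^<]*)</w:t>", xml)
    text_chars = sum(len(run.strip()) for run in runs)
    return {"text_chars": text_chars, "images": len(media)}
```

If the converted file reports many images and almost no text characters, the conversion produced pictures of pages rather than recognized text, and Studio will find nothing to translate in it.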

