RWS AppStore

PDF Assistant for Trados

By Trados AppStore Team

Free

Description

The application is designed to support the conversion of PDF files into DOCX so that you can improve the quality of the DOCX prior to translating it in Trados Studio. We convert prior to translation because PDF to DOCX conversion without professional editing software can sometimes cause formatting issues, resulting in a document that looks different from the original PDF. If you convert first, you can correct these issues before translating, which leads to a better user experience overall.

It's important to note that the quality of the conversion largely depends on the quality of the original PDF and the conversion software used. Some conversion tools may produce better results than others. This "Add-In" initially makes use of the Microsoft Word desktop API, providing simple text conversion and also some OCR capabilities. Whilst you could simply use Word and avoid the "Add-In" altogether, the plugin does provide more support than Microsoft makes available through Microsoft Word, in particular around OCR capability.

To learn how to use this application, please visit PDF Assistant for Trados in the RWS Community wiki.

Technical details

2.0.1.0 - Trados Studio 2024

1.1.1.0 - Trados Studio 2022 (SR1)

1.0.1.1 - Trados Studio 2022

The PDF Assistant for Trados is an Add-In for Trados Studio that supports the conversion of a PDF to a DOCX so it can be successfully translated and delivered as a DOCX target file.

Installation

The application is an sdlplugin and can be installed in two ways. You can download it from the RWS AppStore and install it manually by double-clicking the sdlplugin file in the usual way, or you can install it through the Integrated AppStore in Trados Studio. The plugin requires Microsoft Office to be installed. Testing was carried out on computers using Office 365 and not on older versions.

It's important to note that whilst this tool can do a decent job of converting most PDF files to DOCX, it is not a process that is guaranteed to work 100% of the time. Working with PDF files can be a tricky business, and it's recommended you review the section on "Working with PDFs" to understand where limitations can occur. If you cannot handle your PDF with this plugin, you may require a more sophisticated tool such as ABBYY FineReader or Adobe Acrobat Pro, which are designed specifically for working with this format.

Where is it installed?

The plugin is installed into the ribbon in the "Add-Ins" tab, in the "Toolbox" group.

Working with PDFs

The application is designed to support the conversion of PDF files into DOCX so that you can improve the quality of the DOCX prior to translating it in Trados Studio. We have taken this approach because PDF to DOCX conversion without professional editing software can sometimes cause formatting issues, resulting in a document that looks different from the original PDF.

The more common problems that can occur during PDF to DOCX conversion include:

Text and image placement: Sometimes, the text and image placement can become distorted during conversion, causing the final document to look different from the original PDF.
Formatting issues: PDFs often have complex formatting, such as columns, tables, and graphs. These elements can be difficult to convert to DOCX, leading to formatting issues in the final document.
Fonts: If the PDF contains fonts that are not installed on the computer doing the conversion, the text can appear differently in the final document.
Large files: PDF files can be very large, and converting them to DOCX can result in large files that take up a lot of storage space.
Security features: Some PDFs have security features that prevent copying and pasting, which can make it difficult to convert the document to DOCX.
OCR issues: If the PDF contains scanned images or text that was not originally digital, OCR (optical character recognition) software is needed to convert the text. However, OCR can sometimes produce errors or miss characters, leading to mistakes in the final document.
Unnecessary tags: Any of the above problems can lead to many unnecessary control tags being inserted into the DOCX, and these become visible when working with a translation tool.
Poor segmentation: Similarly, any of the above issues can lead to unnecessary hard returns being added into the DOCX, and these make translation more difficult than necessary.
Incorrect character display: If the character encoding is incorrect, characters can be displayed incorrectly in the final document. For example, some characters may appear as question marks or boxes, especially with Asian character sets.
Missing characters: In some cases, incorrect encoding can cause certain characters to be missing from the final document. This can result in text that is difficult to read or understand.
Encoding conflicts: If different parts of the document are encoded in different ways, it can cause conflicts and errors during conversion. For example, some characters may be encoded in UTF-8 while others are encoded in ASCII, leading to errors when the document is converted to DOCX or another format.
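
The three encoding problems at the end of this list can be reproduced in a few lines of Python. This is only an illustrative sketch: the sample strings and function names are made up for the example, and it says nothing about how the plugin or Microsoft Word handle encodings internally.

```python
# Hypothetical sample text; any Korean string would behave the same way.
ORIGINAL = "번역 메모리"


def question_marks(text: str) -> str:
    """Incorrect character display: the target encoding cannot
    represent the characters, so each one is replaced by '?'."""
    return text.encode("ascii", errors="replace").decode("ascii")


def missing_characters(text: str) -> str:
    """Missing characters: unsupported characters are silently dropped."""
    return text.encode("ascii", errors="ignore").decode("ascii")


def has_encoding_conflict(data: bytes) -> bool:
    """Encoding conflicts: bytes that mix encodings cannot be decoded
    as a single encoding (UTF-8 is assumed as the expected one here)."""
    try:
        data.decode("utf-8")
        return False
    except UnicodeDecodeError:
        return True


# One part of the "document" is UTF-8, another part is CP949 (a Korean code page).
mixed = "abc ".encode("utf-8") + "번역".encode("cp949")

print(question_marks(ORIGINAL))
print(repr(missing_characters("café 번역")))
print(has_encoding_conflict(mixed))
```

Seeing long runs of question marks, or words with letters missing, in a converted DOCX is usually a sign of one of these situations rather than a problem with the translation tool.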

Using the "Add-in"

Adding your files

The PDF Assistant for Trados is started by clicking on the icon in the ribbon. This opens up a small wizard where you can add your files.

You can add as many files as you like, in as many languages as you like, but keep in mind the process could take a considerable amount of time and may even run out of memory if you ask for too much. How many files you can process really depends on the number of pages, the number of images in each file, the amount of OCR work required, and so on. Think about the work you are about to carry out and don't expect miracles!

The files or folders can be added via drag and drop, or by using the small icons in the wizard. In this example two PDF files have been added. The first is an English-language text containing two images, one that needs to be OCR'd and one that does not. The second is a Korean document that is non-readable, so the entire content is one big image in the PDF.

Selecting your Provider and OCR options

This screen allows you to do several things:

Select the PDF Assistant you wish to use. For now there is only Microsoft Word to select from.
Check the option to specify whether or not you wish to extract text from the images and, if so (in the next screens), which ones you would like to be processed (OCR'd).
Keep in mind that when you OCR the images you will lose any background image that was there and will only have the text that the software was able to extract.

You can cancel the process at any time if the file is too complex for the application to manage.
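
For readers curious what a Word-based conversion involves, the following is a rough sketch of driving the Microsoft Word desktop API from Python. It is not the plugin's code. It assumes Windows, an installed copy of Word, and the third-party pywin32 package; the function names are hypothetical, and Word may still show a conversion prompt depending on its settings. It also does none of the image extraction or OCR work that the wizard offers.

```python
from pathlib import Path

# Numeric values of Word's own enumerations (WdSaveFormat, WdSaveOptions, WdAlertLevel).
WD_FORMAT_XML_DOCUMENT = 12   # save as .docx
WD_DO_NOT_SAVE_CHANGES = 0
WD_ALERTS_NONE = 0


def docx_target(pdf: Path) -> Path:
    """Return the path the converted file would be written to (same folder, .docx)."""
    return pdf.with_suffix(".docx")


def convert_with_word(pdf: Path) -> Path:
    """Open a PDF in Word and save it as DOCX. Windows only."""
    import win32com.client  # assumption: pywin32 is installed

    out = docx_target(pdf)
    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    word.DisplayAlerts = WD_ALERTS_NONE
    try:
        # Arguments: FileName, ConfirmConversions, ReadOnly
        doc = word.Documents.Open(str(pdf.resolve()), False, True)
        try:
            doc.SaveAs2(str(out.resolve()), WD_FORMAT_XML_DOCUMENT)
        finally:
            doc.Close(WD_DO_NOT_SAVE_CHANGES)
    finally:
        word.Quit()
    return out


if __name__ == "__main__":
    print(docx_target(Path("example.pdf")))
```

Calling convert_with_word(Path("example.pdf")) on a suitable machine would write example.docx next to the PDF. Complex or scanned files can take a long time, which is why the wizard lets you cancel.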

Image Selection

This part of the wizard will extract the images the software was able to identify and allow you to specify which of the images contain translatable text.

Summary Stage

This stage of the wizard displays a summary of the options you have chosen for the conversion.

Preparation

The final stage provides an indication of the progress until the conversion has completed:

DTP the converted files before Translation

Now you can open your converted PDF files as a DOCX in Microsoft Word and improve the quality of the file before you translate it. This way the target file will probably be ready to go, or at least require minimal editing to accommodate changes required as a result of text expansion/contraction in the target language.

A good tool for tidying up files resulting from a messy PDF conversion is TransTools, available at https://www.translatortools.net/products/transtools
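
To give an idea of what this kind of tidy-up involves, here is a minimal sketch of one common step: removing unnecessary hard returns so that each sentence ends up on one line and segments properly. It works on plain text only and uses a deliberately simple heuristic (a line that does not end in sentence-ending punctuation is assumed to continue on the next line). The function name is hypothetical, and this is not how TransTools or the plugin work.

```python
SENTENCE_END = (".", "!", "?", ":", ";")


def join_hard_returns(text: str) -> str:
    """Merge lines that were broken mid-sentence by a PDF conversion.

    Blank lines are treated as real paragraph breaks and are never joined across.
    """
    paragraphs = []
    buffer = ""
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            # A blank line closes whatever paragraph is in progress.
            if buffer:
                paragraphs.append(buffer)
                buffer = ""
            continue
        buffer = f"{buffer} {line}" if buffer else line
        if line.endswith(SENTENCE_END):
            paragraphs.append(buffer)
            buffer = ""
    if buffer:
        paragraphs.append(buffer)
    return "\n".join(paragraphs)


sample = (
    "The converter often breaks\n"
    "sentences in the middle\n"
    "of a line.\n"
    "The next sentence is fine."
)
print(join_hard_returns(sample))
```

A heuristic like this will get headings and list items wrong, since they rarely end in punctuation, so the result still needs a visual check in Word before translation.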

In the example files, the English file contained two images, one that was OCR'd and the other treated as an image. The result isn't bad (PDF on the left, converted DOCX on the right). If you were to open this PDF file in Microsoft Word directly, both images would be handled as images, so the "Add-In" does provide considerable value here. The table needs tidying up, but it is editable and could save time when more extensive text is involved:

For the Korean non-readable PDF, some formatting would be required, but it's not too bad. The image is floating and can be positioned wherever I like, and all the text is available to me for translation. With a small amount of DTP work I'll have a file that is easily translatable, and the target file should be good with minimal work required:

Check out other plugins from this developer:

Free

AI Professional

By Trados AppStore Team

The AI Professional plugin for Trados Studio 2022 leverages language models from both Azure and OpenAI to assist users in translation projects. Key features include a Translation Provider, an AI companion that is available from the Editor, and Terminology-aware Translation Suggestions. The plugin supports Azure OpenAI models alongside OpenAI models (the same models used by ChatGPT), each with different capabilities and performance levels. Users can create custom prompts to guide the AI, and the plugin offers a few default prompts to get you started. The AI Professional plugin can be installed via the RWS AppStore or through the integrated AppStore in Trados Studio. To use the plugin, users must sign up for an OpenAI account, obtain an API key, and specify the desired model. More details about AI Professional can be found in this blog article.

Free

ASS File Type

By Trados AppStore Team

File type support for the ASS/SSA (Advanced SubStation Alpha) format used in subtitling. Can work alongside the Studio Subtitling plugin that is available on the RWS AppStore for enhanced context when translating, editing, or proofing.

Free

Amazon Machine Translation Provider for RWS Language Cloud

By Trados AppStore Team

This add-on enables users working in the cloud (through Trados Studio, Trados Team or Trados Enterprise) to receive machine translation results from the Amazon Translate Service (AWS). You can directly install the Amazon Machine Translation Provider from within your cloud account. Simply select your account icon in the top right-hand corner of the screen > RWS AppStore. Please note: You will need to purchase a subscription through AWS and you'll need to create an Amazon account. Pricing for AWS is available through this Amazon website.

Free

Amazon Translate MT provider

By Trados AppStore Team

This Amazon Translate MT provider allows you to retrieve translations from the Amazon Translate Service (AWS). To use this application, you will need to create an account with AWS. For more information on how to set up the plugin and create an account, please check the Documentation tab.

Free

Antidote Verifier

By Trados AppStore Team

Antidote is a spelling and grammar checking tool from Druide informatique inc. that integrates with MS Word and a variety of other applications. This free plugin adds the Antidote toolbar you'll find in MS Word into Studio under the review ribbon, so you can work with this application interactively as you translate or review. The Antidote Verifier plugin is free, but to use it you must have a copy of Antidote installed, otherwise nothing is going to happen! For more details on how the plugin works with Studio, refer to this article: https://multifarious.filkin.com/2016/09/08/antidote/. For further information about Antidote, refer to this website: http://www.antidote.info/.

Free

Apply Studio Project Template

By Trados AppStore Team

Apply Studio Project Template allows you to apply settings from a template (.sdltpl) or project (.sdlproj) to one or more projects. The following settings can be applied:
- Translation Memory and Automated Translation*
- Translation Memory
- Terminology*
- Batch Processing
- Verification
- File Types
* It's possible to merge the lists of translation and terminology providers.
The settings can be applied to either the active project or all selected projects in the Projects view. Once installed, you will be able to see the option to Apply Studio Project Template by right-clicking on a project in the Projects view. You can also open the plugin in Studio by pressing Ctrl + Alt + T. To learn how to use this application, please check the Documentation tab.

Free

AutoHotKey Manager

By Trados AppStore Team

The AutoHotKey Manager provides a simple way to manage and share AutoHotKey scripts from within Trados Studio. AutoHotKey is a free scripting tool which you must have installed to be able to use the plugin effectively; you can get this from here. If you're not familiar with AutoHotKey and how it can be used, then these articles might be useful:
- AutoHotkey scripts for translators
- AutoCorrect… for everything!
In time, we hope there will also be scripts written by others which can be added to the AppStore to give users a head start and allow them to curate their own favourites using the plugin. Finally, there is an AutoHotKey forum in the RWS Community where you can ask any questions about writing your own scripts or adapting scripts from others. To learn how to use this application, please check the Documentation tab.

Free

Change Scaling Behaviour

By Trados AppStore Team

Note: This application is no longer maintained or supported, as it has been removed from our development scope.
Many new computers today can provide a high resolution that offers many advantages, such as:
- Optimized usability and readability of applications on high-DPI displays
- Better experience for multi-display systems
- Possibility for developers to optimize app-specific scaling based on display DPI
Whilst RWS is working on enhancing support for these high-resolution environments, it is a work in progress, and some of the menus and screens can often appear very crowded and difficult to read. The solution to date is a KB article that offers several solutions, the last of which is a fix in the registry. This application automates the fix known as "Workaround 3" in the KB article.

Free

CleanUp Tasks

By Trados AppStore Team

Modify source/target, lock segments and delete tags with custom batch tasks. You can find an excellent explanation of how to use this tool on the developer's website.