Studio not auto-localizing tokens

Juan Carlos Munoz Morano over 6 years ago

Hi all,

The point here is that when I pre-translate with an empty TM a test file that contains some tokens such us figures, acronyms (three letter currencies) and dates only figures are auto-localized even though the settings are configured to recognize all tokens (including acronyms and dates) in both my TM settings and the auto-substitution settings of the target language pair. Is there a way to also auto-localize the tokens that I have mentioned?

The practical case would be a huge file that has a lot of repetitions. In this case, extracting an Unknown Segments file using an empty TM would be very useful to get rid of all those repetitions. The problem arise when Studio only extracts one of those TUs containing only tokens (such us dates or acronyms) since the rest are treated as repetitions but then, if you try to pre-translate the source file with the translated Unknown Segments file all those tokens are not automatically localized and need manual fixing.

Many thanks in advance to any helpful idea!

Kind regards,

Carlos

Translate

Rate translation

Suggest better translation

Moderator UI

Thread Subject & Description
Studio not auto-localizing tokens Hi all, The point here is that when I pre-translate with an empty TM a test file that contains some tokens such us figures, acronyms (three letter currencies) and dates only figures are auto-localized even though the settings are configured to recognize all tokens (including acronyms and dates) in both my TM settings and the auto-substitution settings of the target language pair. Is there a way to also auto-localize the tokens that I have mentioned? The practical case would be a huge file that has a lot of repetitions. In this case, extracting an Unknown Segments file using an empty TM would be very useful to get rid of all those repetitions. The problem arise when Studio only extracts one of those TUs containing only tokens (such us dates or acronyms) since the rest are treated as repetitions but then, if you try to pre-translate the source file with the translated Unknown Segments file all those tokens are not automatically localized and need manual fixing. Many thanks in advance to any helpful idea! Kind regards, Carlos
Get AI Suggestion

AI Reply

Accept answer Reject Answer

Parents

0 Paul Filkin over 6 years ago

Juan Carlos Munoz Morano

How about an example file so we can see what you're working with?

Juan Carlos Munoz Morano said:
The practical case would be a huge file that has a lot of repetitions.

I wouldn't use unknown segments, although you could. Probably easier to simply filter on them, handle them all in one go, change status to translated and then lock them or hide them.

Paul Filkin | RWS

Design your own training!
You've done the courses and still need to go a little further, or still not clear?
Tell us what you need in our Community Solutions Hub
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Reject Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate
0 Juan Carlos Munoz Morano over 6 years ago in reply to Paul Filkin
Hi Paul,

Thanks for your prompt answer!

Attached you have the test files that I used for this. I have already extracted the Unknown Segments file, translated it and populated the TM with it. Please let me know with your findings!

The reason why I use the Unknown Segments file:

When working with hundreds of even thousands of files this option provides a lot of efficiency, since you only have to work with one file.

When extracting Unknown Segments the file size also decreases significantly (also for package creation)

Studio also works more smoothly when dealing with just one file.

Another option would be running two rounds of pre-translation of the source files. One with an empty TM so all the tokens are auto-localized and another one with the translated Unknown Segments file. However, I still face the same issue here, since not all the tokens are auto-localized even with the empty TM.

The option of not recognizing tokens in the TM used to extract the Unknown Segments has been also considered, however, the word count increases significantly since all the tokens would be included as common text and we would losing the opportunity of this auto-localization that Studio offers.

What am I missing? Maybe some settings that I did not pay attention to?

Thank you very much in advance

Carlos

SDL Unknown Segments Test files.zip
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate

Reply

0 Juan Carlos Munoz Morano over 6 years ago in reply to Paul Filkin
Hi Paul,

Thanks for your prompt answer!

Attached you have the test files that I used for this. I have already extracted the Unknown Segments file, translated it and populated the TM with it. Please let me know with your findings!

The reason why I use the Unknown Segments file:

When working with hundreds of even thousands of files this option provides a lot of efficiency, since you only have to work with one file.

When extracting Unknown Segments the file size also decreases significantly (also for package creation)

Studio also works more smoothly when dealing with just one file.

Another option would be running two rounds of pre-translation of the source files. One with an empty TM so all the tokens are auto-localized and another one with the translated Unknown Segments file. However, I still face the same issue here, since not all the tokens are auto-localized even with the empty TM.

The option of not recognizing tokens in the TM used to extract the Unknown Segments has been also considered, however, the word count increases significantly since all the tokens would be included as common text and we would losing the opportunity of this auto-localization that Studio offers.

What am I missing? Maybe some settings that I did not pay attention to?

Thank you very much in advance

Carlos

SDL Unknown Segments Test files.zip
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate

Children

0 Paul Filkin over 6 years ago in reply to Juan Carlos Munoz Morano

Juan Carlos Munoz Morano

Thanks for the file. I cleaned your TM a little so I had no numbers, or acronyms, in there and ran a pre-translation. This is what I get:

The numbers are recognised as expected, but the acronyms are not. The reason for this is that we can't be sure whether an acronym remains the same in pre-translation or not. So it's only available for interactive translation where the user can make the choice.

I guess if you think this should be possible as an option you should raise this here and see whether you get any support:

http://ideas.sdl.com

Paul Filkin | RWS

Design your own training!
You've done the courses and still need to go a little further, or still not clear?
Tell us what you need in our Community Solutions Hub

Generated Image Alt-Text
[edited by: Trados AI at 8:50 PM (GMT 0) on 28 Feb 2024]
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Reject Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate
0 Juan Carlos Munoz Morano over 6 years ago in reply to Paul Filkin

Hi Paul,

Thank you very much for your feedback.

The conclusion is then that, when using this Unknown Segments approach, these acronyms should not be recognized in the TM so that they are fully auto-translated once the Unknown Segments file is translated (meaning this the mentioned increase of word to be translated) or manually fixing them (with regex, for example) since they should be just copied to target. Meaning this that there is no other way to automatically localized them.

Thank you for your time

Kind regards,

Carlos
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate
0 Paul Filkin over 6 years ago in reply to Juan Carlos Munoz Morano

Juan Carlos Munoz Morano

Juan Carlos Munoz Morano said:
Meaning this that there is no other way to automatically localized them.

Also... once in your TM they'll be picked up so I guess over time you'll start to incorporate the sort of acronyms you handle and then they'll be pre-translated in future files.

Paul Filkin | RWS

Design your own training!
You've done the courses and still need to go a little further, or still not clear?
Tell us what you need in our Community Solutions Hub
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Share
Documentation Survey: help us offer you better documentation! Translate

Trados Studio > 1. Trados Studio

Studio not auto-localizing tokens