Appropriate locale output in Multiterm Convert

I am trying to convert a IATE TBX export (a subset) into a Multiterm 2017 termbase.

The TBX file is about 13 MB in size and the export contains EN and PT as locales. The Java applet that created the export does not have an option to turn those two top locales into what it is more appropriate gb-EN and pt-PT.

Multiterm Convert seems to handle the conversion process flawlessly but the resulting files have got the wrong locales: they're set to US English and Brazilian Portuguese.

Any ideas on how to fix this are much appreciated.

Parents
  • Hi Paulo,

    The locale is set by the TBX, so if you want to use MultiTerm Convert and not the Glossary Converter which is much easier (not sure why you don't use this) then just change the language codes in the TBX before the conversion.

    Regards

    Paul

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Hello, Paul.

    Thank you for getting back to me.

    I don't think the culprit is the TBX file (that was my initial impression as well but the IATE people confirmed that it was not the case): the TBX file has got the following locale variables: <langSet xml:lang="en"> and <langSet xml:lang="pt">. There is no reference in it to either the USA or to Brazil.

    The XTD file created by Multiterm does reference both locales explicitly.

    I did try to use Glossary Converter before logging my question and the tool creates a us-EN->p-PT term base, which is closer to the facts but still not 100% accurate. I have no problem in using the Glossary Converter, but the tool is not (correct me if I'm wrong!) officially supported by SDL and I would have to rely on the generosity of the developer for help.

    I haven't been able to find anything on the Internet or the Help files that explains why this is happening (is this caused by the TBX not mentioning explicitly that it's gb-EN, etc.?) and, more importantly, how to fix it.

    Cheers

    Paulo

Reply
  • Hello, Paul.

    Thank you for getting back to me.

    I don't think the culprit is the TBX file (that was my initial impression as well but the IATE people confirmed that it was not the case): the TBX file has got the following locale variables: <langSet xml:lang="en"> and <langSet xml:lang="pt">. There is no reference in it to either the USA or to Brazil.

    The XTD file created by Multiterm does reference both locales explicitly.

    I did try to use Glossary Converter before logging my question and the tool creates a us-EN->p-PT term base, which is closer to the facts but still not 100% accurate. I have no problem in using the Glossary Converter, but the tool is not (correct me if I'm wrong!) officially supported by SDL and I would have to rely on the generosity of the developer for help.

    I haven't been able to find anything on the Internet or the Help files that explains why this is happening (is this caused by the TBX not mentioning explicitly that it's gb-EN, etc.?) and, more importantly, how to fix it.

    Cheers

    Paulo

Children