MultiTerm Extract won't process tmx

Hi everyone

I just installed MultiTerm Extract 2021 and have been attempting to extract terminology from bilingual files for hours with no luck.

I exported our SDLTM to TMX format using Studio 2021 and then adding them to a project in MultiTerm Extract, but received this error every time:

"Warning! There were translation units (tu) found in file (name of my TMX file) which did not have a translation unit variant (tuv) with target language "Dutch (Belgium)"

I only found 1 similar Q&A online and followed their workaround:  MultiTerm Extract 2017 just purchased - "warning!" error message on every TMX file and no extraction 

I was able to change all target tuvs from nl-BE to nl, but that did not help. Changing them to nl-NL did not help either.

I even tried to erase all code lines with nl-BE to find any other variants (which would be weird, but you never know), to no avail.

Hopefully someone at SDL can help me with this issue? Suggestions gratefully received!

emoji
Parents Reply
  •  

    ok - so you must be running this on your larger file?

    I think the error is probably due to the complexity of the regular expression, which is taking too long to evaluate.  Regular expressions that have lots of nested repetitions or non-greedy matches can be highly inefficient, especially on large texts.  You may fare better with another editor... I use EditPadPro which handles far larger files than notepad++ can work with, but even with this I don't know how well it would work with your specific file.  You'd need to test and see.

    I also don't think I know how to simplify this expression with regex alone as you'd have to start getting too specific and this would remove the benefit of the regex in the first place, so you may be better off using another method to clean up such large files.  Maybe Powershell or Python for example which can make use of some built in libraries that make it easier to manipulate XML.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Children
No Data