MultiTerm Extract won't process tmx

Hi everyone

I just installed MultiTerm Extract 2021 and have been attempting to extract terminology from bilingual files for hours with no luck.

I exported our SDLTM to TMX format using Studio 2021 and then adding them to a project in MultiTerm Extract, but received this error every time:

"Warning! There were translation units (tu) found in file (name of my TMX file) which did not have a translation unit variant (tuv) with target language "Dutch (Belgium)"

I only found 1 similar Q&A online and followed their workaround:  MultiTerm Extract 2017 just purchased - "warning!" error message on every TMX file and no extraction 

I was able to change all target tuvs from nl-BE to nl, but that did not help. Changing them to nl-NL did not help either.

I even tried to erase all code lines with nl-BE to find any other variants (which would be weird, but you never know), to no avail.

Hopefully someone at SDL can help me with this issue? Suggestions gratefully received!

emoji
Parents Reply
  •  

    Thanks for the file... found some time to look at it this evening.  I get the same error you mentioned, but the application still extracts the terminology:

    screenshot showing MultiTerm Extract results having extracted 79 terms

    So, I split the file and kept splitting it until I found the problem.  It relates to the segment contents rather than the language codes so this was a misleading error message.  The segment (<seg>) for both the fr-BE and nl-BE languages contains placeholders (<ph>, <bpt>, and <ept>) but no actual translatable content in one single <tu>. The placeholders refer to certain tag formats or structures that were present in the original document.  If you're extracting terminology, the actual terms or sentences should be inside the <seg> tags and since this file you provided has a single TU with only placeholders and no content Extract seems to get a bit upset!

    This is the offending translation unit variant:

          <tuv xml:lang="fr-BE">
            <seg><bpt i="1" type="110" x="1" /><ph x="2" type="114" /><ept i="1" /></seg>
          </tuv>
          <tuv xml:lang="nl-BE">
            <seg><bpt i="1" type="110" x="1" /><ph x="2" type="114" /><ept i="1" /></seg>
          </tuv>

    If you search for that and delete the entire translation unit and then try again the error is not there.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Children