Can a Word List be imported into the QA Checker?

Is it possible to import a word list into the Studio QA Checker Word List?

It would be convenient to be able to have different word lists giving the wrong and correct forms of words, in Excel format, for instance, and import a different list depending on the project. This would be good, for instance, for projects in sectors that have their own jargon, or different versions of a language that have variations in word spelling or usage.

I have considered having a separate QA checker profile for each language version, but between US and UK English, for instance, there can be a lot of different spellings and/or usages, and Studio does not flag anywhere near all of them. Entering dozens of individual wrong/correct spellings can be so time-consuming!

  • Hello Irene,

    This is a good question, and the answer is yes, but not in a very friendly way.  First of all it'll be worth you looking at this article which describes the process you'll need to use, albeit the article is about autocorrect lists and not verification wordlists:

    Now, to do the same process in a wordlist you need to add a couple of items manually and then export your QA profile.  This will give you an XML file with the extension sdlqasettings.  When you open this file with a text editor you'll see something like this:

        <Setting Id="WrongWordPairs0">
          <WrongWordDef xmlns:i="http://www.w3.org/2001/XMLSchema-instance"
          xmlns="http://schemas.datacontract.org/2004/07/Sdl.Verification.QAChecker">
            <CorrectWord>gas</CorrectWord>
            <WrongWord>petrol</WrongWord>
            <_CorrectWord>gas</_CorrectWord>
            <_WrongWord>petrol</_WrongWord>
          </WrongWordDef>
        </Setting>
        <Setting Id="WrongWordPairs1">
          <WrongWordDef xmlns:i="http://www.w3.org/2001/XMLSchema-instance"
          xmlns="http://schemas.datacontract.org/2004/07/Sdl.Verification.QAChecker">
            <CorrectWord>hood</CorrectWord>
            <WrongWord>bonnet</WrongWord>
            <_CorrectWord>hood</_CorrectWord>
            <_WrongWord>bonnet</_WrongWord>
          </WrongWordDef>
        </Setting>

    Each Setting element contains your word pairs, so it's these you need to get into excel so you can edit it more easily.  Now, having shown you the article and then looked at this I think it's still possible, but it's a lot more work then it would be in the autocorrect files.  I'm also thinking... OMG... we really need to do something about making the resources in Studio more accessible to people with better things to do with their time!!  I think a small utility, or plugin, that easily managed all these sorts of things with a simple excel import/export, would be very much appreciated.

    But as we don't have this yet, and if you feel ready to tackle this I think the key is to get each entry on a single line like this (click it to see the whole line):

    Looks onerus, but I think once you have this format it's not too tricky to sort with excel and you could easily import a huge list in no time once you had the formula in Excel setup to handle one line.  Just follow the principle in the article I shared above.

    Regards

    Paul

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • I absolutely agree with Paul, this could be a little more user-friendly if supported by an app. I recently thought about writing something like that, but didn't have the time :-(

    Even better: why not extend the terminology verifier to perform such checks. Basically this is all about terminology. If the correct terms and wrong terms are suitably marked up in MultiTerm, the "new" terminology verifier could run a check on your translation.

    Let's see if this get's into one of the next releases.

    Just an idea.
  • This is a great input, pls consider my Idea of creating a project-based mt termbases, this could meet this suggestion by matching project related terminology, without forcing generic long lists.
  • I see this post is over 8 years old. Is there a solution for importing a word list into the Studio QA Checker Word List?

    I want to use the word list checker as an alterative solution to a current problem I am having with verification. The termbase I use has identified certain terms as "do not use" terms. For example, remove is a do not use term for delete and vice versa. When I run a verification, I get a long list of false positives because both terms are defined as do not use. I thought, if I could add the correct terms to the word list, I could avoid these erroneous errors, but to do this manually for each language pair I support is very tedious. I'd like to leverage the fact that my termbase already identiies these terms and import the list for each language. 

    Is this possible?

    emoji
  •  

    Probably better to create a new thread seeing as you tagged onto a completely different topic... autocorrect.

    However, yes, you can use the "Import Settings..." in the QA Checker Profile.  Create a couple of rules manually to do what you want, export the settings and take a look at the format with a text editor.

    Export your termbase to Excel with all the fields and filter out the ones you don't need.  Then just convert your spreadsheet into the same format as the exporting settings and import them.  Should be simple enough to achieve.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Thanks for your answer, Paul! 
    I added my question here as the initial post was "Can a Word List be imported into the QA Checker?" but I will make a new post as I have some more questions about how to use the Word List. 

    emoji