LanguageTool Plugin as a smart spell checker

One of the amazing features of the LanguageTool Plugin is its advanced spell check. Spell checking runs in batch mode, so it is faster than the standard Studio spell check. Ignoring errors as a group is easier and safer. Errors that were ignored unnecessarily can be quickly restored, which is not possible with the standard spell check in Studio.

And most importantly, we can detect errors in:

  • proper names
  • product names
  • untranslatables
  • source UI options when translated bilingually
  • acronyms
  • words in capital letters
  • words in mixed letters
  • words containing digits
  • and alphanumeric expressions.

The LanguageTool Plugin is a smart spell checker for acronyms and alphanumeric expressions. This is how it works.

Screenshot of Trados Studio with LanguageTool Plugin showing spell check errors for words like ATmega168, Arduino Due, and LilyPad highlighted in the target column.

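Errors (source vs. target) and whether each QA checker detects them:

| Source    | Target    | LT Plugin | Number Verifier | Simple Numcheck |
|-----------|-----------|-----------|-----------------|-----------------|
| ATmega168 | ATmeg168  | Yes       | No              | Yes             |
| ATmega328 | ATmega228 | Yes       | Yes             | Yes             |
| Arduino   | Ardoino   | Yes       | No              | No              |
| Due       | Deu       | Yes       | No              | No              |
| SAM3X8e   | SAM38Xe   | Yes       | Yes             | Yes             |
| LilyPad   | LillyPad  | Yes       | No              | No              |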

 

Any error detected by the LanguageTool Plugin is clearly indicated in the Message column and highlighted in the Target column. You can be sure that you won't mistakenly ignore an error, as can happen with the standard spell checker in Studio.

How to configure the LanguageTool Plugin to detect all such errors.

 Screenshot of Trados Studio project settings window with LanguageTool Plugin options for enabling spell check and ignoring specific types of words.

 

Notes in small print: not all of the errors listed above are detected by the LanguageTool Plugin by default. This depends on the default configuration of the speller in the LanguageTool package. The LanguageTool spell checker configuration is set in the INFO file in the dictionary folder:

c:\LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_US.info (for en-US)

c:\LanguageTool-4.3\org\languagetool\resource\pl\hunspell\pl_PL.info (for pl-PL)

In these paths, the first part (c:\LanguageTool-4.3) is the path to the LanguageTool package. The language folder after resource\ (en, pl) and the INFO file name identify the given language.
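The path pattern described above can be sketched in a few lines of Python. This is only an illustration: the function name info_file_path is made up here, and the layout resource\<lang>\hunspell\<lang>_<REGION>.info is an assumption taken from the two example paths. Some languages use a different folder or file name, so check the path before relying on it.

```python
from pathlib import PureWindowsPath


def info_file_path(lt_root: str, long_code: str) -> PureWindowsPath:
    # Hypothetical helper. Assumed layout (from the example paths):
    # <LT root>/org/languagetool/resource/<lang>/hunspell/<lang>_<REGION>.info
    # Only works for long codes that contain a region, such as "pl-PL".
    lang, region = long_code.split("-", 1)
    return (
        PureWindowsPath(lt_root)
        / "org" / "languagetool" / "resource"
        / lang / "hunspell" / f"{lang}_{region}.info"
    )


# Example: path to the Polish INFO file for a package unpacked to c:\LanguageTool-4.3
print(info_file_path(r"c:\LanguageTool-4.3", "pl-PL"))
```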

Modified content of the INFO file

For pl-PL:

# Dictionary properties.
fsa.dict.separator=+
fsa.dict.encoding=UTF-8
fsa.dict.encoder=none
fsa.dict.frequency-included=true
fsa.dict.speller.locale=pl_PL
fsa.dict.speller.ignore-diacritics=true
fsa.dict.speller.ignore-all-uppercase=false
fsa.dict.speller.ignore-camel-case=false
fsa.dict.speller.ignore-numbers=false
fsa.dict.speller.ignore-punctuation=true
fsa.dict.speller.equivalent-chars=x ź, l ł, u ó, ó u, u y, i j, u ł, d t, v w, - —, - –, 0 O
fsa.dict.speller.replacement-pairs=ć dź, ć dz, c dz, ch h, dź ć, dz ć, ę em, ę en, em ę, en ę, h ch, osc ość, loz łos, ą oł, oł ą, ą om, om ą, ą on, on ą, ż rz, rz ż, rz sz, scia ścią, sić ść, sz ż, sz rz, tro rot, x ks, z rz, nei nie, trzcionk czcionk

For en-US:

# Dictionary properties.
fsa.dict.separator=+
fsa.dict.encoding=iso-8859-2
fsa.dict.encoder=SUFFIX
fsa.dict.speller.locale=en_US
fsa.dict.speller.ignore-diacritics=true
fsa.dict.speller.ignore-all-uppercase=false
fsa.dict.speller.ignore-camel-case=false
fsa.dict.speller.ignore-numbers=false
fsa.dict.speller.replacement-pairs=ninties 1990s, a ei, ei a, a ey, ey a, ai ie, ie ai, are air, are ear, are eir, air are, air ere, ere air, ere ear, ere eir, ear are, ear air, ear ere, eir are, eir ere, f ph, ph f, gh f, f gh, kw qu, Bordo Bordeaux, bato bateau, bocoup beaucoup, buro bureau, bo beau, oo ew, ew oo, ew ui, ui ew, oo ui, ui oo, uff ough, oo ieu, ieu oo, ier ear, ear ier, air ear, shun tion, shun sion, shun cion, yersa years, phoby phobia
fsa.dict.frequency-included=true

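For the options Ignore words with UPPER CASE..., Ignore words with miXED CASE... and Ignore words with numbers... to work, the language configuration file must contain the three lines fsa.dict.speller.ignore-all-uppercase=false, fsa.dict.speller.ignore-camel-case=false and fsa.dict.speller.ignore-numbers=false. If these lines are missing, add them. If they are set to “true”, change them to “false”.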
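If you prefer not to edit the file by hand, the change described above can be sketched as a small Python function. This is a minimal sketch, not part of the plugin: the name ensure_ignore_flags is made up for illustration, and it works on the text of the INFO file only, so reading and writing the file (and making a backup first) is left to you.

```python
# The three keys that must be present and set to false.
REQUIRED = {
    "fsa.dict.speller.ignore-all-uppercase": "false",
    "fsa.dict.speller.ignore-camel-case": "false",
    "fsa.dict.speller.ignore-numbers": "false",
}


def ensure_ignore_flags(info_text: str) -> str:
    """Return the INFO file text with the three ignore-* keys set to false."""
    seen = set()
    out = []
    for line in info_text.splitlines():
        key = line.split("=", 1)[0].strip()
        if key in REQUIRED:
            if key not in seen:  # keep a single copy of each key
                out.append(f"{key}={REQUIRED[key]}")
                seen.add(key)
            continue
        out.append(line)  # every other line is kept unchanged
    # Append any key that was missing from the file.
    for key, value in REQUIRED.items():
        if key not in seen:
            out.append(f"{key}={value}")
    return "\n".join(out) + "\n"


sample = "fsa.dict.separator=+\nfsa.dict.speller.ignore-numbers=true\n"
print(ensure_ignore_flags(sample))
```

The function only rewrites the three keys and leaves all other lines, including replacement-pairs, untouched.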

Unfortunately, this will not work for all languages. The table below lists the default settings of the LanguageTool spell checker module, so you can easily check whether you can change your settings. The "FSA ignore params supported" column indicates whether the parameters can be changed in the INFO file. Among the languages I tested, the three parameters cannot be set for Danish, German, Greek, Portuguese and Ukrainian. Nothing needs to be changed for Catalan, Dutch, Swedish and French. For the other languages, you can edit the FSA parameters in the INFO file.

 

 

| LT Language Name | LT Short Code | LT Long Code | FSA ignore params supported | Default ignore-all-uppercase | Default ignore-camel-case | Default ignore-numbers | Path to INFO file |
|---|---|---|---|---|---|---|---|
| Asturian | ast | ast-ES | Yes | | | | \LanguageTool-4.3\org\languagetool\resource\ast\hunspell\ast_ES.info |
| Belarusian | be | be-BY | Yes | true | false | true | \LanguageTool-4.3\org\languagetool\resource\be\hunspell\be_BY.info |
| Breton | br | br-FR | Yes | true | true | true | \LanguageTool-4.3\org\languagetool\resource\br\hunspell\br_FR.info |
| Catalan | ca | ca-ES | No | false | false | false | |
| Catalan (Valencian) | ca | ca-ES-valencia | No | false | false | false | |
| Chinese | zh | zh-CN | Not tested | | | | |
| Danish | da | da-DK | No | | | | \LanguageTool-4.3\org\languagetool\resource\da\danish.info |
| Dutch | nl | nl | Yes | false | false | false | \LanguageTool-4.3\org\languagetool\resource\nl\spelling\nl_NL.info |
| English (Australian) | en | en-AU | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_AU.info |
| English (Canadian) | en | en-CA | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_CA.info |
| English (New Zealand) | en | en-NZ | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_NZ.info |
| English (South African) | en | en-ZA | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_ZA.info |
| English (GB) | en | en-GB | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_GB.info |
| English (US) | en | en-US | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\en\hunspell\en_US.info |
| Esperanto | eo | eo | Not tested | | | | |
| French | fr | fr | Yes | false | false | false | \LanguageTool-4.3\org\languagetool\resource\fr\hunspell\fr_FR.info |
| Galician | gl | gl-ES | Not tested | | | | |
| German (Germany) | de-DE | de-DE | No | false | false | true | \LanguageTool-4.3\org\languagetool\resource\de\hunspell\de_DE.info |
| German (Austria) | de-AT | de-AT | No | false | false | true | \LanguageTool-4.3\org\languagetool\resource\de\hunspell\de_AT.info |
| German (Switzerland) | de-CH | de-CH | No | false | false | true | \LanguageTool-4.3\org\languagetool\resource\de\hunspell\de_CH.info |
| Greek | el | el-GR | No | | | true | \LanguageTool-4.3\org\languagetool\resource\el\hunspell\el_GR.info |
| Italian | it | it | Yes | true | false | true | \LanguageTool-4.3\org\languagetool\resource\it\hunspell\it_IT.info |
| Japanese | ja | ja-JP | Not tested | | | | |
| Khmer | km | km-KH | Not tested | | | | |
| Persian (Iran) | fa | fa | Not tested | | | | |
| Polish | pl | pl-PL | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\pl\hunspell\pl_PL.info |
| Portuguese (Portugal) | pt-PT | pt-PT | No | false | false | true | |
| Portuguese (Angola preAO) | pt-AO | pt-AO | No | false | false | true | |
| Portuguese (Brazil) | pt-BR | pt-BR | No | false | false | true | |
| Portuguese (Moçambique preAO) | pt-MZ | pt-MZ | No | false | false | true | |
| Romanian | ro | ro-RO | Yes | | | | \LanguageTool-4.3\org\languagetool\resource\ro\hunspell\ro_RO.info |
| Russian* | ru | ru-RU | Yes* | | | | \LanguageTool-4.3\org\languagetool\resource\ru\hunspell\ru_RU.info |
| Serbian (Serbia) | sr | sr-RS | Not tested | | | | |
| Serbian (Montenegro) | sr | sr-ME | Not tested | | | | |
| Serbian (Bosnia and Herzegovina) | sr | sr-BA | Not tested | | | | |
| Slovak | sk | sk-SK | Yes | true | false | true | \LanguageTool-4.3\org\languagetool\resource\sk\hunspell\sk_SK.info |
| Slovenian | sl | sl-SI | Yes | true | false | true | \LanguageTool-4.3\org\languagetool\resource\sl\hunspell\sl_SI.info |
| Spanish | es | es | Yes | false | false | true | \LanguageTool-4.3\org\languagetool\resource\es\hunspell\es_ES.info |
| Swedish | sv | sv | No | false | false | false | |
| Tagalog | tl | tl-PH | Not tested | | | | |
| Tamil | ta | ta-IN | Not tested | | | | |
| Ukrainian (Ukraine) | uk | uk-UA | No | true | true | true | |

* Russian: Latin character words are ignored.
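To read the table quickly: any parameter whose default is true has to be set to false in the INFO file, provided the language supports the FSA ignore parameters. A minimal sketch of that check follows. The dictionary holds only a few rows copied from the table above, and the names DEFAULTS, PARAMS and params_to_change are made up for illustration.

```python
# Defaults copied from a few table rows (languages with "Yes" in the
# "FSA ignore params supported" column).
# Order: ignore-all-uppercase, ignore-camel-case, ignore-numbers.
DEFAULTS = {
    "pl-PL": (False, False, True),
    "it": (True, False, True),
    "br-FR": (True, True, True),
    "fr": (False, False, False),
}
PARAMS = ("ignore-all-uppercase", "ignore-camel-case", "ignore-numbers")


def params_to_change(long_code: str) -> list:
    """List the parameters whose default is true and so must be set to false."""
    return [name for name, value in zip(PARAMS, DEFAULTS[long_code]) if value]


for code in DEFAULTS:
    print(code, params_to_change(code))
```

An empty list means the defaults already allow all three checks, as for French.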

 

The complete version of this post will be published in the User Manual for new plug-in versions and on www.posteditacat.xyz.



Generated Image Alt-Text
[edited by: Trados AI at 5:35 AM (GMT 0) on 6 Mar 2024]