AutoSuggest dictionary based on regular SDLTM or UpLifted SDLTM

In Studio 2017, I was wondering whether creating an AutoSuggest dictionary would yield a different result depending on whether I use my old TM or my UpLifted TM. A couple of thoughts cross my mind, but as I cannot see how everything works in the background, I cannot be sure. I'd think there should be no difference, as the dictionary cuts up the TM into small parts, similar to UpLift (I guess). But creating an AutoSuggest dictionary from an UpLifted TM might not even be a good idea: that TM is about 3 to 4 times bigger, so you could end up with a dictionary that is X times bigger for no good reason (or maybe that simply isn't the case). Or is there no need for an AutoSuggest dictionary at all when Fragment Recall is enabled? In that case I wonder why you would even buy it as an add-on.

  • Interesting questions indeed. And there are other interesting questions:
    1)
    The latest DGT TMX files contain about 3.4 million entries. I can use the TMX files directly to produce AutoSuggest files, and they are quite big. But I can also create a TM from these TMX files, in which case Studio throws away about 1.2 million repetitions, leaving about 2.2 million entries. AutoSuggest files created from these TMs are much smaller.
    Does this make sense? Would it not be better to use the TMX files with all the repetitions to create more entries in the AutoSuggest files?

    2)
    What about UpLift? As I understand it, UpLift also depends on statistics. But UpLift can only be generated from existing TMs, meaning all the repetitions have already been eliminated. Would UpLift not also benefit if its files could be created from the TMX with all its repetitions?
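
For anyone who wants to check how many repetitions a TMX file really contains before deciding which source to use, here is a minimal Python sketch. It is not how Studio deduplicates on import (that logic is not visible to me); it simply assumes a "repetition" is a translation unit whose language codes and segment texts are all identical. The function name `count_translation_units` and the `SAMPLE` data are made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# TMX 1.4 uses xml:lang on <tuv>; older files may use a plain "lang" attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def count_translation_units(tmx_text):
    """Return (total, unique) counts of translation units in a TMX string.

    Assumption: two units are repetitions if they contain the same
    (language, segment text) pairs. Studio's own rule may differ.
    """
    root = ET.fromstring(tmx_text)
    seen = Counter()
    for tu in root.iter("tu"):
        key = tuple(
            (
                tuv.get(XML_LANG) or tuv.get("lang"),
                "".join(tuv.find("seg").itertext()).strip(),
            )
            for tuv in tu.iter("tuv")
            if tuv.find("seg") is not None
        )
        seen[key] += 1
    return sum(seen.values()), len(seen)


# Hypothetical three-unit sample in which one unit is repeated.
SAMPLE = """<tmx version="1.4"><header/><body>
<tu><tuv xml:lang="en"><seg>Hello</seg></tuv><tuv xml:lang="de"><seg>Hallo</seg></tuv></tu>
<tu><tuv xml:lang="en"><seg>Hello</seg></tuv><tuv xml:lang="de"><seg>Hallo</seg></tuv></tu>
<tu><tuv xml:lang="en"><seg>Bye</seg></tuv><tuv xml:lang="de"><seg>Tschüss</seg></tuv></tu>
</body></tmx>"""

total, unique = count_translation_units(SAMPLE)
print(total, unique, total - unique)  # total units, unique units, repetitions
```

For files with millions of entries, loading everything into memory like this may be impractical; a streaming parser such as `xml.etree.ElementTree.iterparse` would be the more realistic choice.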