What is the technology behind upLIFT?

I thought it would be helpful to kick off this thread as we are seeing a few questions in different places about upLIFT and whether this is the same as Lift which was the basis of this technology when  first introduced it.

So my first question would be how many TUs are needed in your Translation Memory to be able to upgrade it for full upLIFT capability with fragment matching and fuzzy match repair?  I read that Lift could do this from a very small number of TUs yet upLIFT does seem to require a bigger starting point.

What questions do you have?

Regards

Paul

Paul Filkin | RWS Group

________________________
Design your own training!

You've done the courses and still need to go a little further, or still not clear? 
Tell us what you need in our Community Solutions Hub

Parents
  • Thank you Paul for asking the question. I was wondering whether anything has changed here, because from what I understood, Lift was able to find subsegment matches via linguistic analysis, which rested heavily on the use of bilingual dictionaries.
    upLift, however seems to utilize the statistical approach, i.e. there must be a considerate number of TUs to produce reasonable results.

    As I understand it, each sentence where a phrase appears is a sort of coordinate, helping to locate the appropriate translation in TM. Hence, the more "coordinates", the more precise results.

    With linguistic method the "coordinates" are different. They are mainly bilingual dictionaries and other corpora that tell the software that "this phrase seems to be the most probable translation candidate". Do I understand it correctly?

    It's only my guess, but it looks like upLift is, in this way,an upgraded version of Autosuggest Dictionaries, because now the phrases are added on-the-fly and you have the separate window where you can see the search results, which helps a lot. However, it's not the same Lift as the one from the YouTube presentation and the oen described by Kevin Flanagan on Proz forum, is it?

Reply
  • Thank you Paul for asking the question. I was wondering whether anything has changed here, because from what I understood, Lift was able to find subsegment matches via linguistic analysis, which rested heavily on the use of bilingual dictionaries.
    upLift, however seems to utilize the statistical approach, i.e. there must be a considerate number of TUs to produce reasonable results.

    As I understand it, each sentence where a phrase appears is a sort of coordinate, helping to locate the appropriate translation in TM. Hence, the more "coordinates", the more precise results.

    With linguistic method the "coordinates" are different. They are mainly bilingual dictionaries and other corpora that tell the software that "this phrase seems to be the most probable translation candidate". Do I understand it correctly?

    It's only my guess, but it looks like upLift is, in this way,an upgraded version of Autosuggest Dictionaries, because now the phrases are added on-the-fly and you have the separate window where you can see the search results, which helps a lot. However, it's not the same Lift as the one from the YouTube presentation and the oen described by Kevin Flanagan on Proz forum, is it?

Children
No Data