PerfectMatch misses off last part of some segments (even when using workaround to this known issue)

Hi everyone,

I've had the issue before, when I prepare a new project for Manual B against a previous Studio Project of very similar (identical at most points) Manual A in Studio using PerfectMatch with formatting tags, the last part of the segment (where it combines 2 sentences into 1 segment during the segmentation process) is lost in the translated target segment.

Due to tag/file type issues with PerfectMatch, I am aware that this is a known bug - i.e. PM should not be used to segment the files, i.e. PerfectMatch should not be run at Project creation stage, but rather after as a separate batch task. All fine, what I've been doing is when creating the project, add TM and then let the TM segment the files. At this stage I also uncheck Lookup (and Concordance) so that it does not input the matches during project creation (otherwise I then have to clear all segments post Project creation before I can then run PerfectMatch in the separate step). Once the project has been created, only then do I run PerfectMatch against the sdlxliffs of the previous project.

Up until now, doing this separately has worked (at least, I'm starting to doubt that - it is always something I look out for)). I just happened to notice the issue again on the latest project I've created in this way. 

The frustrating thing is that we use PM as a cost-saver for the client - they have manuals (50-90 files between 30,000-50,000 total words) for machinery - but some of the source file are customised each time, whilst most files stay the same, and they want the full translated manual back each time. This is why PM seems handy and better than using the TM. But this issue with the PMs not being reliable is really quote frustrating, especially because we do not check the PerfectMatch and also since I thought the workaround solved it. 

I checked the Reports, in case I forgot and used PM during Project creation of this particular project, but I definitely followed the right process. In this particular project, the translator is using memoQ - might this have anything to do with it? I doubt it as the translator couldn't amend the segmentation simply by opening it in memoQ? Either that or is it the fact I disable Lookup/Concordance?

Any help would be really appreciated.

Many thanks,

Gemma 

Parents Reply
  • Hi ,

    Thanks for your message. My particular case was unrelated as I did a test and found that the SDLXLIFFs have the wrong segmentation before I sent it to our memoQ-using team. 

    As to the suggested fix: There was an extra CF tag in some segments. The Support team fixed the original translation by splitting the segments before the extra CF tag and fixing the target translation.

    It seems to happen when there is a cf tag between 2 sentences that for some reason then get segmented wrongly into 1 combined segment. Typically in my case, the translation of the first sentence pf the segment is pulled into the target by PerfectMatch, but the translation of the second sentence is missed entirely. I am unable to share a screenshot due to client confidentiality, sorry, but I hope my explanation makes sense. 

    Hope this helps.

    Best wishes,

    Gemma

Children
No Data