XPath parser to exclude part of text from content

Question

Hello, I am trying to figure out how to exclude part of a text using XPath. I have sample text in a specific structure:

sample_content

[value_text] sample content

I've tried to get the text using this XPath: //section[@tag='2']/sub-section[@id='d] However, that is not enough to exclude "sample_content" from this line. The result is: [value_text] sample content. My goal is: value_text

I looked for a solution on the internet (this website too) but didn't find one. I know that Trados Studio only supports XPath 1.0, which doesn't allow mixing XPath with regular expressions. I also couldn't find any XPath functions that would solve my problem. Do you have any ideas how to handle this?

I use Trados Studio 2019 SR2. I created an XML file type (embedded content).

Kind regards, Adrian

Paul · Accepted Answer

Adrian Wojewoda 
 It is possible, but not with XPath alone, at least not XPath 1.0. First, create your parser rule exactly as you have done. Then add some structure context, like this for example: 
 
 Then activate the embedded content processor and create a regex rule in one of the available ways. I used "Defined by document structure information" because I added the "Paragraph" context above: 
 
 I based this on your specific example, but it might give you an idea if your actual files are a little different. This then gets me the following: 
 
 Which seems to be what you're after.
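
 Outside Trados Studio, the same two-step idea (select the element with a path expression, then trim its text with a regex) can be sketched in Python. This is only an illustration under assumptions: the XML sample is hypothetical because the real file structure is not shown in the thread, the id predicate is left out because its value is truncated in the question, and the bracket pattern is a guess rather than the rule from the screenshots.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample; the element names follow the XPath in the question,
# but the real document structure is not shown in the thread.
SAMPLE = """<doc>
  <section tag="2">
    <sub-section id="a1">[value_text] sample content</sub-section>
  </section>
</doc>"""

root = ET.fromstring(SAMPLE)

# Step 1: the path expression selects the whole element.
# A path expression on its own cannot cut its text down any further.
node = root.find(".//section[@tag='2']/sub-section")
full_text = node.text

# Step 2: a regex applied to the selected text isolates the bracketed value.
# The pattern is an assumption for illustration only.
BRACKETED = re.compile(r"\[([^\]]*)\]")
match = BRACKETED.search(full_text)
value = match.group(1) if match else None

print(full_text)
print(value)
```

 In Trados Studio the second step is what the embedded content processor does; the Python regex here only stands in for it.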

Paul · Answer

Adrian Wojewoda 
 In this case you'd be better off doing something like this: 
 
 Create your filetype with the XPath expression previously agreed. 
 Use this expression to create a placeholder instead of the tag pair: (?<!\[)\b[\w\s]+\b(?![\)]) 
 
 This will select everything apart from the text in the brackets, like this for example, where I even used a really extreme case: 
 
 And if you then set the embedded content rule to "exclude" you can even get the segmentation: 
 
 Looks like it's what you needed?
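
 To see what the placeholder expression matches, it can be tried on the sample line from the question. A minimal sketch, assuming Python's re module behaves like the .NET regex engine used by Trados Studio for this particular pattern:

```python
import re

# Pattern copied unchanged from the answer above.
PLACEHOLDER = re.compile(r"(?<!\[)\b[\w\s]+\b(?![\)])")

text = "[value_text] sample content"

# Everything the pattern matches would become a placeholder in Trados Studio.
matches = PLACEHOLDER.findall(text)

# What is left once the matched part is taken out.
remaining = PLACEHOLDER.sub("", text).strip()

print(matches)    # ['sample content']
print(remaining)  # [value_text]
```

 The lookbehind stops a match from starting right after the opening bracket, and the word boundaries stop it from starting part-way through value_text, so only the text outside the brackets is matched.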
