XML file with embedded content - how to exclude text from translation

Triview Translations over 4 years ago

I've got an XML structure with nodes like this one:

This document covers the <Model>.

I want the content of the nodes to be translated, except for <Model>.

I've set up an XML (Legacy Embedded Content) file type. In the Parser rules, the node is set as translatable.

In Embedded Content (Legacy), I've set:

- checked Enable embedded content processing

- Document structure information: added a custom type named "Variable"

- tag definition rules: start tag> end tag < Tag pair, Not translatable.

Now in my preview, I expect a segment "This document covers the "

Instead, I get "This document covers the <Model>."

So my embedded content isn't being filtered out. I'm doing something wrong, but what?

Variables test file.xml

<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="AuthorIT.xslt"?>
<AuthorIT version="20.3.1.40442" xmlns="http://www.authorit.com/xml/authorit" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.authorit.com/xml/authorit AuthorIT.xsd">
	<Objects>
		<Book wordcount="25">
			<Object>
				<Description>Headers and Footers</Description>
				<GUID>9e6905caaefa4b019204e85a0bd8789e</GUID>
				<ID>6116</ID>
				<VariantParentID>4091</VariantParentID>
			</Object>
			<ContentsNodes>
				<Node id="4092"></Node>
				<Node id="4093"></Node>
				<Node id="4094"></Node>
				<Node id="11821"></Node>
			</ContentsNodes>
			<VariableAssignments>
				<VariableAssignment>
					<ID>70</ID>
					<Name>Model_Number</Name>
					<Value>300</Value>
					<ValueObject>0</ValueObject>
					<Style>-1</Style>
					<PublishPrompt>true</PublishPrompt>
					<IsVariantCriteria>false</IsVariantCriteria>
				</VariableAssignment>
			</VariableAssignments>
			<PrintByLine>Art. No.: &lt;Article_Number&gt;</PrintByLine>
			<PrintVersion>&lt;Version&gt;</PrintVersion>
			<WebVersion>&lt;Version&gt;</WebVersion>
		</Book>
		<Topic wordcount="1">
			<Object>
				<Description>Back Page</Description>
				<GUID>390099a7512b4ba79fa2245641714d47</GUID>
				<ID>6205</ID>
				<VariantParentID>4113</VariantParentID>
			</Object>
			<Headings></Headings>
			<RelatedGroups></RelatedGroups>
			<Text>
				<p id="254">&lt;Version&gt;</p>
			</Text>
			<VariableAssignments></VariableAssignments>
			<PrintSuperHeading></PrintSuperHeading>
			<WebTitle></WebTitle>
		</Topic>
	</Objects>
</AuthorIT>

AIT variables test filter.zip


Top Replies

0 Daniel Hug over 4 years ago

Triview Translations

Try this:

In the XML file, the < and > characters are escaped as &lt; and &gt; because they sit inside an XML element. When Studio extracts the content, it will have: "This document covers the <Model>."
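To illustrate the point outside Studio, here is a minimal sketch using Python's standard XML parser as a stand-in. The wrapping `Text` element name is an assumption (the original node name is not shown in the post), and this is not how Studio itself is implemented. It only shows that an XML parser hands back literal angle brackets and sees no child element.

```python
import xml.etree.ElementTree as ET

# Hypothetical node: the wrapping <Text> element name is an assumption.
fragment = "<Text>This document covers the &lt;Model&gt;.</Text>"

node = ET.fromstring(fragment)

# The parser decodes &lt; and &gt;, so the extracted text has literal brackets.
extracted = node.text

# <Model> is plain text, not an element, so the node has no children.
child_count = len(node)

print(extracted)
print(child_count)
```

Because the variable is only text, element-based rules have nothing to select; a text-level (embedded content) rule is needed.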

Daniel

[edited by: Trados AI at 2:57 PM (GMT 0) on 1 Mar 2024]

+1 Daniel Hug over 4 years ago

Triview Translations

In general, it's nice to provide a sample file; it helps those who are willing to help...

I made one up from the information you disclosed:

Then I created a custom XML file type based on the XML 2 file type. These file types are just easier to set up than the legacy file type, with only some drawbacks.

All customization is in the parser and embedded content sections of that file type, so it's really a 3-minute job:

This is how Studio displays the content before I set up the embedded content processing:

Embedded content screen:

Rule:

Result:

Unless there are more requirements, this is a really simple task in Studio, and you can find many different ways of tackling it.

Daniel

0 Triview Translations over 4 years ago in reply to Daniel Hug

I've attached a test file to my original post now, sorry about that.
- I have a number of existing XML filters (based on the XML (Legacy Embedded Content) file type) I'd like to add this to. These have ~50 parser rules each, so it'd take a while to replace them with new filters.

- I made a new XML 2 filter to try your suggestion. I've set it up the way you indicate, but the Preview still suggests the variables are translatable.

- It seems to me that the conversion of < and > is optional: this is a setting in Entities->HTML Special. Tried both with and without entity conversion, with both options the Preview still suggests the variables are translatable.

I suspect part of the trouble is that <Model> isn't an XML element. It's half of a tag pair.

0 Daniel Hug over 4 years ago in reply to Triview Translations

Triview Translations

The Entity conversion setting applies when Trados Studio writes content; it's not about how content is read.

<Model> is indeed not an XML element, it's embedded content. AFAIK you can't work with parser rules here, you must work with an embedded content processor of some kind.
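The read/write distinction can be sketched with Python's standard library as an analogy. This is an illustration under the assumption that Studio behaves like an ordinary XML parser on read; it says nothing about Studio's internals. The paragraph is taken from the attached test file.

```python
import xml.etree.ElementTree as ET

# Paragraph copied from the attached test file.
source = '<p id="254">&lt;Version&gt;</p>'

element = ET.fromstring(source)

# Reading: entities are always decoded to literal characters.
text_as_read = element.text

# Writing: the serializer decides how to escape the characters again.
written_back = ET.tostring(element, encoding="unicode")

print(text_as_read)
print(written_back)
```

Whatever escaping is chosen on output, the text seen while processing already contains literal < and >, which is why changing the entity setting did not affect the preview.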

Daniel

0 Triview Translations over 4 years ago
Thanks to Daniel's info I was able to modify the filter.

This is what I ended up doing (this is for an XML Legacy filter; for the new XML filter format it's slightly different):

In the parser rules, every tag that can contain a variable must have a 'Context' entry.

To add a context entry:
- Double-click the tag.
- In the 'edit rule' dialog, click 'Edit'.
- In the Structure Information Properties dialog, click 'Add'.
- In the Add Structure Information dialog, click in the Name field.
- Enter a context name, for example 'text'.
- Make a note of the Identifier field. If the Name field contains upper case letters, these will be converted to lowercase in the Identifier field.

On the Embedded Content (Legacy) page, check the 'Enable embedded content processing' check box.

In the 'Document structure information' field, add the Identifier for each tag in which you want to filter out the variable names. If you add the Name instead, it won't work: use 'text', not 'Text'.

In the Tag Definition Rules area, add the rule, in my case the start tag is <.*?>

Click Advanced to set the segmentation rule. I chose Include.
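For anyone who wants to check what the `<.*?>` rule will catch before putting it into the filter, here is a rough sketch in Python. It assumes the pattern behaves the same way in the regex engine Studio uses; `split_embedded` is a hypothetical helper written for this illustration, not a Studio function. The sample strings come from the first post and the test file, after entity decoding.

```python
import re

# The start-tag rule from the filter; the lazy .*? stops at the first ">".
VARIABLE = re.compile(r"<.*?>")

def split_embedded(text):
    """Split text into ("text", ...) and ("tag", ...) pieces (illustrative helper)."""
    pieces = []
    pos = 0
    for match in VARIABLE.finditer(text):
        if match.start() > pos:
            pieces.append(("text", text[pos:match.start()]))
        pieces.append(("tag", match.group()))
        pos = match.end()
    if pos < len(text):
        pieces.append(("text", text[pos:]))
    return pieces

pieces = split_embedded("This document covers the <Model>.")
byline = split_embedded("Art. No.: <Article_Number>")
two_vars = split_embedded("<Model> rev. <Version>")

for kind, value in pieces:
    print(kind, repr(value))
```

The lazy quantifier matters when a node holds more than one variable: a greedy `<.*>` would swallow everything from the first < to the last > as a single tag.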

