Dear Community,
Our client sent us amazon product texts in an xml file with HTML tags to work with, and we'd like to them use instead of the (tiresome) Excel spreadsheets we've been working with.
We were able to prepare them well enough using a custom XML file type, but we encountered one problem: if we don't use the embedded HTML content processor, tagging and entities are displayed like this:
And if we do enable it, the tags are prepared and excluded perfectly, but the length restrictions (as seen in the orange-coloured DSI in the image above) disappear:
I've tried using (multiple) RegEx in the XML file type by adding custom DSI to the parser, but to no avail, it would be as if I hadn't added anything at all.
I have to admit I'm new with the XML file type and am unsure whether I added this right in the parser/DSI menu, but even if I did, that would still leave the entity problem to deal with.
Does anyone know how to convert the HTML tags an entities all the while keeping the length restrictions from the XML file type?
Thank you in advance!
Guillaume
