Segmentation rule for <br>

Hi,

hope someone can help me with this, most likely there is a simple solution I do not know.

We sometimes receive Excel files from a client, with html code included. I can handle all formatting tags, but I would like to tell Studio, that whenever there is "<br>", this should be a segment separator (similar to a paragraph mark), to avoid to have these character combination in the text or formatted as a tag.

In the segmentation rules, I tried a few options, but they don't work.

In the data type settings there is not really a rule for segmentation.

Any ideas out there?

Many thanks,

Johannes

emoji
Parents Reply Children
  •  

    ok - maybe this video will help.  You are definitely doing something wrong.  Maybe applying the exclude after the project is already created, which will not work... or something like that?

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Hi Paul,

    thanks a lot for the video. That's exactly what I have done as well.

    But you gave me the hint that finally solved it.

    The pre-defined "</?[\p{Ll}\p{Lu}]\w*[^<>]*>" is interfering. When I delete that, then it works.

    In order not to have to delete it, I moved this rule down, and it still works.

    Great, thanks a lot for your help!

    emoji