I asked a similar question some time ago, but, unfortunately without any success.
I am currently revising my book on the practical use of regexes in the translation environment, and would like to include a section on special considerations with non-Latin languages, in particular RTLs such as Arabic and Hebrew. As minimum, I would like to include simple regexes for parsing an expression with mixed LTR term, such as product identifier (with letters and numbers, for example, AB123) and the RTL description. I would appreciate any feedback.
Search regex: .*(?<id>[A-Z]{2}\d{3,4})\s+"(?<name>(\w+\s*)*)".*
Replace term: ${id} ${name}
The AB123 "Large black bolt" is defective
The above “English” regex covers many regex elements.