Gathered to help me learn; thought it may help others too. Add your own to the embedded <dp:testdata> element, and the regex's in the template for <l>. Source here. Run it against itself. No input document needed.
To my knowledge, the current WD|rec doesn't cover the following regex idioms.
| Lookaround | (?=) (?< ) | Omitted |
| \x mode + comments | # comment .. \n | \x OK, no comments though. |
| Word boundaries | \b \<..\> | try \w workaround |
| Unicode combining char. | \X | No valid alternate |
| Comments | (?#...) and #.. | Sadly missing. |
| Embed literals | \Q...\E | For ease of reading. |
| Backreferences | \1 | AFAIK |
And the following need escaping within a character class
Be aware of the XSd usage of these. They may not match your previous expericnece. Of use to anyone using these for XML stuff, see XML for a definition of namechar, the \c option and the \i option (XML
| Character sequence | Equivalent character class |
|---|---|
| . | [^\n\r] |
| \s | [#x20\t\n\r] |
| \S | [^\s] |
| \i | the set of initial name characters, those matched by Letter | '_' | ':' |
| \I | [^\i] |
| \c | the set of name characters, those matched by NameChar |
| \C | [^\c] |
| \d | \p{Nd} |
| \D | [^\d] |
| \w | [#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters) |
| \W | [^\w] |