Supported Unicode Character Classes
This section lists the Unicode character classes that are supported by the regular expression engine. Use the one- or two-letter class name in your patterns. For example to match a currency symbol:
\p{Sc}
The supported classes are:
C- OtherCc- ControlCf- FormatCo- PrivateUseCs- SurrogateL- LetterLl- LowercaseLetterLm- ModifierLetterLo- OtherLetterLt- TitlecaseLetterLu- UppercaseLetterM- MarkMc- SpacingMarkMe- EnclosingMarkMn- NonSpacingMarkN- NumberNd- DecimalNumberNl- LetterNumberNo- OtherNumberP- PunctuationPc- ConnectorPunctuationPd- DashPunctuationPe- ClosePunctuationPf- FinalPunctuationPi- InitialPunctuationPo- OtherPunctuationPs- OpenPunctuationS- SymbolSc- CurrencySymbolSk- ModifierSymbolSm- MathSymbolSo- OtherSymbolZ- SeparatorZl- LineSeparatorZp- ParagraphSeparatorZs- SpaceSeparator
Character class assignments are as provided by unicode.org - see http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.