Unicode C++
Unicode C++
Regular Expression with Named Capture Groups
See more Regular Expressions Examples
Demonstrates regular expressions with named capture groups.See the sample code below.
Note: Chilkat uses PCRE2. See PCRE2 Regular Expressions
Also see: PCRE2 Performance
In PCRE2, named capture groups allow you to assign a name to a capturing group, making it easier to reference by name instead of number.
Syntax
(?<name>pattern)
or
(?'name'pattern)
Example
(?<first>\w+)\s+(?<last>\w+)
Applied to:
"John Smith"
Produces:
first:Johnlast:Smith
Chilkat Unicode C++ Downloads
#include <CkStringBuilderW.h>
#include <CkJsonObjectW.h>
void ChilkatSample(void)
{
bool success = false;
const wchar_t *subject = L"John Smith";
const wchar_t *pattern = L"(?<first>\\w+)\\s+(?<last>\\w+)";
CkStringBuilderW sb;
sb.Append(subject);
CkJsonObjectW json;
json.put_EmitCompact(false);
int timeoutMs = 2000;
int numMatches = sb.RegexMatch(pattern,json,timeoutMs);
if (numMatches < 0) {
// Probably an error in the regular expression.
// Suggestion: Use AI to help create and/or diagnose regular expressions.
wprintf(L"%s\n",sb.lastErrorText());
return;
}
// Examine the matches:
wprintf(L"%s\n",json.emit());
// Here is the JSON showing the matches.
// Important: Capture group 0 always contains the entire match — that is, the portion of the input string that matches the full regular expression.
// {
// "named": {
// "first": 1,
// "last": 2
// },
// "match": [
// {
// "group": [
// {
// "cap": "John Smith",
// "idx": 0,
// "len": 10
// },
// {
// "cap": "John",
// "idx": 0,
// "len": 4
// },
// {
// "cap": "Smith",
// "idx": 5,
// "len": 5
// }
// ]
// }
// ]
// }
// The capture group index is obtained by looking up the name in the JSON result.
// For example:
json.put_I(json.IntOf(L"named.first"));
wprintf(L"first: %s\n",json.stringOf(L"match[0].group[i].cap"));
json.put_I(json.IntOf(L"named.last"));
wprintf(L"last: %s\n",json.stringOf(L"match[0].group[i].cap"));
// Output is:
// first: John
// last: Smith
}