REMatch

IMPORTANT

Read more about Data Workbench’s End-of-life announcement.

The REMatch transformation is a pattern-matching transformation that uses regular expressions to specify one or more patterns to look for and capture in the input.

The transformation constructs an output field for each capturing sub-pattern in the regular expression. If the regular expression does not match the input field, the outputs are blank, and if the output field already exists, the values are replaced by the blank values. For a brief guide to using regular expressions, see Regular Expressions.

NOTE

The REMatch transformation operates similarly to the RETransform transformation (see RETransform), which uses regular expressions to capture a string and stores that string in a single output field.

REMatch parses a string more efficiently than multiple RETransform transformations or a single RETransform transformation followed by a Flatten transformation. See Flatten.

Parameter Description Default
Name Descriptive name of the transformation. You can enter any name here.
Case Sensitive True or false. Specifies whether the match is case-sensitive.
Comments Optional. Notes about the transformation.
Condition The conditions under which this transformation is applied.
Expression The regular expression used for matching.
Input The field against which the regular expression is evaluated.
Outputs

The name of the output string or vector. In the case of string vectors as input, the outputs are also string vectors.

An output field must exist for each capturing sub-pattern in the expression.

NOTE

REMatch transformations can be very slow and may account for much of the data processing time.

In this example, a REMatch transformation parses a date of the format YYYY-MM-DD into the fields x-year, x-month, and x-day. For the date 2007-01-02, the values of x-year, x-month, and x-day would be 2007, 01, and 02, respectively.

On this page