RegexMatcher
RegexMatcher lets you match the values of a column against several regex patterns and extract the matched values into multiple columns.
- Source, Space, Table, Column - define the input table and the column to parse.
- Target source, Target space, Target table - define the location and name of the result table to be created.
- Result columns - lists the columns to add to the result table. They hold the values extracted by parsing. You refer to these columns in the Regular expressions table to map extracted values to columns.
- Regular expressions - specify one or more regex patterns. The patterns are applied in the order listed, and matching stops at the first pattern that matches. For each pattern, provide:
  - Name - can be any text. The name is stored in the Regex name column to indicate which pattern was applied to extract the value.
  - Pattern - a regex pattern in Java syntax. For example, <(.*@.*)> matches any string between < and > that contains the @ symbol.
  - Expand the pattern row by clicking the arrow on the left side of the row. Provide:
    - Column name - the name of one of the Result columns in which to store the matched value.
    - Substitution - refers to the matched value in the format $<index-of-the-capturing-group-starting-from-1>, e.g. $1.
- Regex name column - name of the column to be added to the result table. It contains the name of the pattern that was matched to extract the value.
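The settings above can be illustrated with a minimal Java sketch. It is not the tool's implementation: the class RegexMatcherSketch, its methods, the rule names "bracketed" and "bare", and the column names "email" and "regex_name" are all hypothetical. It also assumes that a pattern may match anywhere in the value and that a value matching no pattern yields an empty row; the documentation does not state either behavior.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative sketch only; all names here are hypothetical.
class RegexMatcherSketch {

    // One row of the Regular expressions table.
    static final class Rule {
        final String name;                       // value written to the Regex name column
        final Pattern pattern;                   // regex in Java syntax
        final Map<String, String> substitutions; // result column -> substitution, e.g. "$1"

        Rule(String name, String regex, Map<String, String> substitutions) {
            this.name = name;
            this.pattern = Pattern.compile(regex);
            this.substitutions = substitutions;
        }
    }

    // Expands a substitution such as "$1" against the first match of the pattern.
    // Returns null when the pattern does not match the value.
    static String expand(Pattern pattern, String value, String substitution) {
        Matcher m = pattern.matcher(value);
        if (!m.find()) { // assumption: the pattern may match anywhere in the value
            return null;
        }
        StringBuffer sb = new StringBuffer();
        // Appends the text before the match, then the expanded substitution.
        m.appendReplacement(sb, substitution);
        // Drop the text before the match so only the expansion remains.
        return sb.substring(m.start());
    }

    // Applies the rules in order and stops at the first one that matches.
    // Assumption: a value that matches no rule produces an empty row.
    static Map<String, String> extract(String value, List<Rule> rules, String regexNameColumn) {
        Map<String, String> row = new LinkedHashMap<>();
        for (Rule rule : rules) {
            if (!rule.pattern.matcher(value).find()) {
                continue; // try the next pattern in the list
            }
            for (Map.Entry<String, String> e : rule.substitutions.entrySet()) {
                row.put(e.getKey(), expand(rule.pattern, value, e.getValue()));
            }
            row.put(regexNameColumn, rule.name);
            break; // first match wins
        }
        return row;
    }

    // Two example rules that both write to a hypothetical "email" result column.
    static List<Rule> exampleRules() {
        return Arrays.asList(
            new Rule("bracketed", "<(.*@.*)>", Collections.singletonMap("email", "$1")),
            new Rule("bare", "(\\S+@\\S+)", Collections.singletonMap("email", "$1")));
    }

    public static void main(String[] args) {
        List<Rule> rules = exampleRules();
        System.out.println(extract("John <john@example.com>", rules, "regex_name"));
        System.out.println(extract("contact: jane@example.org", rules, "regex_name"));
    }
}
```

In this sketch the order of the rules matters. For the input "John <john@example.com>", both rules match, but the "bracketed" rule is listed first, so the email column receives john@example.com. If "bare" were listed first, it would capture <john@example.com> including the angle brackets.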