This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Utilities

Operators in the Utilities category

Home > Utilities

Operators

OperatorDescription
Random K SamplingRandom sampling with given percentage
Reservoir SamplingReservoir Sampling with k items being kept randomly
SplitSplit data to two different ports
Unnest StringUnnest the string values in the column separated by a delimiter to multiple values

Total: 4 operators

1 - Random K Sampling

Random sampling with given percentage

Home > Utilities

Input Properties

PropertyRequirementTypeDefaultDescription
Random K Sample PercentageInteger0Random k sampling with given percentage

Output Ports

PortMode
0Set Snapshot

2 - Reservoir Sampling

Reservoir Sampling with k items being kept randomly

Home > Utilities

Input Properties

PropertyRequirementTypeDefaultDescription
Number Of Item Sampled In Reservoir SamplingInteger0Reservoir sampling with k items being kept
randomly

Output Ports

PortMode
0Set Snapshot

3 - Split

Split data to two different ports

Home > Utilities

Input Properties

PropertyRequirementTypeDefaultDescription
Split PercentageInteger80Percentage of data going to the upper port
Auto-Generate SeedBooleantrueShuffle the data based on a random seed
↳ SeedInteger1An int for reproducible output across multiple
runs

Output Ports

PortMode
0Set Snapshot
1Set Snapshot

4 - Unnest String

Unnest the string values in the column separated by a delimiter to multiple values

Home > Utilities

Input Properties

PropertyRequirementTypeDefaultDescription
DelimiterString,String that separates the data
AttributeString-Column of the string to unnest
Result AttributeStringunnestResultColumn name of the unnest result

Output Ports

PortMode
0Set Snapshot