Utilities
Operators in the Utilities category
Home > Utilities
Operators
| Operator | Description |
|---|
| Random K Sampling | Random sampling with given percentage |
| Reservoir Sampling | Reservoir Sampling with k items being kept randomly |
| Split | Split data to two different ports |
| Unnest String | Unnest the string values in the column separated by a delimiter to multiple values |
Total: 4 operators
1 - Random K Sampling
Random sampling with given percentage
Home > Utilities
| Property | Requirement | Type | Default | Description |
|---|
| Random K Sample Percentage | ✓ | Integer | 0 | Random k sampling with given percentage |
Output Ports
2 - Reservoir Sampling
Reservoir Sampling with k items being kept randomly
Home > Utilities
| Property | Requirement | Type | Default | Description |
|---|
| Number Of Item Sampled In Reservoir Sampling | ✓ | Integer | 0 | Reservoir sampling with k items being kept randomly |
Output Ports
3 - Split
Split data to two different ports
Home > Utilities
| Property | Requirement | Type | Default | Description |
|---|
| Split Percentage | | Integer | 80 | Percentage of data going to the upper port |
| Auto-Generate Seed | | Boolean | true | Shuffle the data based on a random seed |
| ↳ Seed | | Integer | 1 | An int for reproducible output across multiple runs |
Output Ports
4 - Unnest String
Unnest the string values in the column separated by a delimiter to multiple values
Home > Utilities
| Property | Requirement | Type | Default | Description |
|---|
| Delimiter | ✓ | String | , | String that separates the data |
| Attribute | ✓ | String | - | Column of the string to unnest |
| Result Attribute | ✓ | String | unnestResult | Column name of the unnest result |
Output Ports