Data Input
Operators in the Data Input category
Operators
Total: 8 operators
1 - Arrow File Scan
Scan data from an Arrow file
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| File | ✓ | String | - | |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
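
The Limit and Offset properties appear on most scan operators in this category. The sketch below shows one plausible reading of them. It assumes that both are counted in output tuples and that Offset is applied before Limit; the page does not state either point, and `apply_offset_limit` is a hypothetical helper, not part of the product.

```python
from itertools import islice


def apply_offset_limit(rows, offset=None, limit=None):
    """Skip `offset` rows, then yield at most `limit` rows.

    Assumption: an unset property (shown as "-" in the table) means
    "no offset" or "no limit" respectively.
    """
    start = offset or 0
    stop = None if limit is None else start + limit
    return islice(rows, start, stop)


rows = ["r0", "r1", "r2", "r3", "r4"]
print(list(apply_offset_limit(rows, offset=1, limit=2)))  # ['r1', 'r2']
```

With both properties unset, every row passes through unchanged.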
Output Ports
2 - CSV File Scan
Scan data from a CSV file
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| File | ✓ | String | - | |
| File Encoding | ✓ | UTF_8, UTF_16, US_ASCII | UTF_8 | Decoding charset to use on input |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
| Delimiter | | String | , | Delimiter to separate each line into fields |
| Header | | Boolean | true | Whether the CSV file contains a header line |
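
As a rough illustration of how the CSV properties above interact, the following sketch reads CSV bytes with Python's standard `csv` module. It is not the operator's implementation. The function `scan_csv`, the mapping from the listed charset names to Python codec names, and the generated `column_N` names used when Header is false are all assumptions made for the example.

```python
import csv
import io

# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {"UTF_8": "utf-8", "UTF_16": "utf-16", "US_ASCII": "ascii"}


def scan_csv(raw, encoding="UTF_8", delimiter=",", header=True,
             offset=None, limit=None):
    """Decode CSV bytes and return the selected rows as dicts."""
    text = raw.decode(CHARSETS[encoding])
    rows = list(csv.reader(io.StringIO(text, newline=""), delimiter=delimiter))
    if not rows:
        return []
    if header:
        # The first line supplies the attribute names.
        names, data = rows[0], rows[1:]
    else:
        # Hypothetical naming scheme for header-less files.
        names = [f"column_{i}" for i in range(1, len(rows[0]) + 1)]
        data = rows
    start = offset or 0
    stop = None if limit is None else start + limit
    return [dict(zip(names, row)) for row in data[start:stop]]


raw = "id;name\n1;a\n2;b\n3;c\n".encode("utf-16")
print(scan_csv(raw, encoding="UTF_16", delimiter=";", offset=1, limit=1))
# [{'id': '2', 'name': 'b'}]
```

In this sketch Offset and Limit count data rows, so the header line is never skipped or counted.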
Output Ports
3 - CSVOld File Scan
Scan data from a CSVOld file
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| File | ✓ | String | - | |
| File Encoding | ✓ | UTF_8, UTF_16, US_ASCII | UTF_8 | Decoding charset to use on input |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
| Delimiter | | String | , | Delimiter to separate each line into fields |
| Header | | Boolean | true | Whether the CSV file contains a header line |
Output Ports
4 - File Lister
Select a dataset version and output one filename tuple per file
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| Dataset | ✓ | String | - | |
Output Ports
6 - File Scan From Input
Scan data from file paths provided by input tuples
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| Encoding | ✓ | UTF_8, UTF_16, US_ASCII | UTF_8 | Decoding charset to use on input |
| Extract | | Boolean | false | |
| Include Filename | | Boolean | false | |
| Attribute Type | ✓ | string, single string, integer, long, double, boolean, timestamp, binary, large binary | string | |
| Attribute Name | ✓ | String | line | |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
Output Ports
7 - JSONL File Scan
Scan data from a JSONL file
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| File | ✓ | String | - | |
| File Encoding | ✓ | UTF_8, UTF_16, US_ASCII | UTF_8 | Decoding charset to use on input |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
| Flatten | ✓ | Boolean | false | Flatten nested objects and arrays |
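
To make the Flatten property concrete, here is a minimal sketch of flattening nested objects and arrays while reading JSONL. The dotted key format (`a.b`, `c.0`) with 0-based array indices is an assumption; the page does not say how the operator names flattened attributes. `flatten` and `scan_jsonl` are hypothetical names.

```python
import json


def flatten(value, prefix=""):
    """Flatten nested dicts and lists into a single-level dict."""
    if isinstance(value, dict):
        items = value.items()
    elif isinstance(value, list):
        items = enumerate(value)  # assumed: arrays are keyed by 0-based index
    else:
        return {prefix: value}
    flat = {}
    for key, child in items:
        name = f"{prefix}.{key}" if prefix else str(key)
        flat.update(flatten(child, name))
    return flat


def scan_jsonl(text, flatten_rows=False):
    """Parse one JSON object per non-empty line, optionally flattening it."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        record = json.loads(line)
        rows.append(flatten(record) if flatten_rows else record)
    return rows


print(scan_jsonl('{"a": {"b": 1}, "c": [10, 20]}\n', flatten_rows=True))
# [{'a.b': 1, 'c.0': 10, 'c.1': 20}]
```

With `flatten_rows=False` the nested values are returned as parsed, which corresponds to the property's default of false.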
Output Ports
8 - Text Input
Source data from manually entered text
| Property | Requirement | Type | Default | Description |
|---|---|---|---|---|
| Text | ✓ | String | - | |
| Attribute Type | ✓ | string, single string, integer, long, double, boolean, timestamp, binary, large binary | string | |
| Attribute Name | ✓ | String | line | |
| Limit | | Integer | - | Max output count |
| Offset | | Integer | - | Starting point of output |
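
The table leaves Attribute Type and Attribute Name undescribed, so the sketch below is only one possible interpretation. It assumes that most types produce one tuple per line of text, cast to the chosen type, and that "single string" emits the whole text as a single tuple. It covers only a few of the listed types, and `text_input` is a hypothetical function, not the operator's API.

```python
def text_input(text, attribute_type="string", attribute_name="line",
               offset=None, limit=None):
    """Turn text into tuples with one attribute (assumed semantics)."""
    if attribute_type == "single string":
        # Assumed: the entire text becomes one tuple; Limit/Offset are ignored.
        return [{attribute_name: text}]
    casts = {
        "string": str,
        "integer": int,
        "long": int,
        "double": float,
        "boolean": lambda s: s.strip().lower() == "true",
    }
    cast = casts[attribute_type]  # other listed types are not covered here
    lines = text.splitlines()
    start = offset or 0
    stop = None if limit is None else start + limit
    return [{attribute_name: cast(line)} for line in lines[start:stop]]


print(text_input("1\n2\n3", attribute_type="integer", attribute_name="n", offset=1))
# [{'n': 2}, {'n': 3}]
```

The default attribute name `line` in the table is what suggests the one-tuple-per-line reading used here.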
Output Ports