Skip to content

Commit

Permalink
[Improve] add compress_codec parameter to file related connector
Browse files Browse the repository at this point in the history
  • Loading branch information
liunaijie committed Aug 9, 2023
1 parent f4a5d51 commit c2f4d6e
Show file tree
Hide file tree
Showing 15 changed files with 95 additions and 0 deletions.
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/CosFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### path [string]

Expand Down Expand Up @@ -252,6 +253,16 @@ Reader the sheet of the workbook,Only used when file_format is excel.

Filter pattern, which used for filtering files.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/FtpFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,6 +49,7 @@ If you use SeaTunnel Engine, It automatically integrated the hadoop jar when you
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### host [string]

Expand Down Expand Up @@ -228,6 +229,16 @@ Source plugin common parameters, please refer to [Source Common Options](common-

Reader the sheet of the workbook,Only used when file_format is excel.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/HdfsFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### path [string]

Expand Down Expand Up @@ -250,6 +251,16 @@ Reader the sheet of the workbook,Only used when file_format is excel.

Filter pattern, which used for filtering files.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/Hive.md
Original file line number Diff line number Diff line change
Expand Up @@ -44,6 +44,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| read_partitions | list | no | - |
| read_columns | list | no | - |
| common-options | | no | - |
| compress_codec | string | no | none |

### table_name [string]

Expand Down Expand Up @@ -84,6 +85,16 @@ The read column list of the data source, user can use it to implement field proj

Source plugin common parameters, please refer to [Source Common Options](common-options.md) for details

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```bash
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/OssFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### path [string]

Expand Down Expand Up @@ -249,6 +250,16 @@ Source plugin common parameters, please refer to [Source Common Options](common-

Reader the sheet of the workbook,Only used when file_format is excel.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/OssJindoFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### path [string]

Expand Down Expand Up @@ -253,6 +254,16 @@ Reader the sheet of the workbook,Only used when file_format is excel.

Filter pattern, which used for filtering files.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/S3File.md
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,7 @@ Read all the data in a split in a pollNext call. What splits are read will be sa
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### path [string]

Expand Down Expand Up @@ -304,6 +305,16 @@ Reader the sheet of the workbook,Only used when file_format is excel.

Filter pattern, which used for filtering files.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Changelog

### 2.3.0-beta 2022-10-20
Expand Down
11 changes: 11 additions & 0 deletions docs/en/connector-v2/source/SftpFile.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,6 +48,7 @@ If you use SeaTunnel Engine, It automatically integrated the hadoop jar when you
| common-options | | no | - |
| sheet_name | string | no | - |
| file_filter_pattern | string | no | - |
| compress_codec | string | no | none |

### host [string]

Expand Down Expand Up @@ -231,6 +232,16 @@ Reader the sheet of the workbook,Only used when file_format is excel.

Filter pattern, which used for filtering files.

### compress_codec [string]

The compress codec of files and the details that supported as the following shown:

- txt: `lzo` `none`
- json: `lzo` `none`
- csv: `lzo` `none`
- orc/parquet:
automatically recognizes the compression type, no additional settings required.

## Example

```hocon
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -66,6 +66,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ public OptionRule optionRule() {
.optional(BaseSourceConfig.DATETIME_FORMAT)
.optional(BaseSourceConfig.TIME_FORMAT)
.optional(BaseSourceConfig.FILE_FILTER_PATTERN)
.optional(BaseSourceConfig.COMPRESS_CODEC)
.build();
}

Expand Down

0 comments on commit c2f4d6e

Please sign in to comment.