(step_functions): S3 Json Lines Item Reader #33601
Labels
@aws-cdk/aws-stepfunctions
Related to AWS StepFunctions
effort/medium
Medium work item β several days of effort
feature-request
A feature should be added or improved.
p2
Describe the feature
The AWS Step Functions team recently introduced support for JSON Lines (JSONL) in Distributed Map, allowing efficient processing of large datasets stored in this format:
π AWS Blog Post β JSONL Support in Step Functions Distributed Map
Currently, the AWS CDK provides S3JsonItemReader (docs), which supports reading JSON objects from an S3 file. However, this construct does not support JSONL files. Given that JSONL is now natively supported by Step Functions Distributed Map, it would be highly beneficial to have native support for JSONL in the CDK as well.
Use Case
Developers using AWS Step Functions with CDK would be able to seamlessly leverage JSONL for large-scale data processing, without resorting to custom implementations or workarounds.
Proposed Solution
Introduce a new construct (or extend the existing S3JsonItemReader) to support JSONL files, aligning with the latest Step Functions capabilities.
Example:
Other Information
No response
Acknowledgements
CDK version used
2.280.0
Environment details (OS name and version, etc.)
Windows 11, python
The text was updated successfully, but these errors were encountered: