
Feature request: Support Sequential Async Processing of Records for SqsFifoPartialProcessor #3140

Open
2 tasks done
amaral-ng opened this issue Sep 30, 2024 · 9 comments · May be fixed by #3160
Labels
  • batch — This item relates to the Batch Processing Utility
  • feature-request — This item refers to a feature request for an existing or new utility
  • help-wanted — We would really appreciate some support from the community for this one

Comments

@amaral-ng

Use case

I am working with a FIFO SQS queue whose batch records must be processed asynchronously. However, to preserve message ordering, the SqsFifoPartialProcessor currently supports only sequential synchronous processing. This limitation prevents me from using asynchronous processing in my FIFO queue handler, which is essential for my use case.

Solution/User Experience

I propose enhancing the SqsFifoPartialProcessor to support sequential asynchronous processing while maintaining message ordering. This approach would be similar to the solution implemented here, but tailored to work with the SqsFifoPartialProcessor. This would allow users to leverage asynchronous processing within FIFO queues without sacrificing the ordering guarantees.
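The core of the proposal can be sketched as a loop that awaits each record's handler before starting the next one (hypothetical names and types; not the actual Powertools implementation):

```typescript
// Minimal sketch of sequential async processing: each record is awaited
// before the next one starts, so FIFO ordering is preserved. Using
// Promise.all here instead would break the ordering guarantee.
type SqsRecordLike = { messageId: string; body: string };

const processSequentially = async (
  records: SqsRecordLike[],
  handler: (record: SqsRecordLike) => Promise<void>
): Promise<string[]> => {
  const processed: string[] = [];
  for (const record of records) {
    // Await the async handler for this record before moving on.
    await handler(record);
    processed.push(record.messageId);
  }
  return processed;
};
```

The handlers run one at a time, but each handler can itself be fully asynchronous (awaiting SDK calls, I/O, etc.), which is the capability the current sync-only processor lacks.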

Alternative solutions

No response

Acknowledgment

Future readers

Please react with 👍 and your use case to help us understand customer demand.

@amaral-ng amaral-ng added feature-request This item refers to a feature request for an existing or new utility triage This item has not been triaged by a maintainer, please wait labels Sep 30, 2024

boring-cyborg bot commented Sep 30, 2024

Thanks for opening your first issue here! We'll come back to you as soon as we can.
In the meantime, check out the #typescript channel on our Powertools for AWS Lambda Discord: Invite link

@dreamorosi dreamorosi added help-wanted We would really appreciate some support from community for this one batch This item relates to the Batch Processing Utility and removed triage This item has not been triaged by a maintainer, please wait labels Sep 30, 2024
@dreamorosi
Contributor

Hi @amaral-ng, thank you for opening this feature request.

I think it makes total sense to add this, as long as we process the items one by one and await each promise before moving on to the next one.

I've also added the help-wanted label; if anyone is interested in picking up the issue and contributing a PR, please leave a comment below so we can assign it to you.

@bml1g12

bml1g12 commented Oct 4, 2024

Ah yes, I wanted to plus 1 this as I also spent a fair bit of time on this one. I eventually came to the same conclusion - that it's not currently supported, so I ended up dropping powertools for my use case and writing my own boilerplate here.

In my use case, I wanted to not only use await but also apply a global rate limit across each message within the batch, as I'm calling AWS' SDK APIs on each message which have their own rate limits associated.

@dreamorosi
Contributor

Hi @bml1g12, thanks for the added context, this is very helpful.

May I ask how you'd be doing the rate limiting part? Do you maintain a separate persistence layer? How do you identify a request/operation? We're considering a rate limiting feature since we've had some other customers requesting it, and this info would be valuable.

@bml1g12

bml1g12 commented Oct 4, 2024

This is the approach I'm using currently, i.e. rate limiting within the handler:

import pThrottle from "p-throttle"
// ...
const handler: SQSHandler = async (event: SQSEvent, context: Context) => {
  // ...
  const throttle = pThrottle({
    limit: config.CallsPerSecondLimit,
    interval: 1000,
    strict: true,
  })
  const throttled = throttle(async (record: SqsRecord) => {
    log.debug("Processing a record from SQS", { local_context: { record } })
    await processRecord(record, sqsClient, "start")
  })
  for (const record of result.data.Records) {
    await throttled(record)
  }
}
It would be even better if there were a convenience tool for global rate limiting across all Lambdas, as it's a common problem we face when different Lambda execution contexts are running concurrently and hitting AWS-imposed API rate limits.

I appreciate that a better way might be to use DynamoDB to store the number of calls in the last minute and use that instead, to provide persistence between Lambda handlers, but that would also be a lot more complex to implement and maintain.
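The "calls in the last interval" idea can be sketched with an in-memory sliding window (hypothetical `SlidingWindowLimiter`, not a Powertools API); for the cross-invocation version described above, the in-memory timestamp array would be replaced by reads/writes to a DynamoDB item:

```typescript
// Sliding-window rate limiter sketch: allow at most `limit` calls per
// `windowMs` milliseconds. Timestamps of recent calls are kept in memory,
// so this only limits within one execution context; persisting them in
// DynamoDB would extend the limit across concurrent Lambda invocations.
class SlidingWindowLimiter {
  private calls: number[] = [];

  constructor(
    private readonly limit: number,
    private readonly windowMs: number
  ) {}

  // Returns true (and records the call) if the call is within the limit.
  tryAcquire(now: number = Date.now()): boolean {
    // Drop timestamps that have fallen out of the window.
    this.calls = this.calls.filter((t) => now - t < this.windowMs);
    if (this.calls.length >= this.limit) return false;
    this.calls.push(now);
    return true;
  }
}
```

A caller that gets `false` back would typically wait and retry, which is effectively what p-throttle automates.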

@arnabrahman
Contributor

I am interested in this. Currently, SqsFifoPartialProcessor extends BatchProcessorSync. To support asynchronous operations, a potential solution is for SqsFifoPartialProcessor to extend BasePartialBatchProcessor, where we can implement the async methods process and processRecord alongside processRecordSync. For async processing, the records will always be processed sequentially. Additionally, the function signature of processPartialResponse would need to be updated to match processPartialResponseSync.

There may be other solutions worth exploring, but this is the one that comes to mind. Let me know your thoughts, @dreamorosi.

@dreamorosi
Contributor

dreamorosi commented Oct 4, 2024

Hey @arnabrahman, ideally that would be the way to go, but unfortunately I think it would constitute a breaking change - even though I doubt many people use the SqsFifoPartialProcessor as-is today because of the sync nature.

I think adding this now will mean we have to do the opposite:

  • Create a SqsFifoPartialProcessorAsync that extends BasePartialBatchProcessor and implement the asynchronous logic there
  • Make the necessary changes to the processPartialResponse* method in that newly created class
  • Leave the current SqsFifoPartialProcessor as-is for now

We'll then add to the v3 backlog an action item to swap the two in the next major version. In v3, SqsFifoPartialProcessor will become the default and async, and SqsFifoPartialProcessorSync will be created.

Regarding the order of processing, yes, we'll need to always keep them sequential to avoid ordering issues.

What do you think?

@arnabrahman
Contributor

arnabrahman commented Oct 5, 2024

Why not extend BatchProcessor, the same way SqsFifoPartialProcessor extends BatchProcessorSync?

Since SqsFifoPartialProcessorAsync will have all the features of SqsFifoPartialProcessor, we could consider using Mixins to decouple some of the common logic between the two classes. I’m not entirely sure if this would be achievable, but I can give it a try. @dreamorosi
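For reference, a TypeScript mixin along these lines might look like the following (hypothetical names and logic, not the actual Powertools classes; the FIFO-specific behavior shown is skipping the rest of a message group once one of its records fails):

```typescript
// Mixin sketch: wrap any base class with shared FIFO bookkeeping so the
// same logic can be reused by both a sync and an async processor class.
type Constructor<T = object> = new (...args: any[]) => T;

function WithFifoLogic<TBase extends Constructor>(Base: TBase) {
  return class extends Base {
    // Message groups that have already seen a failure.
    failedGroupIds = new Set<string>();

    // Records in a failed group should be skipped to preserve ordering.
    shouldSkip(groupId: string): boolean {
      return this.failedGroupIds.has(groupId);
    }

    markGroupFailed(groupId: string): void {
      this.failedGroupIds.add(groupId);
    }
  };
}
```

Both `class SqsFifoProcessorSync extends WithFifoLogic(SomeSyncBase)` and `class SqsFifoProcessorAsync extends WithFifoLogic(SomeAsyncBase)` would then share the group-tracking logic without duplicating it.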

@dreamorosi
Contributor

Hey @arnabrahman, I'm not familiar with mixins but I'm open to trying. I'd say let's move forward and continue the discussion on the PR. I'm sure it'll be easier to talk once we have the code.

Thanks for the ideas!

Projects
Status: Working on it
4 participants