(aws_stepfunctions_tasks): (Add support to start glue crawlers) #24188

Closed
2 tasks
rodgjame opened this issue Feb 15, 2023 · 13 comments · Fixed by #29016 · May be fixed by NOUIY/aws-solutions-constructs#98 or NOUIY/aws-solutions-constructs#99
Labels
@aws-cdk/aws-stepfunctions-tasks effort/small Small work item – less than a day of effort feature-request A feature should be added or improved. p2

Comments

@rodgjame

Describe the feature

As a user of aws-cdk, I would like to create a step function with a step that starts a Glue Crawler, so that I don't have to invoke the crawler via a Lambda. Lambdas have a 15-minute timeout, which is too short for some crawlers to fully crawl large datasets.

Use Case

When I'm in the Step Functions design studio, I see an option for a StartCrawler step. This step is not available in the CDK modules, so I'm unable to create step functions that include it.
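For reference, here is a minimal sketch of what such a StartCrawler step looks like in a state machine definition, assuming it maps to the Step Functions AWS SDK integration for Glue's `StartCrawler` API. The helper `startCrawlerState` and the crawler name `my-crawler` are hypothetical and used only for illustration.

```typescript
// Shape of a single Amazon States Language (ASL) Task state.
interface TaskState {
  Type: "Task";
  Resource: string;
  Parameters: Record<string, unknown>;
  End: boolean;
}

// Hypothetical helper (not part of the CDK): builds a Task state that calls
// Glue's StartCrawler API through the AWS SDK service integration.
function startCrawlerState(crawlerName: string): TaskState {
  return {
    Type: "Task",
    // Assumed resource ARN for the aws-sdk integration with glue:StartCrawler.
    Resource: "arn:aws:states:::aws-sdk:glue:startCrawler",
    // StartCrawler takes the crawler's name in the `Name` field.
    Parameters: { Name: crawlerName },
    End: true,
  };
}

// "my-crawler" is a placeholder crawler name.
const definition = {
  StartAt: "StartCrawler",
  States: { StartCrawler: startCrawlerState("my-crawler") },
};

console.log(JSON.stringify(definition, null, 2));
```

A definition like this can be written by hand today, but it bypasses the typed task constructs that the rest of the CDK module offers, which is what this request is about.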

Proposed Solution

I would like a new task type that allows me to kick off a glue crawler. Something like:

const runGlueCrawler = new tasks.GlueStartCrawler(....

Other Information

No response

Acknowledgements

  • I may be able to implement this feature request
  • This feature might incur a breaking change

CDK version used

"aws-cdk": "^2.58.1"

Environment details (OS name and version, etc.)

Amazon Linux 2

@rodgjame rodgjame added feature-request A feature should be added or improved. needs-triage This issue or PR still needs to be triaged. labels Feb 15, 2023
@amiraliucsc

We also need this feature. Upvoting.

@khushail
Contributor

khushail commented Feb 15, 2023

Hi @rodgjame, thanks for reaching out. I am marking this issue as p2. The requested feature should adhere to the guidelines of the existing Step Functions task constructs.

We use +1s to help prioritize our work, and are happy to reevaluate this issue based on community feedback. You can reach out to the cdk.dev community on Slack to solicit support for reprioritization. Sharing the contributing guide for reference.

@khushail khushail added p2 effort/small Small work item – less than a day of effort and removed needs-triage This issue or PR still needs to be triaged. labels Feb 15, 2023
@amiraliucsc

+1

@glc-imoussaoui

+1

@wdesouza

+1

@bhavin-a

+1

@zee-fee

zee-fee commented May 31, 2023

+1

@nkhandelwal1

+1

@wduan101

+1

@schneifejan

+1

@vilas1999

+1

@ocrlgv

ocrlgv commented Dec 13, 2023

+1

GavinZZ added a commit that referenced this issue Mar 11, 2024
### Issue

Closes #24188.

### Reason for this change

AWS Step Functions supports starting a Glue Crawler as a
task within a state machine. However, the CDK has no task construct for it.

### Description of changes

I've implemented the `GlueStartCrawlerRun` class in stepfunctions-tasks, so
a start-crawler task can now be created easily:
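```ts
import { GlueStartCrawlerRun } from 'aws-cdk-lib/aws-stepfunctions-tasks';

// `stack` is an existing Stack (or any other construct scope).
const task = new GlueStartCrawlerRun(stack, 'Task', {
  crawlerName: 'glue-crawler-name',
});
```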
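Glue's `StartCrawler` API returns as soon as the crawl has been kicked off, so a workflow that needs to wait for the crawl to finish would typically poll afterwards. Below is a rough sketch of such a loop written as a plain state machine definition rather than with CDK constructs. It assumes the AWS SDK integrations for `glue:startCrawler` and `glue:getCrawler`; the helper name `crawlAndWaitDefinition` and the 60-second poll interval are hypothetical.

```typescript
// Hypothetical helper (not part of the CDK): builds an ASL definition that
// starts a crawler, then polls GetCrawler until the crawler is READY again.
function crawlAndWaitDefinition(crawlerName: string, pollSeconds: number) {
  return {
    StartAt: "StartCrawler",
    States: {
      StartCrawler: {
        Type: "Task",
        Resource: "arn:aws:states:::aws-sdk:glue:startCrawler",
        Parameters: { Name: crawlerName },
        Next: "Wait",
      },
      // Pause between polls so the loop does not call Glue continuously.
      Wait: { Type: "Wait", Seconds: pollSeconds, Next: "GetCrawler" },
      GetCrawler: {
        Type: "Task",
        Resource: "arn:aws:states:::aws-sdk:glue:getCrawler",
        Parameters: { Name: crawlerName },
        Next: "IsCrawlerReady",
      },
      // GetCrawler reports the crawler's state under Crawler.State.
      // READY means the crawler is idle again; it does not by itself
      // say whether the last crawl succeeded.
      IsCrawlerReady: {
        Type: "Choice",
        Choices: [
          { Variable: "$.Crawler.State", StringEquals: "READY", Next: "Done" },
        ],
        Default: "Wait",
      },
      Done: { Type: "Succeed" },
    },
  };
}

const pollingDefinition = crawlAndWaitDefinition("glue-crawler-name", 60);
console.log(JSON.stringify(pollingDefinition, null, 2));
```

Because the waiting happens in the state machine rather than inside a function invocation, the loop is not bound by the 15-minute Lambda limit raised in the issue.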

### Description of how you validated changes

I've added both unit and integ tests.

### Checklist
- [x] My code adheres to the [CONTRIBUTING
GUIDE](https://github.com/aws/aws-cdk/blob/main/CONTRIBUTING.md) and
[DESIGN
GUIDELINES](https://github.com/aws/aws-cdk/blob/main/docs/DESIGN_GUIDELINES.md)

----

*By submitting this pull request, I confirm that my contribution is made
under the terms of the Apache-2.0 license*

---------

Co-authored-by: Luca Pizzini <lpizzini7@gmail.com>
Co-authored-by: GZ <yuanhaoz@amazon.com>