-
Notifications
You must be signed in to change notification settings - Fork 810
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
alertmanager and ruler irsa is failing, added support to it and also f… #3740
alertmanager and ruler irsa is failing, added support to it and also f… #3740
Conversation
…ixed buy in dockerfile Signed-off-by: Nitesh Vaidyanath <nvaidyanath@stg3bastion.stg3.ap>
Signed-off-by: Nitesh Vaidyanath <nvaidyanath@stg3bastion.stg3.ap>
Signed-off-by: Nitesh Vaidyanath <nvaidyanath@stg3bastion.stg3.ap>
010cd72
to
a744a1e
Compare
…iable Signed-off-by: Nitesh Vaidyanath <nvaidyanath@stg3bastion.stg3.ap>
2d138a8
to
9e777b9
Compare
Signed-off-by: Nitesh-vaidyanath <niteshbv@ymail.com>
d554ac6
to
437d5a6
Compare
…int so got few errors Signed-off-by: Nitesh-vaidyanath <niteshbv@ymail.com>
8e69e79
to
5704bcf
Compare
Signed-off-by: Nitesh-vaidyanath <niteshbv@ymail.com>
bb44046
to
1a5cd04
Compare
Thank you for your PR. From reading the diff, my understanding is that it tries to use AssumeRoleWithWebIdentity to get credentials from session token. I don't think this is the right approach for Cortex. Credentials obtained from |
@pstibrany Yes credentials created by AssumeRoleWithWebIdentity have default TTL 1 hour, i am figuring out a way to add logic in alertmanager and ruler to get credentials whenever it expires. AssumeRoleWithWebIdentity is working fine with ingester and storegateway as it is using different sdk for getting credentials. |
@pstibrany I don't think we need to change the code, just need to upgrade aws-sdk to 0.20.0, currently it is 0.18.0 |
Would you like to send PR updating aws-sdk? |
@@ -4,7 +4,8 @@ | |||
|
|||
|
|||
## 1.7.0 in progress | |||
|
|||
* [ENHANCEMENT] Added support for web indentity tokens |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cortex release 1.8.0 is now in progress. Could you please rebase master and move the CHANGELOG entry under the master / unreleased section?
@pstibrany Thanks for taking care of this. I will close this PR. |
Hello! Cortex ruler/alertmanager components are still not able to integrate with IRSA, making their setup more complex (as it makes cortex require IAM role + IAM user, instead of using single IAM role with non-static credentials that need to be handled as secrets). @Nitesh-vaidyanath @pstibrany are there any future plans to get back to this, or was this solved somehow? Otherwise, if this was a comms issue, I don't mind opening new PR with these changes + changelog + aws sdk update |
Usually the AWS-Go-SDK will take care of aquiring temporary credentials via the
It looks the the ruler instance is trying to perform the STS Any further activities on this @Nitesh-vaidyanath @pstibrany ? |
@abacus3 please open a new issue with the error message. |
…ixed buy in dockerfile
Adding support to iam role for service account(irsa) and fixed dockerfile.
Which issue(s) this PR fixes:
irsa is failing from alertmanager and ruler pod
level=warn ts=2021-01-25T11:29:36.056182464Z caller=multitenant.go:304 component=MultiTenantAlertmanager msg="error fetching all configurations, backing off" err="WebIdentityErr: failed to retrieve credentials\ncaused by: SerializationError: failed to unmarshal error message\n\tstatus code: 405
Ingestor, storage gateway, compactor are using different s3 sdk cortex/vendor/github.com/thanos-io/thanos/pkg/objstore/s3/s3.go,
alertmanager and ruler using s3 client defined in cortex/pkg/chunk/aws/s3_storage_client.go so we are seeing issue for these services.
Checklist
CHANGELOG.md
updated - the order of entries should be[CHANGE]
,[FEATURE]
,[ENHANCEMENT]
,[BUGFIX]