generic --cloud-type not working #1145

Closed
casperdcl opened this issue Sep 9, 2022 · 4 comments · Fixed by iterative/terraform-provider-iterative#718

@casperdcl
Contributor

cml runner ... --cloud=aws --cloud-type=m+t4 doesn't work:

{"level":"error",
 "message":"terraform error: Error: Failed creating the machine: Not able to decode: operation error EC2: RunInstances,
https response error StatusCode: 400,
RequestID: 622a6bc7-75c5-47d1-83ae-76f8f9029186,
api error InvalidParameterValue: Invalid value 'm+t4' for InstanceType."}

meanwhile cml runner ... --cloud=aws --cloud-type=m --cloud-gpu=t4:

{"level":"error",
 "message":"terraform error: Error: Failed creating the machine: Not able to decode: operation error EC2: RunInstances,
https response error StatusCode: 400,
RequestID: dcc5278c-7eda-467a-b142-20c4d3de10f1,
api error InvalidParameterValue: Invalid value 'm' for InstanceType."}

Does CML not work with TPI generic machine types?

casperdcl added the bug and cml-runner labels on Sep 9, 2022
@dacbd
Contributor

dacbd commented Sep 9, 2022

yes, and no.

Something does need to be improved as what exists is clearly not clear 🙃

I was previously in favor of splitting out these mappings into something more consolidated, but unfortunately that was cut, and these "helper types" are all completely separate in their respective providers. So something like m+t4, which may work on one, does not work on others.

For example, m+t4 is missing from all the providers; see AWS and Azure.

Additionally, --cloud-gpu is essentially only functional with GCP in the way I would expect: Azure and AWS have separate machine types with predetermined GPUs, whereas GCP lets you set the GPU more explicitly. See #1049.
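
To illustrate the point about separate per-provider mappings, here is a minimal sketch. It is not the TPI or CML implementation: the table contents, the instance names, and the functions `resolve_instance_type` and `is_generic_alias` are all hypothetical, and they do not reflect which sizes any provider actually defines. It only shows how a generic size that is absent from one provider's table could end up being passed to the cloud API verbatim, which would produce an "Invalid value ... for InstanceType" style rejection.

```python
# Hypothetical per-provider tables. The sizes and instance names below are
# illustrative placeholders, NOT the real TPI mappings.
GENERIC_TYPES = {
    "aws": {"m": "aws-medium-instance", "m+k80": "aws-medium-k80-instance"},
    "gcp": {"m": "gcp-medium-instance", "m+t4": "gcp-medium-t4-instance"},
}


def is_generic_alias(cloud: str, cloud_type: str) -> bool:
    """Return True if this provider's table defines the generic size."""
    return cloud_type in GENERIC_TYPES.get(cloud, {})


def resolve_instance_type(cloud: str, cloud_type: str) -> str:
    """Map a generic size to a provider-specific type.

    Assumption: unmapped values are passed through unchanged, so the cloud
    API is the first place an unknown size gets rejected.
    """
    return GENERIC_TYPES.get(cloud, {}).get(cloud_type, cloud_type)


print(resolve_instance_type("gcp", "m+t4"))  # mapped by the gcp table
print(resolve_instance_type("aws", "m+t4"))  # not in the aws table, passed through as-is
```

With tables like these, a caller could check `is_generic_alias(cloud, cloud_type)` up front and fail with a clearer message instead of relying on the provider's API error.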

@casperdcl
Contributor Author

...and iterative/cml.dev#251

@dacbd
Contributor

dacbd commented Nov 15, 2022

@casperdcl with iterative/terraform-provider-iterative#718, the next TPI release will resolve the error you found here. With iterative/cml.dev#251 as its own issue that should still be addressed, I think we can close this one, but I'll leave that to you.

@casperdcl
Contributor Author

Thanks; happy to close here :)
