Automatic Workload Creation #2536

Closed
m-mamdouhi opened this issue Jul 4, 2024 · 6 comments

Comments

@m-mamdouhi

m-mamdouhi commented Jul 4, 2024

Is there a way for Kueue to create Workloads for Kubeflow jobs? I know that for batch Jobs Kueue creates the Workload automatically, but the documentation doesn't clarify how to do this for other job types. Just adding the label to the manifest and then creating the job doesn't seem to create the Workload automatically.
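
For reference, the label in question is presumably `kueue.x-k8s.io/queue-name`, which points a job at a LocalQueue. A minimal sketch of a Kubeflow XGBoostJob carrying it might look like the following; the job name, the LocalQueue name `user-queue`, the image, and the resource requests are all placeholders rather than values from this thread.

```yaml
apiVersion: kubeflow.org/v1
kind: XGBoostJob
metadata:
  name: xgboost-sample                      # hypothetical job name
  namespace: default
  labels:
    kueue.x-k8s.io/queue-name: user-queue   # assumed LocalQueue in the same namespace
spec:
  xgbReplicaSpecs:
    Master:
      replicas: 1
      restartPolicy: Never
      template:
        spec:
          containers:
          - name: xgboost
            image: example.com/xgboost-train:latest   # placeholder image
            resources:
              requests:
                cpu: "1"
                memory: 1Gi
    Worker:
      replicas: 1
      restartPolicy: Never
      template:
        spec:
          containers:
          - name: xgboost
            image: example.com/xgboost-train:latest   # placeholder image
            resources:
              requests:
                cpu: "1"
                memory: 1Gi
```

The label only has an effect if the corresponding integration is enabled in Kueue's configuration and the Kubeflow training operator CRDs are installed in the cluster.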

m-mamdouhi changed the title from "Workload" to "Automatic Workload Creation" on Jul 4, 2024
@trasc
Contributor

trasc commented Jul 4, 2024

Hi, we have a couple of guides relating to Kubeflow jobs here. Please make sure you follow the administer cluster quotas guide before creating the Kubeflow jobs.

@m-mamdouhi
Author

I have gone through all the steps of the guide several times, but for any job that isn't a batch/v1 Job the Workload doesn't get created.
I also looked into controller_manager_config.yaml, and the integration name is already added there (I want to deploy Kubeflow and XGBoost jobs). Is there a step beyond the guide that I have to do?
It works perfectly well with batch Jobs, just not with any other job type.
Any help would be very much appreciated.
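
As a point of comparison, the relevant part of `controller_manager_config.yaml` is the `integrations.frameworks` list. The fragment below is a sketch of what an enabled XGBoost integration might look like, not the configuration actually used in this thread; the selection of frameworks is an assumption.

```yaml
apiVersion: config.kueue.x-k8s.io/v1beta1
kind: Configuration
integrations:
  frameworks:
  - "batch/job"                  # plain batch/v1 Jobs
  - "kubeflow.org/xgboostjob"    # Kubeflow XGBoostJob
  - "kubeflow.org/pytorchjob"    # further Kubeflow kinds are listed the same way
```

If Kueue is installed through Helm, it is worth confirming that the rendered ConfigMap in the cluster actually contains these entries, since the file in the repository and the deployed configuration can differ.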

@trasc
Contributor

trasc commented Jul 5, 2024

Can you check the logs of Kueue's controller manager?

@m-mamdouhi
Author

I found the issue to be a misconfiguration on my part in the Helm installation using Crossplane.
I managed to fix it. Thanks @trasc for responding and trying to help me out!

@pgn-dev

pgn-dev commented Jul 6, 2024

@m-mamdouhi would you mind sharing what the misconfiguration was?

@m-mamdouhi
Author

m-mamdouhi commented Jul 8, 2024

@pgn-dev it was one (or both) of these two things; I'm not sure which:

  1. The name of the cluster to be created by Crossplane was too long, so the Workloads weren't being created.
  2. I had not correctly added the name of a custom workload configuration to the external integration framework.
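
For the second point, custom job kinds managed by an external controller are listed under `integrations.externalFrameworks` in Kueue's configuration. The fragment below is only a sketch of the expected `Kind.version.group` form; `Foo.v1.example.com` is a hypothetical kind, not the one used in this thread, and field availability depends on the Kueue version installed.

```yaml
apiVersion: config.kueue.x-k8s.io/v1beta1
kind: Configuration
integrations:
  frameworks:
  - "batch/job"
  externalFrameworks:
  - "Foo.v1.example.com"   # hypothetical custom kind, written as Kind.version.group
```

A misspelled or wrongly ordered entry here is one plausible way for Workloads of a custom kind to never be created.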
