-
Notifications
You must be signed in to change notification settings - Fork 134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can I submit tf-job in armada? #536
Comments
/help |
Hi, Armada currently does not support TF Jobs, the closest to it you can get is job with multiple podSpecs, its possible to submit jobs like this:
Armada will aim to schedule multiple pods in one of the clusters at the same time. Adding support of custom jobs specifications like TF Jobs and other kubeflow types (https://www.kubeflow.org/docs/components/training/) is something we have considered, but have not implemented yet. Are you planning to use Armada for specific use case? |
But if I set the tf job with multiple podSpecs, How can tf-operator operate it?
Yes, But our job is mainly AI/BigData like tfjob\Pytorch\spark
I'm looking forward to this feature. Do you have a general schedule? |
@jankaspar Thanks |
Hi, sorry for late reply, You are right, its not possible to use Tensor Flow operator, but you can use Tensor Flow in the multi node jobs without the operator. |
Closing this ticket, due to age and this feature still not being planned. Please reopen if this is still strong interest in this feature. |
After I test the example in https://github.com/G-Research/armada/blob/master/example/jobs.yaml
Is there any methods to submit tf job? The object
job
isn't used frequently. The job of Tensorflow or Pytorch is widely used. https://github.com/kubeflow/tf-operator/blob/master/examples/v1/dist-mnist/tf_job_mnist.yaml┆Issue is synchronized with this Jira Task by Unito
The text was updated successfully, but these errors were encountered: