Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LSF adapter is treating cores as nodes #71

Closed
nickjer opened this issue Jan 24, 2018 · 0 comments
Closed

LSF adapter is treating cores as nodes #71

nickjer opened this issue Jan 24, 2018 · 0 comments
Labels

Comments

@nickjer
Copy link
Contributor

nickjer commented Jan 24, 2018

Requesting a job on a cluster with nodes that have 20 cores per node as such:

bsub -n 10 -R "span[ptile=10]"

will give you a single node with access to 10 out of the 20 cores on it.

This is what I currently see in the LSF adapter when viewing info for that job:

OodAppkit.clusters['ada'].job_adapter.info("7168816").allocated_nodes
=> [
     #<OodCore::Job::NodeInfo:0x000000021fbc18 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000021fb920 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000021fb538 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000021fafc0 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000021fa1b0 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000021f9f08 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000058cbf68 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000058cbec8 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000058cbe28 @name="sx6036-1202", @procs=1>,
     #<OodCore::Job::NodeInfo:0x000000058cbd88 @name="sx6036-1202", @procs=1>
   ]

This should instead be:

OodAppkit.clusters['ada'].job_adapter.info("7168816").allocated_nodes
=> [
     #<OodCore::Job::NodeInfo:0x000000021fbc18 @name="sx6036-1202", @procs=10>
   ]

Debugging info:

$ bjobs -a -w -W 7168816
JOBID      USER    STAT  QUEUE      FROM_HOST   EXEC_HOST   JOB_NAME   SUBMIT_TIME  PROJ_NAME CPU_USED MEM SWAP PIDS START_TIME FINISH_TIME SLOTS
7168816 jnicklas RUN   sn_short   login7      sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202:sx6036-1202 sys/dashboard/dev/jupyter 01/24-14:53:50 082810563939 000:00:07.00 75     0      31211,31395,31399,31419,31761 01/24-14:53:51 -  10
@nickjer nickjer added the bug label Jan 24, 2018
nickjer pushed a commit that referenced this issue Jan 25, 2018
nickjer added a commit that referenced this issue Jan 26, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant