A collection of custom Diamond collectors that gather various Slurm stats.
These collectors are intended to be used with Diamond to ship stats to Graphite. Each collector gathers data on a different aspect of Slurm. Feel free to add to or update these collectors to suit your needs.
This collector is a Diamond version of the approach described here:
http://giovannitorres.me/graphing-sdiag-with-graphite.html
This collector gathers sdiag stats, allowing you to chart your scheduler performance over time.
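As a rough illustration of the idea, the sketch below turns `key: value` lines of sdiag-style text into metric names and integer values. It is not the collector's actual code: the function name `parse_sdiag` and the sample text are hypothetical, and it assumes the output consists of simple `Label: number` lines.

```python
import re


def parse_sdiag(text):
    """Turn 'Label: 123' lines into {'label': 123}.

    Assumption: sdiag-style output made of 'Label: integer' lines.
    Lines whose value is not a plain integer (section headers, dates)
    are skipped. If a label repeats in a later section, the later
    value overwrites the earlier one in this simple sketch.
    """
    metrics = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        value = value.strip()
        if not sep or not re.fullmatch(r"\d+", value):
            continue
        # Normalise the label into a Graphite-friendly metric name.
        name = re.sub(r"[^a-z0-9]+", "_", key.strip().lower()).strip("_")
        metrics[name] = int(value)
    return metrics


# Hypothetical sample, not captured from a real cluster.
SAMPLE = """Server thread count: 3
Agent queue size: 0
Main schedule statistics (microseconds):
\tLast cycle: 1234
"""

if __name__ == "__main__":
    for name, value in sorted(parse_sdiag(SAMPLE).items()):
        print(name, value)
```

A real collector would also need to keep sections apart (for example by prefixing metric names with the section they came from), since some labels can appear more than once.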
This collector grabs the current sshare data for users. It assumes that you are using a simple two-tier fairshare setup: accounts, and the users of those accounts.
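The two-tier assumption can be pictured with a small sketch that reads pipe-delimited account/user rows and emits one metric per user under its account. The column order (`Account|User|RawUsage|FairShare`), the function name `parse_sshare`, and the sample rows are all assumptions made for illustration, not the collector's real format.

```python
def parse_sshare(text):
    """Return {'account.user.field': value} for user rows only.

    Assumption: each line is 'Account|User|RawUsage|FairShare'.
    Rows with an empty user column are account-level rows and are
    skipped, which is what the two-tier layout implies.
    """
    metrics = {}
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 4:
            continue
        account, user, raw_usage, fairshare = fields[:4]
        if not account or not user:
            continue
        try:
            usage_value = float(raw_usage)
            share_value = float(fairshare)
        except ValueError:
            continue  # header line or malformed row
        metrics[f"{account}.{user}.raw_usage"] = usage_value
        metrics[f"{account}.{user}.fairshare"] = share_value
    return metrics


# Hypothetical sample rows.
SSHARE_SAMPLE = """physics||1000|0.5
physics|alice|600|0.25
physics|bob|400|0.75
"""

if __name__ == "__main__":
    print(parse_sshare(SSHARE_SAMPLE))
```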
This collector pulls the current state of all the nodes in the cluster and then computes overall stats of the cluster such as number of nodes down, number of nodes in use, etc.
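A minimal sketch of that aggregation step is shown below. It assumes the node list has already been fetched as `name state` lines; the function name `count_node_states` and the sample data are hypothetical. Each node is counted once even if it is listed several times, as can happen when a node belongs to more than one partition.

```python
from collections import Counter


def count_node_states(text):
    """Count nodes per state from 'nodename state' lines.

    Assumptions: two whitespace-separated fields per line, and any
    trailing flag characters on the state (such as '*') can be dropped.
    """
    state_by_node = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 2:
            continue
        node, state = fields
        # Keep one state per node so duplicates are not double counted.
        state_by_node[node] = state.rstrip("*~#$@+%!").lower()
    return Counter(state_by_node.values())


# Hypothetical sample, not captured from a real cluster.
NODE_SAMPLE = """n1 idle
n2 allocated
n3 down*
n1 idle
"""

if __name__ == "__main__":
    for state, total in sorted(count_node_states(NODE_SAMPLE).items()):
        print(state, total)
```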
This collector pulls in the job information for the last hour. It then summarizes the data per user so it can be plugged into a leaderboard of the top users.
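The per-user summary could look something like the following sketch. The record layout `(user, cpus, elapsed_seconds)`, the choice of CPU-seconds as the ranking metric, and the function names are assumptions for illustration; the actual collector may summarize different fields.

```python
from collections import defaultdict


def summarize_by_user(jobs):
    """Aggregate job count and CPU-seconds per user.

    Assumption: each job is a (user, cpus, elapsed_seconds) tuple.
    """
    totals = defaultdict(lambda: {"jobs": 0, "cpu_seconds": 0})
    for user, cpus, elapsed in jobs:
        totals[user]["jobs"] += 1
        totals[user]["cpu_seconds"] += cpus * elapsed
    return dict(totals)


def top_users(totals, n=10):
    """Return up to n user names, highest CPU-seconds first."""
    return sorted(totals, key=lambda u: totals[u]["cpu_seconds"], reverse=True)[:n]


# Hypothetical job records.
JOBS = [("alice", 4, 100), ("bob", 1, 50), ("alice", 2, 10)]

if __name__ == "__main__":
    summary = summarize_by_user(JOBS)
    for user in top_users(summary):
        print(user, summary[user])
```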
This collector pulls in the job information for the last hour. It then calculates how many TRES-seconds each job wasted, meaning how much memory and CPU was allocated by the scheduler but not actually used by the job. It then publishes a per-user summary of how much TRES went unused.
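One plausible way to define the wasted amounts is sketched below: wasted CPU-seconds as allocated cores times elapsed time minus CPU time actually consumed, and wasted memory as the gap between allocated and peak memory held for the job's duration. These formulas, the `Job` fields, and the function names are assumptions for illustration and may differ from what the collector computes.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Job:
    """Hypothetical job record; field names are illustrative."""
    user: str
    elapsed_s: int       # wall-clock run time in seconds
    alloc_cpus: int      # cores allocated by the scheduler
    cpu_used_s: float    # CPU time actually consumed, in seconds
    alloc_mem_mb: float  # memory allocated, in MB
    max_mem_mb: float    # peak memory used, in MB


def wasted_tres_seconds(job):
    """Return (wasted CPU-seconds, wasted MB-seconds) for one job."""
    cpu_waste = max(job.alloc_cpus * job.elapsed_s - job.cpu_used_s, 0)
    mem_waste = max(job.alloc_mem_mb - job.max_mem_mb, 0) * job.elapsed_s
    return cpu_waste, mem_waste


def waste_by_user(jobs):
    """Sum wasted CPU-seconds and MB-seconds per user."""
    totals = defaultdict(lambda: [0.0, 0.0])
    for job in jobs:
        cpu_waste, mem_waste = wasted_tres_seconds(job)
        totals[job.user][0] += cpu_waste
        totals[job.user][1] += mem_waste
    return {user: tuple(values) for user, values in totals.items()}


if __name__ == "__main__":
    example = Job("alice", 100, 4, 250.0, 8000.0, 2000.0)
    print(waste_by_user([example]))
```

Clamping at zero keeps a job that slightly exceeds its accounting figures from showing up as negative waste.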
Simply add them to /usr/share/diamond/collectors and then activate them in Diamond, and you should be good to go.
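As a hedged example of the activation step: Diamond collectors are typically enabled through a per-collector config file, named after the collector class, in Diamond's collector config directory (commonly /etc/diamond/collectors). The class name `SlurmSdiagCollector` below is hypothetical; substitute the class name defined in the collector file you installed, and check your diamond.conf for the actual config path.

```
# /etc/diamond/collectors/SlurmSdiagCollector.conf
# (file name is an assumption; it should match the collector's class name)
enabled = True
```

Diamond usually needs a restart to pick up newly added collectors.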