IBM Data and AI Ideas Portal for Customers


Use this portal to open public enhancement requests against products and services offered by the IBM Data & AI organization. To view all of your ideas submitted to IBM, create and manage groups of ideas, or create an idea explicitly set to be either visible to all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).


Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:


Search existing ideas

Start by searching and reviewing existing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for, post a new idea as described below.


Post your ideas

Post ideas and requests to enhance a product or service:

  1. Post an idea

  2. Upvote ideas that matter most to you

  3. Get feedback from the IBM team to refine your idea


Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

IBM Employees should enter Ideas at https://ideas.ibm.com


Status: Not under consideration
Workspace: Spectrum LSF
Created by: Guest
Created on: May 12, 2017

Collect I/O per LSF job as a resource to be reported to RTM

As the number of CPUs per machine rises, the I/O to disk rises proportionally, but the bandwidth to the disk generally does not scale. We have found that on high-CPU machines the system can grind to a near halt because the disk I/O is overloaded.

We can see the culprits by running iotop on an affected host, or by looking at sar, but that only tells us the PID. We can use lsload -l to see a summary of I/O per host, but again this does not differentiate the culprit jobs. RTM can plot the overall I/O per host, but not per job. We have resorted to running cron jobs on the hosts, which grab the PIDs of the running LSF jobs, run iotop against those PIDs, and save the results to a data file. From this we can at least plot the I/O per job, but it would be much better to have this available directly in RTM and to have LSF collect it natively.

Of course, the lim on each host would need to collect this information and pass it to the master. There is already a field for this in lsb.acct, ru_ioch, but it is only populated on HP-UX; it could be used for Linux as well.
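For illustration only, here is a minimal sketch of the kind of cron-driven workaround described above, assuming a Linux host and sampling /proc/<pid>/io rather than parsing iotop output. How job IDs are mapped to PIDs is site-specific (for example, parsed from bjobs -l), so the mapping is passed in as a plain dictionary; the output path and field names are illustrative assumptions, not part of the original request.

```python
#!/usr/bin/env python3
"""Sample per-job I/O counters for LSF jobs running on this host.

A minimal sketch of the cron-job workaround described in the idea:
for each PID belonging to a job it reads /proc/<pid>/io (Linux only)
and appends one record per job to a CSV file, which can later be
plotted per job. How job IDs are mapped to PIDs is site-specific
(e.g. parsed from `bjobs -l`); here the mapping is simply passed in.
"""

import csv
import time
from pathlib import Path


def read_proc_io(pid: int) -> dict:
    """Return the I/O counters from /proc/<pid>/io as a dict of ints."""
    counters = {}
    for line in Path(f"/proc/{pid}/io").read_text().splitlines():
        key, _, value = line.partition(":")
        counters[key.strip()] = int(value.strip())
    return counters


def sample_jobs(job_pids: dict, out_file: str = "/var/tmp/lsf_job_io.csv") -> None:
    """Append one row per job with summed read/write byte counters."""
    now = int(time.time())
    with open(out_file, "a", newline="") as fh:
        writer = csv.writer(fh)
        for job_id, pids in job_pids.items():
            read_bytes = write_bytes = 0
            for pid in pids:
                try:
                    io = read_proc_io(pid)
                except (FileNotFoundError, PermissionError):
                    continue  # process exited or is not readable
                read_bytes += io.get("read_bytes", 0)
                write_bytes += io.get("write_bytes", 0)
            writer.writerow([now, job_id, read_bytes, write_bytes])


if __name__ == "__main__":
    # Hypothetical example: job 1234 has two processes on this host.
    sample_jobs({"1234": [4321, 4322]})
```

Because the /proc counters are cumulative, turning these samples into per-interval rates (the way RTM plots host-level I/O) requires differencing consecutive rows per job.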

  • Guest
    Aug 21, 2020

We have considered this request, and it is not something we are able to deliver in the foreseeable future. If there is broad interest in this, it can be resubmitted in 18 months.

  • Guest
    Aug 30, 2017

    Accurately reflecting I/O per job is not a trivial task and has been investigated a number of times in the past. Different tools give different results depending on the type of I/O: file access to local disk, NFS disk, RDMA disk, etc. all give different results, and the effects of caches, TCP offload engines, network compression, etc. all distort what is measured as true I/O (the sketch after this comment illustrates the cache effect). In most cases, the "true I/O" is what enters and leaves the filer, and while there are tools that give you the filer view, they do not give you the job that is actually causing the issue.

    Tools such as Ellexus Mistral can throttle a job from the client side when its I/O is high (and Mistral is integrated with LSF RTM for reporting and alarms).

    We currently have a prototype that can report per-job I/O from the filer perspective with Spectrum Scale; we are still investigating whether that approach can be extended to generic NFS or NetApp.
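As an editor's illustration of the caching point above (not part of the original thread), the following sketch reads a file twice and compares two counters from /proc/self/io: rchar, the bytes requested through read() calls, and read_bytes, the bytes actually fetched from the block layer on the process's behalf. If the file is not already cached, the first pass moves both counters, while the second pass is typically served from the page cache and moves only rchar; this is one reason a per-process view and a filer-side view of "true I/O" can disagree.

```python
#!/usr/bin/env python3
"""Show why per-process I/O counters depend on caching (Linux only).

Reads a file twice and prints the deltas of two /proc/self/io counters:
  rchar      - bytes requested via read() and similar syscalls
  read_bytes - bytes the process actually caused to be fetched from storage
On a file that is not yet in the page cache, the first pass moves both
counters; the second pass is usually served from cache, so rchar grows
again while read_bytes stays roughly flat.
"""

import sys


def io_counters() -> dict:
    counters = {}
    with open("/proc/self/io") as fh:
        for line in fh:
            key, _, value = line.partition(":")
            counters[key.strip()] = int(value.strip())
    return counters


def read_whole_file(path: str) -> int:
    total = 0
    with open(path, "rb") as fh:
        while chunk := fh.read(1 << 20):  # read in 1 MiB chunks
            total += len(chunk)
    return total


if __name__ == "__main__":
    path = sys.argv[1]
    for attempt in (1, 2):
        before = io_counters()
        size = read_whole_file(path)
        after = io_counters()
        print(f"pass {attempt}: read {size} bytes, "
              f"rchar +{after['rchar'] - before['rchar']}, "
              f"read_bytes +{after['read_bytes'] - before['read_bytes']}")
```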