This portal is to open public enhancement requests against products and services offered by the IBM Data & AI organization. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).
Shape the future of IBM!
We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:
Search existing ideas
Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updateson them if they matter to you. If you can't find what you are looking for,
Post your ideas
Post ideas and requests to enhance a product or service. Take a look at ideas others have posted and upvote them if they matter to you,
Post an idea
Upvote ideas that matter most to you
Get feedback from the IBM team to refine your idea
Specific links you will want to bookmark for future use
Have LSF Handle ZOMBI job searching using either BTREE or HASH ALGO's
Large number of ZOMBI jobs can impact very large clusters operations resulting in high backlogs and additional lost job status. Improving the ZOMBI search algorithm from Linear Search to BTREE or HASH would reduce this impact.
Enhance the Support Tool to be able to remove ZOMBI jobs and create a log of exec hosts and pgids that need to be cleaned
We had an issue where LSF could not keep up with dispatch due to high numbers of ZOMBI jobs. We would like the ability to shut down LSF and clean up the ZOMBI jobs via a Database repair to accelerate the time-to-restore of the LSF clusters.
Make Queue Open/Close/Act/Inact 'all' Requests a single event
We recently had a serious issue that required us to close/inact all of our queues, and then perform a database repair using the IBM support tool. However, when we started LSF, all but one of the queues was Open/Act again. This was due to the suppo...
Make the TCP Backlog for the MBATCH and MBATCHD Query Port Configurable for Large Clusters
We have been monitoring the TCP backlogs for LIM, MBD, and the Query port for some time, and there are periods where the MBD query port becomes overloaded resulting in a higher than expected count of UNKNOWN and sometimes eventually ZOMBI jobs, an...
Add lsb.params setting to limit the number of times a job can be redispatched upon either a pre-exec failure of job initilization failure
We currently have jobs that will bounce from host to host either based upon a pre-exec failure, or a job initialization failure. This can impact > 500 hosts and cause an increase in MBD memory use. It would be nice to have a setting such as: MA...
Dump configs to standardized format for cluster comparisons
We have multiple stand-alone clusters, and frequently have issues with something working differently in one as opposed to another. The solution from our side is to use something like ansible/chef/cfEngine to build the clusters, but for our existin...
Add Regular Expressions for Project Limits and SLA Auto-Attach and Permissions
We utilize hierarchical projects and at times we would like to allow certain projects and certain functions, both that are a part of the hierarchy to utilize an SLA, for example, assume that the project has three levels project_code:finance_code:f...
This is the current bbot/btop functionality:- Group administrators can change the position of all jobs submitted by users who are members of the user group within a queue (move the selected job before/after the first/last job with the same priorit...
Do not place IBM confidential, company confidential, or personal information into any field.