Skip to Main Content
IBM Data and AI Ideas Portal for Customers


This portal is to open public enhancement requests against products and services offered by the IBM Data & AI organization. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).


Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:


Search existing ideas

Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for,


Post your ideas

Post ideas and requests to enhance a product or service. Take a look at ideas others have posted and upvote them if they matter to you,

  1. Post an idea

  2. Upvote ideas that matter most to you

  3. Get feedback from the IBM team to refine your idea


Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

IBM Employees should enter Ideas at https://ideas.ibm.com


Status Planned for future release
Workspace Connectivity
Created by Guest
Created on Jun 21, 2021

Redshift connector slow in IIS DataStage for Insert/Update

We have a use case in which we need to process records from IIS DataStage into an aws Redshift instance. When using the stage as currently implemented, we're seeing a throughput that averages 2-6 records a second which is far below the acceptable range when considering load sizes in the range of hundred of thousands. Additionally depending on your configuration, the connection to Redshift will eventually timeout after several hours. Currently the stage implements inserts and updates using a JDBC insert which is not a best practice according to aws documentation. Instead the stage should be using the Redshift COPY command to bulk load the finished data set which is far faster than inserts.

Needed by Date Sep 1, 2021
  • Guest
    Reply
    |
    Jul 11, 2022

    Virginie Grandhaye,

    Thank you for the update. We need the enhancement for RedShift connector stage performance issue, as soon as possible. We are recently upgraded our environment to IIS DataStage v11.7.1.3 - FixPack3 with Service Pack4 on RHEL 7.9. Please let us know, if there is a Patch that we can install to resolve the issue.

  • Guest
    Reply
    |
    Jan 20, 2022

    We are having the same performance issue using RedShift connector stage when inserting data into RedShift database from DataStage.


    When we use RedShift Connector stage, It's inserting 22 rows per second.


    When we reached IBM Support, they provided the documentation for JDBC Connector stage. After implementing suggested changes for JDBC, Job completed inserting 5 million rows in 16 minutes at a rate of 5377 rows per second. Performance slightly improved and but still it’s not acceptable. We used 500,000 as record count and 500,000 batch size in the JDBC connector stage when processing 5 million rows.


    Using the same set of data and Inserting 5 million rows into Netezza, job completed in less than a minute and processed 421,000 rows per second.


    Using JDBC Connector stage, when we try to read and extract 5 Million rows from RedShift Database, it's completed in less than 1 minute and processed around 350,000 rows per second.

    Using RedShift Connector stage, when we try to read and extract 5 Million rows from RedShift Database, it's completed in less than 1 minute and processed around 160,000 rows per second.

    Reading data from RedShift database using the RedShift connector stage or JDBC Connector stage, we do not see any issues.

    The issue we are having is only, when inserting data into RedShift database.

    We opened PMR with IBM Support : Case TS008045974.

    They stated that, it's an enhancement request and suggested to update here.

    Please let us know, if IBM is currently working on any enhancements to address the performance issues when inserting data into RedShift database from DataStage and when can we expect latest Patches for the RedShift connector and JDBC Connector stages.

  • Guest
    Reply
    |
    Jul 7, 2021

    Thanks for submitting this idea.

    After analysis, it appears that we could consider this item in the future if we get enough demand for that. Changing the status to "Future consideration".

    Thanks