Skip to Main Content
IBM Data and AI Ideas Portal for Customers


This portal is to open public enhancement requests against products and services offered by the IBM Data & AI organization. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).


Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:


Search existing ideas

Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for,


Post your ideas

Post ideas and requests to enhance a product or service. Take a look at ideas others have posted and upvote them if they matter to you,

  1. Post an idea

  2. Upvote ideas that matter most to you

  3. Get feedback from the IBM team to refine your idea


Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

IBM Employees should enter Ideas at https://ideas.ibm.com


Status Future consideration
Workspace Db2
Created by Guest
Created on Feb 1, 2023

Building better quality dictionaries while doing CTAS in Warehouse SaaS / CP4D SaaS

Product affected: Db2 Warehouse on Cloud

Feature affected: CTAS is a Db2 statement - CREATE TABLE AS functionality (CTA)

Overview

GEICO sees very bad compression as compared to Netezza.

Its been observed that currently CTAS uses only the first part of data (default 1 million rows or 500K rows per partition) to build the dictionary. This results in poor quality dictionaries for large tables and poor compression.

The customer (GEICO) wants CTAS to build better quality dictionaries (i.e., through sampling the whole data of the source table).

Please refer to the case -
TS011321960 that was opened in relates to this to get more details.


Description: Basically GEICO is using CTAS statements in their day to day operations ...almost daily ...when they use CTAS on very large tables, by default the dictionary that gets built is not providing good compression results on to the newly built table & leads to a lot of space consumption and is a big problem currently in GEICO env. When the case was opened, developers / team looked at it and suggested that CTAS can be enhanced further to build good compression dictionary (for example making use of Bernoulli algo ??) so that good compression dictionary can be built which provide more compression ratio and avoiding to use large space built out of CTAS. U can connect with @gopalv for more details on to it who is working with GEICO team day to day basis.


Point of contact : Venkatesh Gopal
GEICO DATA&AI CSM : GANESH GOSAVI

Needed By Week