This portal is to open public enhancement requests against products and services offered by the IBM Data & AI organization. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).
Shape the future of IBM!
We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:
Search existing ideas
Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updateson them if they matter to you. If you can't find what you are looking for,
Post your ideas
Post ideas and requests to enhance a product or service. Take a look at ideas others have posted and upvote them if they matter to you,
Post an idea
Upvote ideas that matter most to you
Get feedback from the IBM team to refine your idea
Specific links you will want to bookmark for future use
We have many customers in Europe who have content in multiple languages. In WEX AC and oneWEX this was not a problem as the first step in the WEX analysis pipeline is to determine the main langaueg for a document and then apply the relevant out of the box annotators for lexical analysis and PoS tagging.
In Watson Discovery only one language is allowed per collection, and this must be set when the collection is created. Any documents not of the collection language will not be analysed, or can be searched on. This reduces WD viability and effectivenss, and makes it harder to sell to customers who have muliple languages in their content.
Perhaps a possible alternative could be to build some kind of language identification in the crawler or conversion pipeline so that for an English language collecion, for example, only English langauge documents will be converted, analysed and indexed.
Do not place IBM confidential, company confidential, or personal information into any field.