Enterprise Search Summit (ESS2012) is a conference of business professionals in the field of professional search applications that takes place in May in New York. It was obvious from this year’s agenda that search technology need to go far beyond simply indexing...
Visiting Docville May 2012
I’m just back from the international Docville meeting in Brussels. Docville is a community of professionals in the ECM and capture industry, organized and facilitated by Michael Ziegler which already has more than 800 members on LinkedIn....
10 rules for creating a successful mailroom classification project
Automatic, context based classification for mailrooms has proven to generate significant ROI and acceleration of processes in the last few years. But we have also seen failures and disappointments. I have managed and monitored many of these projects in the past and...
Visiting Social Media Analytics Summit
I am just back from an interesting conference in San Francisco on social media analytics. It is a rather small conference; however the market it deals with is growing rapidly. You might ask what social media analytics has to do with document understanding...
Measuring Classification Quality
For an active production system, but also when the classification scheme is set up, it is very important to measure the quality of the classification. The goal is to create as few as possible errors in classification (also called false positives) as these can severely...
Faults and Tolerance
Humans have a remarkable capability to compensate for noisy signal and incomplete information. We are able to distinguish and recognize relevant information even when the signal to noise ratio is extremely low. Missing data is reconstructed from context knowledge or...
Visiting AIIM 2012
AIIM is the community that provides education, research, and best practices on information management and collaboration. This year the AIIM community met in San Francisco. For the first time since 10 years it was held in a conference format with general...
Structured and Unstructured – what is this?
When you have been involved in plans or projects for automated document processing you have for sure been exposed to the distinction between structured and unstructured information. And you might have gathered an understanding what this means. But what does it really...
Classification of – Chairs
If you have followed my presentations in the past you know that document classification closely corresponds to concept creation of the human mind. The concepts represent classes of real life objects. We are able to recognize concepts and group objects and group them...
Document Understanding Primer
For a long time document understanding has been a research topic in computer sciences. We have seen conferences discussing concepts and approaches to use computers and machine learning for understanding documents. Quite often this topic appears also in proceedings on...