Skilja Blog

Intelligent Document Processing and Enterprise Security

For those of us who have long worked in Intelligent Document Processing (IDP), or Capture as it was simply called before, it is a pleasant observation that IDP, which has been around for a long time, is attracting more and more interest in CEO-level discussions and is now seen as an integral part of process optimization. IDP is increasingly integrated into the main business processes. In the past, Capture was almost always departmental. It was only allowed in a (badly lit) corner of the enterprise, mainly because no Capture system was able to fully integrate into enterprise IT and, most importantly, comply with all the security rules established for enterprises.

Document Separation Revisited

One of the frequently overlooked and genuinely difficult problems in document automation, and a real annoyance in daily processing, is the automatic separation of a stack of documents into single meaningful documents and their assignment to a document class. The goal is to simply scan the whole stack and have it separated by an intelligent algorithm. Fortunately, this is readily available today from the Skilja technology stack as a built-in feature of the Laera classifier. This does not mean it is easy. It requires considerable experience and infrastructure to manage several interdependent steps of classification and separation in a stable and reliable way. This is what Laera provides out of the box.

Confusion Matrix

Understanding the quality of an automatic classification system is crucial for its acceptance and any attempt to improve it over time. Quality means that we need to look at errors and at the recognition rate. In classification terms these values are...
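As a rough illustration of looking at errors and recognition rate, here is a minimal sketch that counts a confusion matrix and reads both values from it. The function names, the sample labels and the definition of recognition rate as the share of correctly classified documents are assumptions made for this example, not taken from the post or from any Skilja product.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels):
    """Count how often each (true class, predicted class) pair occurs."""
    return Counter(zip(true_labels, predicted_labels))


def recognition_and_error_rate(matrix):
    """Return (recognition rate, error rate) over all counted documents.

    Assumption: recognition rate = correctly classified / total.
    """
    total = sum(matrix.values())
    correct = sum(n for (true, pred), n in matrix.items() if true == pred)
    recognition_rate = correct / total
    return recognition_rate, 1.0 - recognition_rate


# Hypothetical sample: five documents with known and predicted classes.
true_labels = ["invoice", "invoice", "contract", "letter", "letter"]
predicted_labels = ["invoice", "contract", "contract", "letter", "invoice"]

matrix = confusion_matrix(true_labels, predicted_labels)
rate, error = recognition_and_error_rate(matrix)

# Off-diagonal entries show which classes get confused with each other.
for (true, pred), n in sorted(matrix.items()):
    print(f"true={true:<9} predicted={pred:<9} count={n}")
print(f"recognition rate: {rate:.2f}, error rate: {error:.2f}")
```

The off-diagonal entries are the interesting part: they show not just how many errors occur but which classes are mistaken for which.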

Vinna 3.0 Released

We are proud to announce the release of Vinna 3.0, our open 4th generation Document Processing Platform. We created a totally new and modern UI, with an improved backend to support enterprise performance, scalability and security requirements. Process Editor and Process Monitor have been completely redesigned with the latest web technologies. Vinna is an open and process-oriented platform that allows users to define a process exactly the way it is optimally operated in a company. The architecture of Vinna is service-oriented (SOA), and the runtime is easily deployed in the cloud (Microsoft Azure, AWS or private cloud), on premises, or in mixed environments where the data storage is kept in house and processing happens outside.

Process as a Service

Imagine that you have created a powerful process for superb document automation using all kinds of advanced recognition, image processing and AI technologies available. With these technologies it is possible to automate almost any document-driven process that involves...

Reading Medical Reports

Medical reports are complex documents written by doctors who use their own specific language and style to express not only facts but also hypotheses and suggestions. They are intended to be read by other doctors or experts who have a deep knowledge of the subject...

Vinna 2.0 Released

We are happy to announce the release of our new version, Vinna 2.0, the 4th Generation Document Processing Platform. Since the last major release a year ago, we have worked hard (and a big thanks to the team) to focus on enterprise features that make Vinna’s...

A New Approach to OCR Quality

The approach to improving OCR on a given document is very similar to the human ability to adapt one's cognitive capabilities to a specific sample. Just imagine that you see a document with very difficult handwriting. In the beginning you will be able to distinguish...

OCR on Historical Documents

Skilja is proud to announce that we have received a grant from the European Union supporting a research and development project to improve OCR on historical documents. The grant is provided through the Eurostars program of the European Union. This program supports...

Read Faster with Text Streaming

An interesting new approach to human document understanding is presented by the Boston-based startup "Spritz". They believe that human understanding of text (i.e. reading) is slowed down by eye movement across the text. Therefore they have developed a new...

Classification methods

Classification tries to mimic human understanding. Several methods have been developed in the past to achieve what we as humans can do almost effortlessly. These methods can be divided into two groups. Rule-based classification: rule-based systems are...