Technologie | Skilja

Das Ende der OCR: Wie Worterkennung alles verändert

von Alexander | Okt. 30, 2025 | Erkennung, Grundlagen, Technologie

Moderne Systeme lesen keine Zeichen – zumindest nicht einzeln. Sie lesen Wörter, Phrasen und sogar Bedeutungen. OCR hat sich zu etwas grundlegend anderem entwickelt, und dieser Wandel sorgt für ein neues Maß an Genauigkeit, Fluidität und Natürlichkeit, das frühere Generationen nicht erreichen konnten.
Genau das hat Skilja mit Lesa erreicht, unserem Deep-Learning-System auf Transformer-Basis, das so konzipiert ist, dass es wie Menschen liest: ganzheitlich, kontextbezogen und intelligent.

Intelligent Document Processing and Enterprise Security

von Alexander | Juni 22, 2025 | Prozess, Technologie

In the past Capture used to be almost always departmental. Capture was only allowed in a (badly lit) corner of an Enterprise mainly because no Capture system was able to fully integrate into Enterprise IT and most importantly comply with all security rules established for enterprises.

This has changed to the better in the past few years. At Skilja we have invested a lot of effort to strictly follow all security requirements so Vinna (our enterprise platform) can become a part of enterprise IT. This means on the one side that during development we are running frequent security screenings and penetrations tests to ensure utmost security for the software. For example Vinna and Skilja Software is Veracode Verified since many years. On the other hand we independently make sure to follow all defined industry standards very closely.

On the Benefits of Page Classification

von skiljaadmin | Sep. 29, 2022 | Erkennung, Klassifikation, Technologie

Classification deals with the categorization of objects. In our process automation and digitization world, we often think of the objects as complete documents that need to be classified. Of course, it is important to understand what the type of a document is...

Document Separation Revisited

von Alexander | Sep. 8, 2022 | Erkennung, Klassifikation, Technologie

One of the frequently overlooked and really difficult problems in document automation, which is also really annoying in daily processing, is the automatic separation of a stack of documents into single meaningful documents and assignment to a document class. The goal would be to simply scan the whole stack and have it separated by an intelligent algorithm. Fortunately this is readily available today from the Skilja technology stack as a built in feature into the Laera classifier. This does not say it is easy. It requires quite some experience and infrastructure to manage several interdependent steps of classification and separation in a stable and reliable way. This is what Laera provides out of the box.

Confusion Matrix

von skiljaadmin | Aug. 10, 2022 | Erkennung, Klassifikation, Technologie

Understanding the quality of an automatic classification system is crucial for its acceptance and any attempt to improve it over time. Quality means that we need to look at errors and at the recognition rate. In classification terms these values are...

Vinna 3.0 Released

von Alexander | Aug. 24, 2021 | News, Prozess, Technologie

We are proud to announce the release of Vinna 3.0, our open 4th generation Document Processing Platform. We created a totally new and modern UI – with an improved backend to support enterprise performance, scalability and security requirements. Process Editor and Process Monitor are completely redesigned with latest web technologies. Vinna is an open and process-oriented platform, that allows users to define a process in exactly the way as it is optimally operated in a company. The architecture of Vinna is service oriented (SOA) and the runtime is easily deployed either in the cloud (Microsoft Azure, AWS or private cloud), on premise or in mixed environments where the data storage is kept in house and processing happens outside.

Process as a Service

von skiljaadmin | Mai 5, 2021 | Erkennung, Prozess, Technologie

Imagine that you have created a powerful process for superb document automation using all kind of advanced recognition, image processing and AI technologies available. With these technologies it is possible to automate almost any document driven process that involves...

Reading Medical Reports

von skiljaadmin | Dez. 27, 2019 | Erkennung, Extraktion, Technologie

Medical Reports are complex documents that are written by doctors who use their specific language and style to express not only facts but also hypotheses and suggestions. They are intended to be read by other doctors or experts who have a deep knowledge of the subject...

The Magic of Online-Learning

von skiljaweb3 | Feb. 3, 2019 | Erkennung, Extraktion, Klassifikation, Technologie

Wouldn’t it be nice if your AI enabled document processing system would continuously take the input from user interactions and use this information to improve the quality of recognition over time? And nobody would have to take care of this – even in the case of...

Vinna 2.0 Released

von skiljaweb3 | Juli 6, 2018 | Prozess, Technologie

We are happy to announce the release our new version Vinna 2.0, the 4th Generation Document Processing Platform.Since the last major release, a year ago, we have worked hard – and a big thanks to the team – to focus on enterprise features that make Vinna’s...

Vinna – 4th Generation Document Processing Platform

von skiljaadmin | Mai 27, 2016 | News, Prozess, Technologie

We are glad to introduce the latest version of our software “Vinna”. Vinna is Icelandic and means “work” and this name is self-explanatory as Vinna is a fourth generation platform for digital document processing. Vinna was presented last week at the open house...

A New Approach to OCR Quality

von skiljaadmin | Apr. 8, 2016 | Erkennung, Technologie

The approach to improve OCR on a given document is very similar to human capabilities of adapting their cognitive capabilities to a specific sample. Just imagine that you see a document with very difficult handwriting. In the begining you will be able to distinguish...

What is a good classifier? (4/4)

von skiljaadmin | Apr. 23, 2015 | Erkennung, Klassifikation, Technologie

This is the fourth and final post on the characteristics of a good content based classifier. In previous posts we have focused on presentation of statistical results and comparison of the Skilja Classifier to a plain vanilla naïve Bayes classifier in Recall-Precision...

What is a good classifier? (3/4)

von skiljaadmin | Jan. 29, 2015 | Erkennung, Klassifikation, Technologie

In recent articles about classifier quality we have focused on the overall statistical results. For this we have used either the precision-recall graph or the inverted precision graph. While these are very good tools to predict the overall quality of a classification...

Auto-Classification Technologies and RFID Smart Docs

von Alexander | Jan. 10, 2015 | Gastbeitrag, Klassifikation, Technologie

Editor’s note: This is a guest post from Cláudio Chaves from TCG Brazil Recent advances in the auto-classification technologies – as described in this blog – have provided a substantial manual labor reduction for several companies related to physical...

What is a good classifier? (2/4)

von Alexander | Dez. 16, 2014 | Erkennung, Klassifikation, Technologie

In our small series about classification quality we have used the precision-recall graph to show the difference between a very good and a so-so classifier in a recent post that you can find here. This graphical representation is very common and easy to understand....

OCR on Historical Documents

von Alexander | Dez. 3, 2014 | Erkennung, Technologie

Skilja is proud to announce that we have received a grant from the European Union supporting a research and development project to improve OCR on historical documents. The grant is provided through the Eurostars program of the European Union. This program supports...

What is a good classifier? (1/4)

von Alexander | Aug. 23, 2014 | Erkennung, Klassifikation, Technologie

Auto-Classification is able to assign categories and hence meaning to documents with an unprecedented speed and quality. The technology for auto-classification has been developed over the last 15 years – from the first tentative rule based systems to elaborate...

Read Faster with Text Streaming

von Alexander | März 11, 2014 | Erkennung, Technologie

An interesting new approach to human document understanding is presented by the Boston based startup company „Spritz“. They believe that human understanding of text (i.e. reading) is slowed down by the eye-movement on the text. Therefore they have developed a new...

Generic and specific classification

von Alexander | Okt. 9, 2012 | Erkennung, Klassifikation, Technologie

In principle it is possible to train enough representative samples to create a classification scheme that is totally generic for a specific purpose. This is what humans do all the time. Reading a text and correctly classifying it manually into a given...

« Ältere Einträge