End-to-end information extraction from documents

The Attend, Copy, Parse architecture is a deep neural network model trained on end-to-end data, that bypasses the need for word-level labels

Article

Reading time:

By

Rasmus Berg Palm, Florian Laws, Ole Winther

TABLE OF CONTENTS

Discover Raffle Search

An AI search engine that simplifies data management, analysis, and insights for smarter business decisions and market strategies.

Discover Now

Document information extraction tasks performed by humans create data consisting of a PDF or document image input and extracted string outputs.

This end-to-end data is naturally consumed and produced when performing the task because it is valuable in and of itself. It is naturally available at no additional cost.

Unfortunately, state-of-the-art word classification methods for information extraction cannot use this data, instead requiring word-level labels, which are expensive to create and consequently not available for many real-life tasks.

In this paper, we propose the Attend, Copy, Parse architecture, a deep neural network model that can be trained directly on end-to-end data, bypassing the need for word-level labels. We evaluate the proposed architecture on a large, diverse set of invoices and outperform a state-of-the-art production system based on word classification.

We believe our proposed architecture can be used on many real-life information extraction tasks where word classification cannot be used due to a lack of the required word-level labels.

Download

More from the Newsroom

Product

March 24, 2024

om | nieuwe energie on Raffle

End-to-end information extraction from documents

The Attend, Copy, Parse architecture is a deep neural network model trained on end-to-end data, that bypasses the need for word-level labels

Document information extraction tasks performed by humans create data consisting of a PDF or document image input and extracted string outputs.

This end-to-end data is naturally consumed and produced when performing the task because it is valuable in and of itself. It is naturally available at no additional cost.

Unfortunately, state-of-the-art word classification methods for information extraction cannot use this data, instead requiring word-level labels, which are expensive to create and consequently not available for many real-life tasks.

In this paper, we propose the Attend, Copy, Parse architecture, a deep neural network model that can be trained directly on end-to-end data, bypassing the need for word-level labels. We evaluate the proposed architecture on a large, diverse set of invoices and outperform a state-of-the-art production system based on word classification.

We believe our proposed architecture can be used on many real-life information extraction tasks where word classification cannot be used due to a lack of the required word-level labels.

Download

Read the customer story

Latest Post

End-to-end information extraction from documents

Discover Raffle Search

More from the Newsroom

How to implement Raffle in any website in minutes

Customer Service Trends in 2023: Enhancing Experiences and Building Brands

Do Customers Care about Self-Service Really?

om | nieuwe energie on Raffle

End-to-end information extraction from documents

More Videos from Raffle

Raffle AI Search for energy companies

What our customers say about us

5 hacks to improve CX in 2024

The Future of AI - Webinar

Other contents from Newsroom

CX stats for 2023

How to implement Raffle in any website in minutes

Can’t Stop Your Customer Support Agents from Quitting? Instant Answers Can Help.

Profil Rejser on Raffle

Platforms

Features

Industries

Company

Resources