Full Content is available to subscribers

Subscribe/Learn More  >
Proceedings Article

Large scale parallel document image processing

[+] Author Affiliations
Tijn van der Zant, Lambert Schomaker, Edwin Valentijn

Univ. of Groningen (Netherlands)

Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150S (January 28, 2008); doi:10.1117/12.765482
Text Size: A A A
From Conference Volume 6815

  • Document Recognition and Retrieval XV
  • Berrin A. Yanikoglu; Kathrin Berkner
  • San Jose, CA | January 27, 2008


Building a system which allows to search a very large database of document images requires professionalization of hardware and software, e-science and web access. In astrophysics there is ample experience dealing with large data sets due to an increasing number of measurement instruments. The problem of digitization of historical documents of the Dutch cultural heritage is a similar problem. This paper discusses the use of a system developed at the Kapteyn Institute of Astrophysics for the processing of large data sets, applied to the problem of creating a very large searchable archive of connected cursive handwritten texts. The system is adapted to the specific needs of processing document images. It shows that interdisciplinary collaboration can be beneficial in the context of machine learning, data processing and professionalization of image processing and retrieval systems.

© (2008) COPYRIGHT SPIE--The International Society for Optical Engineering. Downloading of the abstract is permitted for personal use only.

Tijn van der Zant ; Lambert Schomaker and Edwin Valentijn
"Large scale parallel document image processing", Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150S (January 28, 2008); doi:10.1117/12.765482; http://dx.doi.org/10.1117/12.765482

Access This Article
Sign In to Access Full Content
Please Wait... Processing your request... Please Wait.
Sign in or Create a personal account to Buy this article ($15 for members, $18 for non-members).



Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s "Cited By" API will populate this tab (http://www.crossref.org/citedby.html).

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Related Book Chapters

Topic Collections


Buy this article ($18 for members, $25 for non-members).
Sign In