Overview | Project Details | Team | Awards |
Developing full text research
During the development of this first version of the archive arbeiter-zeitung.at various ways of optical character recognition (OCR) and more importantly different research and presentation solutions for a full-text newspaper archive have been created and evaluated. The full-text issue presented below shows what a highly developed research and presentation-tool can provide. (As full-text is the subject in this test, it is stripped of other standard navigation tools as page turning, zoom or similar.) The example shows June, 22nd 1978 - the day after Austria's historic victory over Germany during the Soccer World Cup in Argentine. As the AZ is in German the following names or words might be useful to test the efficiency of the full-text-research: Krankl (striker in Austria's soccer team), Atom, Carter;Or view a particular page:
Try the full text search:
Another difficulty for OCR in connection with newspapers is their complex layout. This makes an automated separation of single articles challenging if not impossible. One the other hand, to forego the separation of articles will result in lesser quality and usability of the research. Either way, the process of OCR is costly and time consuming. Not only in the interest of arbeiter-zeitung.at Kaltenbrunner Medienberatung and scharf_net will keep on researching and developing creative solutions for highly efficient and user-friendly full-text research in digitised newspapers and their presentation online.