Extracting information from deeds by Optical Character Recognition (OCR) and text interpretation

Wouters, Rik et al.

FIG Congress 2010 Facing the Challenges - Building the Capacity, Sydney, Australia, 11-16 April 2010.

For many years the Netherlands' Cadastre, Land Registry and Mapping Agency (Kadaster for short) has delivered digital information to customers. As early as the 1990s, information was disseminated to public notaries through the IBM global network. In 2002 Kadaster opened the internet shop KOL (Kadaster-on-line), through which legal ownership information on real estate is provided. At present Kadaster is scanning 15 million deeds that until now were stored on microfilm. The accessibility of these deeds had its limitations, especially now that Kadaster has become an organisation where all processes are organised centrally and at a national level. Part of the project deals with retrieving information concerning servitudes, easements and the like. This information is extracted from recorded deeds by means of text recognition tools. In addition, this information will become available on the internet. This paper describes the procedures and approaches used to take a next step in the e-services provided by Kadaster. The paper also pays attention to special techniques for interpretation, which were applied by the intelligence services in Romania.

Event: XXIV FIG International Congress 2010 Facing the Challenges - Building the Capacity

Only personal, non-commercial use of this document is allowed.

Document type: Extracting information from deeds by Optical Character Recognition (OCR) and text interpretation (253 kB - pdf)