ECAI 2004 Conference Paper

Web Information Extraction: a domain, user adaptive and multilingual approach

Vangelis Karkaletsis, Constantine D. Spyropoulos

For PAIS 2004.

This paper describes an advanced prototype system for web information retrieval and extraction that is adaptable to different domains, languages and users’ interests. The system was developed in the context of an R&D project involving both academic and industrial organisations. Two different applications were released at the project’s site in four different languages. The system’s architecture is open, modular and multi-agent. It integrates components for collecting domain-specific web pages using crawling and spidering technologies, for extracting information from the collected web pages using natural language processing and machine learning techniques, and for presenting the extracted information according to users’ interests by employing user modelling techniques. A customisation infrastructure is also provided, involving an ontology management system and various customisation tools.

Keywords: information retrieval, information extraction, user modelling, machine learning, multilinguality

Citation: Vangelis Karkaletsis, Constantine D. Spyropoulos: Web Information Extraction: a domain, user adaptive and multilingual approach. In R. López de Mántaras and L. Saitta (eds.): ECAI 2004, Proceedings of the 16th European Conference on Artificial Intelligence, IOS Press, Amsterdam, 2004, pp. 725-729.

ECAI 2004 is organised by the European Coordinating Committee for Artificial Intelligence (ECCAI) and hosted by the Universitat Politècnica de València on behalf of the Asociación Española de Inteligencia Artificial (AEPIA) and the Associació Catalana d'Intel·ligència Artificial (ACIA).