Automatic Extraction of Quotes and Topics from News Feeds

1 February 2009
L. Sarmento and S. Nunes
4th Doctoral Symposium on Informatics Engineering (DSIE’09), February 2009, Porto, Portugal

The explosive growth in information production poses increasing challenges to consumers, who are confronted with problems often described as “information overflow”. We present verbatim, a software system that can be used as a personal information butler to help structure and filter information. We address a small part of the information landscape, namely the extraction of quotes from Portuguese news. This problem poses several challenges, particularly in the areas of information extraction and topic distillation. We give a full description of these problems and of the approach we adopted. verbatim is available online at http://irlab.fe.up.pt/p/verbatim.

