Sciweavers

332 search results - page 4 / 67
» Document Content Extraction Using Automatically Discovered F...
ISMIR
2004
Springer
14 years 25 days ago
Indexing and Retrieval of Music Documents through Pattern Analysis and Data Fusion Techniques
One of the challenges of music information retrieval is the automatic extraction of effective content descriptors of music documents, which can be used at indexing and at retrieva...
Giovanna Neve, Nicola Orio
CIKM
2009
Springer
14 years 2 months ago
The impact of document structure on keyphrase extraction
Keyphrases are short phrases that reflect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...
Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...
SIGIR
2005
ACM
14 years 1 month ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
AUSAI
2003
Springer
14 years 21 days ago
Semi-Automatic Construction of Metadata from a Series of Web Documents
Metadata plays an important role in discovering, collecting, extracting and aggregating Web data. This paper proposes a method of constructing metadata for a specific topic. The m...
Sachio Hirokawa, Eisuke Itoh, Tetsuhiro Miyahara
CASCON
2006
13 years 8 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein