Sciweavers

» Using the Structure of HTML Documents to Improve Retrieval
RIAO
2007
13 years 8 months ago
Using a Content-and-Structure Oriented Method for Relevance Feedback in XML Retrieval
As opposed to traditional Information Retrieval (IR) which views whole documents as atomic units of retrieval, XML IR processes XML elements as possible units of retrieval. Many o...
Lobna Hlaoua, Mohand Boughanem, Karen Pinel-Sauvag...
IPPS
2008
IEEE
14 years 1 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how to use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic Data Gathering, Analysis, and Retrieval system)....
Dougal A. Lyon
SIGIR
2004
ACM
14 years 25 days ago
Corpus structure, language models, and ad hoc information retrieval
Most previous work on the recently developed language-modeling approach to information retrieval focuses on document-specific characteristics, and therefore does not take into acc...
Oren Kurland, Lillian Lee
SGAI
2004
Springer
14 years 23 days ago
Neighbourhood Exploitation in Hypertext Categorization
As the web expands exponentially, the need to put some order to its content becomes apparent. Hypertext categorization, that is the automatic classification of web documents into ...
Houda Benbrahim, Max Bramer
IDMS
1998
Springer
13 years 11 months ago
Exploiting User Behaviour in Prefetching WWW Documents
As the popularity of the World Wide Web increases, the amount of traffic results in major congestion problems for the retrieval of data over wide distances. To react to this, user...
Abdulmotaleb El-Saddik, Carsten Griwodz, Ralf Stei...