Sciweavers

» Graded-Inclusion-Based Information Retrieval Systems
SIGIR
2010
ACM
14 years 10 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents have become a serious problem for search engines, opinion mining, and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
WEBI
2004
Springer
15 years 9 months ago
Semi-Structured Complex List Extraction
The semi-structured information available in HTML and similar documents provides valuable input for information extraction applications. This information tog...
Anders Arpteg
BTW
2005
Springer
15 years 9 months ago
Web Data Extraction for Business Intelligence: The Lixto Approach
Knowledge about market developments and competitor activities on the market becomes more and more a critical success factor for enterprises. The World Wide Web provides public do...
Georg Gottlob
CIKM
2008
Springer
15 years 6 months ago
Using tag semantic network for keyphrase extraction in blogs
Folksonomies provide a comfortable way to search and browse the blogosphere. As the tags in the blogosphere are sparse, ambiguous and too general, this paper proposes both a super...
Lizhen Qu, Christof Müller, Iryna Gurevych
BMCBI
2006
15 years 4 months ago
Exploring supervised and unsupervised methods to detect topics in biomedical text
Background: Topic detection is a task that automatically identifies topics (e.g., "biochemistry" and "protein structure") in scientific articles based on infor...
Minsuk Lee, Weiqing Wang, Hong Yu