Sciweavers

368 search results - page 42 / 74
» Template-Based Information Mining from HTML Documents
KDD
2007
ACM
192 views, Data Mining
14 years 7 months ago
Active exploration for learning rankings from clickthrough data
We address the task of learning rankings of documents from search engine logs of user behavior. Previous work on this problem has relied on passively collected clickthrough data. ...
Filip Radlinski, Thorsten Joachims
KDD
2007
ACM
186 views, Data Mining
14 years 7 months ago
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus
We present a document routing and index partitioning scheme for scalable similarity-based search of documents in a large corpus. We consider the case when similarity-based search ...
Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra
PAKDD
2010
ACM
167 views, Data Mining
13 years 10 months ago
Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand
We present a general framework for the task of extracting specific information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...
Pallika Kanani, Andrew McCallum, Shaohan Hu
AAAI
2010
13 years 8 months ago
Towards an Intelligent Code Search Engine
Software developers increasingly rely on information from the Web, such as documents or code examples on Application Programming Interfaces (APIs), to facilitate their development...
Jinhan Kim, Sanghoon Lee, Seung-won Hwang, Sunghun...
WWW
2006
ACM
14 years 8 months ago
POLYPHONET: an advanced social network extraction system from the web
Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction syst...
Hideaki Takeda, Junichiro Mori, Kôiti Hasida...