Sciweavers

368 search results - page 40 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
ACL
2012
11 years 9 months ago
A Novel Burst-based Text Representation Model for Scalable Event Detection
Mining retrospective events from text streams has been an important research topic. Classic text representation model (i.e., vector space model) cannot model temporal aspects of d...
Xin Zhao, Rishan Chen, Kai Fan, Hongfei Yan, Xiaom...
KDD
1998
ACM
80views Data Mining» more  KDD 1998»
13 years 11 months ago
Human Performance on Clustering Web Pages: A Preliminary Study
With the increase in information on the World Wide Web it has become difficult to quickly find desired information without using multiple queries or using a topic-specific search ...
Sofus A. Macskassy, Arunava Banerjee, Brian D. Dav...
ICDM
2008
IEEE
186views Data Mining» more  ICDM 2008»
14 years 1 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
ICDIM
2008
IEEE
14 years 1 months ago
Unsupervised key-phrases extraction from scientific papers using domain and linguistic knowledge
The domain of Digital Libraries presents specific challenges for unsupervised information extraction to support both the automatic classification of documents and the enhancement ...
Mikalai Krapivin, Maurizio Marchese, Andrei Yadran...
CIT
2004
Springer
13 years 11 months ago
BioPubMiner: Machine Learning Component-Based Biomedical Information Analysis Platform
Abstract. In this paper we introduce BioPubMiner, a machine learning component-based platform for biomedical information analysis. BioPubMiner employs natural language processing t...
Jae-Hong Eom, Byoung-Tak Zhang