Sciweavers

2337 search results - page 42 / 468
» Extracting Sequences from the Web
Sort
View
LREC
2010
216views Education» more  LREC 2010»
13 years 10 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis
DKE
2006
122views more  DKE 2006»
13 years 9 months ago
Sampling, information extraction and summarisation of Hidden Web databases
Hidden Web databases maintain a collection of specialised documents, which are dynamically generated in response to users' queries. The majority of these documents are genera...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
EJC
2007
13 years 10 months ago
A Personal Web Information/Knowledge Retrieval System
The Web is the richest source of information and knowledge. Unfortunately the current structure of Web pages makes it difficult for users to retrieve the information or knowledge ...
Hao Han, Takehiro Tokuda
CIKM
2010
Springer
13 years 7 months ago
Automatic metadata extraction from multilingual enterprise content
Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization...
Melike Sah, Vincent Wade
CIKM
2005
Springer
14 years 2 months ago
WAM-Miner: in the search of web access motifs from historical web log data
Existing web usage mining techniques focus only on discovering knowledge based on the statistical measures obtained from the static characteristics of web usage data. They do not ...
Qiankun Zhao, Sourav S. Bhowmick, Le Gruenwald