Sciweavers

129 search results - page 14 / 26
» Combining content extraction heuristics: the CombinE system
Sort
View
IJHIS
2006
95views more  IJHIS 2006»
13 years 8 months ago
A hybrid system for concept-based web usage mining
A web site should be easy to browse by visitors. However, sometimes the reality is quite different. Situations like several unrelated topics in a single web page may lead to confus...
Sebastián A. Ríos, Juan D. Vel&aacut...
ICDE
2000
IEEE
99views Database» more  ICDE 2000»
14 years 10 months ago
XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources
This paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs. By XML-ena...
Ling Liu, Calton Pu, Wei Han
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 6 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
SAINT
2005
IEEE
14 years 2 months ago
Inductive Logic Programming for Structure-Activity Relationship Studies on Large Scale Data
Inductive Logic Programming (ILP) is a combination of inductive learning and first-order logic aiming to learn first-order hypotheses from training examples. ILP has a serious b...
Cholwich Nattee, Sukree Sinthupinyo, Masayuki Numa...
ECML
2007
Springer
14 years 2 months ago
Using Text Mining and Link Analysis for Software Mining
Many data mining techniques are these days in use for ontology learning – text mining, Web mining, graph mining, link analysis, relational data mining, and so on. In the current ...
Miha Grcar, Marko Grobelnik, Dunja Mladenic