Sciweavers

8479 search results - page 65 / 1696
» Data Extraction from Web Data Sources
ITCC
2005
IEEE
14 years 4 months ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
ICDM
2008
IEEE
14 years 5 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
IQ
1996
14 years 12 days ago
Estimating the Quality of Data in Relational Databases
With more and more electronic information sources becoming widely available, the issue of the quality of these, often-competing, sources has become germane. We propose a standard ...
Amihai Motro, Igor Rakov
RWEB
2005
Springer
14 years 4 months ago
Evolution and Reactivity for the Web
The Web and the Semantic Web, as we see it, can be understood as a “living organism” combining autonomously evolving data sources, each of them possibly reacting to e...
José Júlio Alferes, Wolfgang May
DIS
2001
Springer
14 years 3 months ago
Dynamic Aggregation to Support Pattern Discovery: A Case Study with Web Logs
Rapid growth of digital data collections is overwhelming the capabilities of humans to comprehend them without aid. The extraction of useful data from large raw data sets is someth...
Lida Tang, Ben Shneiderman