Sciweavers

708 search results - page 81 / 142
» Identifying Content Blocks from Web Documents
SEMWEB
2001
Springer
14 years 2 days ago
Conceptual Open Hypermedia = The Semantic Web?
The Semantic Web is still a web, a collection of linked nodes. Navigation of links is currently, and will remain for humans if not machines, a key mechanism for exploring the spac...
Carole A. Goble, Sean Bechhofer, Les Carr, David D...
ICEIS
2009
IEEE
14 years 2 months ago
Semi-supervised Information Extraction from Variable-length Web-page Lists
We propose two methods for constructing automated programs for extraction of information from a class of web pages that are very common and of high practical significance - varia...
Daniel Nikovski, Alan Esenther, Akihiro Baba
WWW
2009
ACM
14 years 8 months ago
Data quality in web archiving
Web archives preserve the history of Web sites and have high long-term value for media and business analysts. Such archives are maintained by periodically re-crawling entire Web s...
Marc Spaniol, Dimitar Denev, Arturas Mazeika, Gerh...
DEXAW
2010
IEEE
13 years 4 months ago
A Comparison of Stylometric and Lexical Features for Web Genre Classification and Emotion Classification in Blogs
In the blogosphere, the amount of digital content is expanding and for search engines, new challenges have been imposed. Due to the changing information need, automatic methods are...
Elisabeth Lex, Andreas Juffinger, Michael Granitze...
WSE
2003
IEEE
14 years 27 days ago
Resolution of Static Clones in Dynamic Web Pages
Cloning is extremely likely to occur in web sites, much more so than in other software. While some clones exist for valid reasons, or are too small to eliminate, cloning percentag...
Nikita Synytskyy, James R. Cordy, Thomas R. Dean