Sciweavers

2677 search results - page 114 / 536
» Extracting Structured Data from Web Pages
PAKDD
2009
ACM
116 views · Data Mining
14 years 4 months ago
Scalable Web Mining with Newistic
Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...
Ovidiu Dan, Horatiu Mocian
ACL
2006
13 years 11 months ago
URES : an Unsupervised Web Relation Extraction System
Most information extraction systems either use hand-written extraction patterns or use a machine learning algorithm that is trained on a manually annotated corpus. Both of these a...
Binyamin Rosenfeld, Ronen Feldman
WWW
2006
ACM
14 years 10 months ago
Detecting semantic cloaking on the web
By supplying different versions of a web page to search engines and to browsers, a content provider attempts to cloak the real content from the view of the search engine. Semantic...
Baoning Wu, Brian D. Davison
BMCBI
2010
154 views
13 years 10 months ago
EnvMine: A text-mining system for the automatic extraction of contextual information
Background: For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of thei...
Javier Tamames, Victor de Lorenzo
ESWS
2008
Springer
13 years 11 months ago
Wikipedia Link Structure and Text Mining for Semantic Relation Extraction
Abstract. Wikipedia, a collaborative Wiki-based encyclopedia, has become a huge phenomenon among Internet users. It covers a huge number of concepts of various fields such as Arts, G...
Kotaro Nakayama, Takahiro Hara, Shojiro Nishio