Sciweavers

1260 search results - page 222 / 252
» Web Mining
DIS
2001
Springer
14 years 2 months ago
Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts
We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa
EMNLP
2008
13 years 11 months ago
One-Class Clustering in the Text Domain
Having seen a news title "Alba denies wedding reports", how do we infer that it is primarily about Jessica Alba, rather than about weddings or reports? We probably reali...
Ron Bekkerman, Koby Crammer
WCE
2007
13 years 11 months ago
LinkGuide: Towards a Better Collection of Hyperlinks in a Website Homepage
A dramatic and continuous increase in the complexity and size of websites on the Internet makes it rather difficult to build such websites with required information to be easily fo...
Ahmad Ammari, Valentina V. Zharkova
WWW
2011
ACM
13 years 5 months ago
From actors, politicians, to CEOs: domain adaptation of relational extractors using a latent relational mapping
We propose a method to adapt an existing relation extraction system to extract new relation types with minimum supervision. Our proposed method comprises two stages: learning a lo...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
EMNLP
2011
12 years 9 months ago
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...