DIS
2001
Springer

93 views, Theoretical Computer Science

Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts

15 years 11 months ago

We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...

Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa

EMNLP
2008

105 views, Natural Language Processing

One-Class Clustering in the Text Domain

15 years 8 months ago

Download www.cs.umass.edu

Having seen a news title "Alba denies wedding reports", how do we infer that it is primarily about Jessica Alba, rather than about weddings or reports? We probably reali...

Ron Bekkerman, Koby Crammer

WCE
2007

118 views, Electrical And Computer Engi...

LinkGuide: Towards a Better Collection of Hyperlinks in a Website Homepage

15 years 8 months ago

Download www.iaeng.org

A dramatic and continuous increase in the complexity and size of websites on the Internet makes it rather difficult to build such websites with required information to be easily fo...

Ahmad Ammari, Valentina V. Zharkova

WWW
2011
ACM

236 views, Internet Technology

From actors, politicians, to CEOs: domain adaptation of relational extractors using a latent relational mapping

15 years 1 month ago

Download www.www2011india.com

We propose a method to adapt an existing relation extraction system to extract new relation types with minimum supervision. Our proposed method comprises two stages: learning a lo...

Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...

EMNLP
2011

164 views, Natural Language Processing

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

14 years 6 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...
