Sciweavers

483 search results - page 78 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
99
Voted
DGO
2008
126views Education» more  DGO 2008»
15 years 4 months ago
Active learning for e-rulemaking: public comment categorization
We address the e-rulemaking problem of reducing the manual labor required to analyze public comment sets. In current and previous work, for example, text categorization techniques...
Stephen Purpura, Claire Cardie, Jesse Simons
113
Voted
ACL
2006
15 years 3 months ago
A FrameNet-Based Semantic Role Labeler for Swedish
We present a FrameNet-based semantic role labeling system for Swedish text. As training data for the system, we used an annotated corpus that we produced by transferring FrameNet ...
Richard Johansson, Pierre Nugues
130
Voted
KDD
2007
ACM
155views Data Mining» more  KDD 2007»
16 years 2 months ago
Mining templates from search result records of search engines
Metasearch engine, Comparison-shopping and Deep Web crawling applications need to extract search result records enwrapped in result pages returned from search engines in response ...
Hongkun Zhao, Weiyi Meng, Clement T. Yu
120
Voted
KDD
2004
ACM
192views Data Mining» more  KDD 2004»
16 years 2 months ago
Mining and summarizing customer reviews
Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and mor...
Minqing Hu, Bing Liu
126
Voted
KDD
2006
ACM
129views Data Mining» more  KDD 2006»
16 years 2 months ago
Suppressing model overfitting in mining concept-drifting data streams
Mining data streams of changing class distributions is important for real-time business decision support. The stream classifier must evolve to reflect the current class distributi...
Haixun Wang, Jian Yin, Jian Pei, Philip S. Yu, Jef...