Sciweavers

2 search results - page 1 / 1
» An eRulemaking Corpus: Identifying Substantive Issues in Pub...
Sort
View
LREC
2008
82views Education» more  LREC 2008»
13 years 9 months ago
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
We describe the creation of a corpus that supports a real-world hierarchical text categorization task in the domain of electronic rulemaking (eRulemaking). Features of the task an...
Claire Cardie, Cynthia Farina, Matt Rawding, Adil ...
DGO
2006
134views Education» more  DGO 2006»
13 years 9 months ago
Next steps in near-duplicate detection for eRulemaking
Large volume public comment campaigns and web portals that encourage the public to customize form letters produce many near-duplicate documents, which increases processing and sto...
Hui Yang, Jamie Callan, Stuart W. Shulman