Sciweavers

» Classifying the Hungarian Web
ECAI
2004
Springer
14 years 1 month ago
Stacked Generalization for Information Extraction
This paper defines a new stacked generalization framework in the context of information extraction (IE) from online sources. The proposed setting removes the constraint of apply...
Georgios Sigletos, Georgios Paliouras, Constantine...
CIKM
2009
Springer
13 years 11 months ago
Ensembles in adversarial classification for spam
The standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performanc...
Deepak Chinavle, Pranam Kolari, Tim Oates, Tim Fin...
CLEF
2010
Springer
13 years 8 months ago
It Was Easy, when Apples and Blackberries Were only Fruits
Ambiguities in company names are omnipresent. This is not accidental: companies deliberately choose ambiguous brand names as part of their marketing and branding strategy. This pro...
Surender Reddy Yerva, Zoltán Miklós,...
WEBI
2004
Springer
14 years 1 month ago
Co-training with a Single Natural Feature Set Applied to Email Classification
When dealing with information overload from the Internet, such as the classification of Web pages and the filtering of email spam, a new technique called co-training has been shown...
Jason Chan, Irena Koprinska, Josiah Poon
ER
1999
Springer
13 years 12 months ago
Semantically Accessing Documents Using Conceptual Model Descriptions
When publishing documents on the web, the user needs to describe and classify her documents for the benefit of later retrieval and use. This paper presents an approach to semanti...
Terje Brasethvik, Jon Atle Gulla