Sciweavers

1969 search results - page 360 / 394
» Fuzzy sets in machine learning and data mining
Sort
View
ESWA
2006
110views more  ESWA 2006»
15 years 4 months ago
XKey: A tool for the generation of identification keys
This paper presents the development of XKey, a tool for generating taxonomical identification keys by means of decision tree construction. The tool is based on an XML standard for...
Miguel Delgado Calvo-Flores, Waldo Fajardo Contrer...
SIGIR
2008
ACM
15 years 3 months ago
Compressed collections for simulated crawling
Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...
Alessio Orlandi, Sebastiano Vigna
JMLR
2002
102views more  JMLR 2002»
15 years 3 months ago
Efficient Algorithms for Decision Tree Cross-validation
Cross-validation is a useful and generally applicable technique often employed in machine learning, including decision tree induction. An important disadvantage of straightforward...
Hendrik Blockeel, Jan Struyf
CORR
2011
Springer
185views Education» more  CORR 2011»
14 years 11 months ago
Large-Scale Collective Entity Matching
There have been several recent advancements in Machine Learning community on the Entity Matching (EM) problem. However, their lack of scalability has prevented them from being app...
Vibhor Rastogi, Nilesh N. Dalvi, Minos N. Garofala...
FUIN
2010
268views more  FUIN 2010»
14 years 11 months ago
Boruta - A System for Feature Selection
Machine learning methods are often used to classify objects described by hundreds of attributes; in many applications of this kind a great fraction of attributes may be totally irr...
Miron B. Kursa, Aleksander Jankowski, Witold R. Ru...