Sciweavers

1006 search results - page 173 / 202
» A Case Study for Learning from Imbalanced Data Sets
EDBT
2010
ACM
185 views, Database
14 years 3 months ago
Optimizing joins in a map-reduce environment
Implementations of map-reduce are being used to perform many operations on very large data. We examine strategies for joining several relations in the map-reduce environment. Our ...
Foto N. Afrati, Jeffrey D. Ullman
ICDM
2007
IEEE
148 views, Data Mining
14 years 25 days ago
Sample Selection for Maximal Diversity
The problem of selecting a sample subset sufficient to preserve diversity arises in many applications. One example is in the design of recombinant inbred lines (RIL) for genetic a...
Feng Pan, Adam Roberts, Leonard McMillan, David Th...
DSN
2008
IEEE
13 years 10 months ago
Convicting exploitable software vulnerabilities: An efficient input provenance based approach
Software vulnerabilities are the root cause of a wide range of attacks. Existing vulnerability scanning tools are able to produce a set of suspects. However, they often suffer fro...
Zhiqiang Lin, Xiangyu Zhang, Dongyan Xu
ERCIMDL
2010
Springer
135 views, Education
13 years 10 months ago
Automating Logical Preservation for Small Institutions with Hoppla
Preserving digital information over the long term is becoming increasingly important for a large number of institutions. The required expertise and limited tool support discourage especial...
Stephan Strodl, Petar Petrov, Michael Greifeneder,...
SIGCSE
2003
ACM
137 views, Education
14 years 2 months ago
Measuring the effectiveness of robots in teaching computer science
We report the results of a year-long experiment in the use of robots to teach computer science. Our data set compares results from over 800 students on identical tests from both r...
Barry S. Fagin, Laurence D. Merkle