Recent advances in linear classification have shown that for applications such as document classification, the training can be extremely efficient. However, most of the existing t...
We introduce an open data repository and a set of associated visualization and analysis tools. The Pittsburgh Science of Learning Center's "DataShop" has data from tho...
Kenneth R. Koedinger, Kyle Cunningham, Alida Skogs...
Identifying co-varying causal elements in very high-dimensional feature space with internal structures, e.g., a space with as many as millions of linearly ordered features, as one...
This paper presents a new method for building domain-specific web search engines. Previous methods eliminate irrelevant documents from the pages accessed using heuristics based on...
Improving data quality is a time-consuming, labor-intensive, and often domain-specific operation. Existing data repair approaches are either fully automated or not efficient in int...
Mohamed Yakout, Ahmed K. Elmagarmid, Jennifer Nevi...