Sciweavers

SDM
2010
SIAM
144views Data Mining» more  SDM 2010»
13 years 10 months ago
A Probabilistic Framework to Learn from Multiple Annotators with Time-Varying Accuracy
This paper addresses the challenging problem of learning from multiple annotators whose labeling accuracy (reliability) differs and varies over time. We propose a framework based ...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider
SDM
2010
SIAM
191views Data Mining» more  SDM 2010»
13 years 10 months ago
Active Ordering of Interactive Prediction Tasks
Many applications involve a set of prediction tasks that must be accomplished sequentially through user interaction. If the tasks are interdependent, the order in which they are p...
Abhimanyu Lad, Yiming Yang
SDM
2010
SIAM
153views Data Mining» more  SDM 2010»
13 years 10 months ago
Reconstruction from Randomized Graph via Low Rank Approximation
The privacy concerns associated with data analysis over social networks have spurred recent research on privacypreserving social network analysis, particularly on privacypreservin...
Leting Wu, Xiaowei Ying, Xintao Wu
SDM
2010
SIAM
130views Data Mining» more  SDM 2010»
13 years 10 months ago
Learning Compressible Models
Yi Zhang 0010, Jeff Schneider, Artur Dubrawski
SDM
2010
SIAM
146views Data Mining» more  SDM 2010»
13 years 10 months ago
Towards Finding Valuable Topics
Enterprises depend on their information workers finding valuable information to be productive. However, existing enterprise search and recommendation systems can exploit few studi...
Zhen Wen, Ching-Yung Lin
SDM
2010
SIAM
184views Data Mining» more  SDM 2010»
13 years 10 months ago
A Robust Decision Tree Algorithm for Imbalanced Data Sets
We propose a new decision tree algorithm, Class Confidence Proportion Decision Tree (CCPDT), which is robust and insensitive to class distribution and generates rules which are st...
Wei Liu, Sanjay Chawla, David A. Cieslak, Nitesh V...
EDM
2008
113views Data Mining» more  EDM 2008»
13 years 10 months ago
Mining and Visualizing Visited Trails in Web-Based Educational Systems
A data mining and visualization tool for the discovery of student trails in web-based educational systems is presented and described. The tool uses graphs to visualize results, all...
Cristóbal Romero, Sergio Gutiérrez S...
EDM
2008
97views Data Mining» more  EDM 2008»
13 years 10 months ago
Using Item-type Performance Covariance to Improve the Skill Model of an Existing Tutor
Using data from an existing pre-algebra computer-based tutor, we analyzed the covariance of item-types with the goal of describing a more effective way to assign skill labels to it...
Philip I. Pavlik, Hao Cen, Lili Wu, Kenneth R. Koe...
EDM
2008
96views Data Mining» more  EDM 2008»
13 years 10 months ago
Labeling Student Behavior Faster and More Precisely with Text Replays
We present text replays, a method for generating labels that can be used to train classifiers of student behavior. We use this method to label data as to whether students are gamin...
Ryan Shaun Joazeiro de Baker, Adriana M. J. B. de ...
EDM
2008
141views Data Mining» more  EDM 2008»
13 years 10 months ago
An Open Repository and analysis tools for fine-grained, longitudinal learner data
We introduce an open data repository and set of associated visualization and analysis tools. The Pittsburgh Science of Learning Center's "DataShop" has data from tho...
Kenneth R. Koedinger, Kyle Cunningham, Alida Skogs...