Scalable and Interpretable Data Representation for High-Dimensional, Complex Data

9 years 1 days ago

Download people.csail.mit.edu

The majority of machine learning research has been focused on building models and inference techniques with sound mathematical properties and cutting edge performance. Little attention has been devoted to the development of data representation that can be used to improve a user’s ability to interpret the data and machine learning models to solve real-world problems. In this paper, we quantitatively and qualitatively evaluate an efﬁcient, accurate and scalable feature-compression method using latent Dirichlet allocation for discrete data. This representation can effectively communicate the characteristics of high-dimensional, complex data points. We show that the improvement of a user’s interpretability through the use of a topic modeling-based compression technique is statistically signiﬁcant, according to a number of metrics, when compared with other representations. Also, we ﬁnd that this representation is scalable — it maintains alignment with human classiﬁcation accu...

Been Kim, Kayur Patel, Afshin Rostamizadeh, Julie

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Been Kim, Kayur Patel, Afshin Rostamizadeh, Julie A. Shah

Comments (0)

Sciweavers

Scalable and Interpretable Data Representation for High-Dimensional, Complex Data

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers