Sciweavers

JCDL
2004
ACM

Machine learning for information architecture in a large governmental website

14 years 4 months ago
Machine learning for information architecture in a large governmental website
This paper describes ongoing research into the application of machine learning techniques for improving access to governmental information in complex digital libraries. Under the auspices of the GovStat Project (http://www.ils.unc.edu/govstat), our goal is to identify a small number of semantically valid concepts that adequately spans the intellectual domain of a collection. The goal of this discovery is twofold. First we desire a principled aid to information architects. Second, automatically derived document-concept relationships are a necessary precondition for real-world deployment of many dynamic interfaces. The current study compares concept learning strategies based on three document representations: keywords, titles, and full-text. In statistical and user-based studies, human-created keywords provide significant improvements in concept learning over both title-only and full-text representations.
Miles Efron, Jonathan L. Elsas, Gary Marchionini,
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where JCDL
Authors Miles Efron, Jonathan L. Elsas, Gary Marchionini, Junliang Zhang
Comments (0)