Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation

15 years 8 months ago

Download www.lrec-conf.org

Linguistic Data Consortium (LDC) at the University of Pennsylvania has participated as a data provider in a variety of governmentsponsored programs that support development of Human Language Technologies. As the number of projects increases, the quantity and variety of the data LDC produces have increased dramatically in recent years. In this paper, we describe the technical infrastructure, both hardware and software, that LDC has built to support these complex, large-scale linguistic data creation efforts at LDC. As it would not be possible to cover all aspects of LDC's technical infrastructure in one paper, this paper focuses on recent development. We also report on our plans for making our custom-built software resources available to the community as open source software, and introduce an initiative to collaborate with software developers outside LDC. We hope that our approaches and software resources will be useful to the community members who take on similar challenges.

Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonat

Real-time Traffic

Data Ldc | Education | LDC's Technical Infrastructure | Linguistic Data | LREC 2010 |

claim paper

» Shared Linguistic Resources for the Meeting Domain

» Linguistic Resources and Evaluation Techniques for Evaluation of CrossDocument Automatic C...

» The Text Encoding Initiative Anno 2005 An Orientation and Workshop

» The Role of Social Capital in the Creation of Community Wireless Networks

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee, Andrea Mazzucchi

Comments (0)

Sciweavers

Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation

Data Ldc | Education | LDC's Technical Infrastructure | Linguistic Data | LREC 2010 |

Explore & Download

Productivity Tools

Sciweavers