Subtopic Structuring for Full-Length Document Access

15 years 10 months ago

Download reference.kfupm.edu.sa

We argue that the advent of large volumes of full-length text, as opposed to short texts tracts and newswire, should be accompanied by corresponding new approaches to information access. Toward this end, we discuss the merits of imposing structure on fulllength text documents; that is, a partition of the text into coherent multi-paragraph units that represent the pattern of subtopics that comprise the text. Using this structure, we can make a distinction between the main topics, which occur throughout the length of the text, and the subtopics, which are of only limited extent. We discuss why recognition of subtopic structure is important and how, to some degree of accuracy, it can be found. We describe a new way of specifying queries on full-length documents and then describe an experiment in which making use of the recognition of local structure achieves better results on a typical information retrieval task than does a standard IR measure.

Marti A. Hearst, Christian Plaunt

Real-time Traffic

Full-length Text | Fulllength Text Documents | Information Management | Short Texts Tracts | SIGIR 1993 |

claim paper

Post Info
More Details (n/a)

Added	10 Aug 2010
Updated	10 Aug 2010
Type	Conference
Year	1993
Where	SIGIR
Authors	Marti A. Hearst, Christian Plaunt

Comments (0)

Sciweavers

Subtopic Structuring for Full-Length Document Access

Full-length Text | Fulllength Text Documents | Information Management | Short Texts Tracts | SIGIR 1993 |

Explore & Download

Productivity Tools

Sciweavers