Sciweavers

SIGMOD
2001
ACM
14 years 7 months ago
Automatic Segmentation of Text into Structured Records
In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...
Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...
ANLP
2000
13 years 9 months ago
A Divide-and-Conquer Strategy for Shallow Parsing of German Free Texts
We present a divide-and-conquer strategy based on finite state technology for shallow parsing of real-world German texts. In a first phase only the topological structure of a sente...
Günter Neumann, Christian Braun, Jakub Piskor...
ICDM
2006
IEEE
14 years 1 months ago
Star-Structured High-Order Heterogeneous Data Co-clustering Based on Consistent Information Theory
Heterogeneous object co-clustering has become an important research topic in data mining. In early years of this research, people mainly worked on two types of heterogeneous data ...
Bin Gao, Tie-Yan Liu, Wei-Ying Ma
SDM
2009
SIAM
14 years 4 months ago
Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases
As the amount of textual information grows explosively in various kinds of business systems, it becomes more and more desirable to analyze both structured data records and unstruc...
ChengXiang Zhai, Duo Zhang, Jiawei Han
FLAIRS
2008
13 years 10 months ago
Learning a Probabilistic Model of Event Sequences from Internet Weblog Stories
One of the central problems in building broad-coverage story understanding systems is generating expectations about event sequences, i.e. predicting what happens next given some a...
Mehdi Manshadi, Reid Swanson, Andrew S. Gordon