Search Sciweavers | Sciweavers

542 search results - page 45 / 109

» Learning author-topic models from text corpora

DAS
2010
Springer

251 views | Document Analysis

Overlapped text segmentation using Markov random field and aggregation

13 years 9 months ago

Download www.visionopen.com

Separating machine printed text and handwriting from overlapping text is a challenging problem in the document analysis field and no reliable algorithms have been developed thus f...

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...

ACL
2009

167 views | Computational Linguistics

Mining Bilingual Data from the Web with Adaptively Learnt Patterns

13 years 5 months ago

Download www.aclweb.org

Mining bilingual data (including bilingual sentences and terms) from the Web can benefit many NLP applications, such as machine translation and cross language information retrie...

Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, ...

AAAI
2010

198 views | Intelligent Agents

A Topic Model for Linked Documents and Update Rules for its Estimation

13 years 5 months ago

Download www.nec-labs.com

The latent topic model plays an important role in the unsupervised learning from a corpus, which provides a probabilistic interpretation of the corpus in terms of the latent topic...

Zhen Guo, Shenghuo Zhu, Zhongfei Zhang, Yun Chi, Y...

LREC
2008

160 views | Education

Automatic Extraction of Textual Elements from News Web Pages

13 years 9 months ago

Download www.lrec-conf.org

In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...

Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany

SIGMOD
2008
ACM

123 views | Database

SchemaScope: a system for inferring and cleaning XML schemas

14 years 7 months ago

Download alpha.uhasselt.be

We present SchemaScope, a system to derive Document Type Definitions and XML Schemas from corpora of sample XML documents. Tools are provided to visualize, clean, and refine exist...

Geert Jan Bex, Frank Neven, Stijn Vansummeren
