Sciweavers

898 search results - page 65 / 180
» Making Documents Work: Challenges for Document Understanding
Sort
View
112
Voted
GRC
2010
IEEE
15 years 4 months ago
Searching Digital Political Cartoons
The study of cartoons, manga, and graphic novels is of growing importance to humanity scholars. Managing cartoons for scholarly use presents two challenges: searching and understan...
Yejun Wu
SIGIR
2004
ACM
15 years 9 months ago
On scaling latent semantic indexing for large peer-to-peer systems
The exponential growth of data demands scalable infrastructures capable of indexing and searching rich content such as text, music, and images. A promising direction is to combine...
Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu
CIKM
2010
Springer
15 years 2 months ago
Decomposing background topics from keywords by principal component pursuit
Low-dimensional topic models have been proven very useful for modeling a large corpus of documents that share a relatively small number of topics. Dimensionality reduction tools s...
Kerui Min, Zhengdong Zhang, John Wright, Yi Ma
ICDAR
2005
IEEE
15 years 9 months ago
UPX: A New XML Representation for Annotated Datasets of Online Handwriting Data
This paper introduces our efforts to create UPX, an XML-based successor to the venerable UNIPEN format for the representation of annotated datasets of online handwriting data. In ...
Mudit Agrawal, Kalika Bali, Sriganesh Madhvanath, ...
ICPR
2008
IEEE
16 years 5 months ago
Background variability modeling for statistical layout analysis
Geometric layout analysis plays an important role in document image understanding. Many algorithms known in literature work well on standard document images, achieving high text l...
Faisal Shafait, Joost van Beusekom, Daniel Keysers...