Search Sciweavers | Sciweavers

188 search results - page 4 / 38

CIKM
2010
Springer

Using Wikipedia categories for compact representations of chemical documents

15 years 6 months ago

Download www.l3s.de

Today, Web pages are usually accessed using text search engines, whereas documents stored in the deep Web are accessed through domain-specific Web portals. These portals rely on e...

Benjamin Köhncke, Wolf-Tilo Balke

SIGIR
2004
ACM

Locality preserving indexing for document representation

16 years 29 days ago

Download research.microsoft.com

Document representation and indexing is a key problem for document analysis and processing, such as clustering, classification and retrieval. Conventionally, Latent Semantic Index...

Xiaofei He, Deng Cai, Haifeng Liu, Wei-Ying Ma

WWW
2001
ACM

Algorithms and programming models for efficient representation of XML for Internet applications

16 years 8 months ago

Download www10.org

XML is poised to take the World-Wide-Web to the next level of innovation. XML data, large or small, with or without associated schema, will be exchanged between increasing number ...

Neel Sundaresan, Reshad Moussa

ECIR
2008
Springer

Semi-supervised Document Classification with a Mislabeling Error Model

15 years 9 months ago

Download eprints.pascal-network.org

Abstract. This paper investigates a new extension of the Probabilistic Latent Semantic Analysis (PLSA) model [6] for text classification where the training set is partially labeled...

Anastasia Krithara, Massih-Reza Amini, Jean-Michel...

DOCENG
2007
ACM

Elimination of junk document surrogate candidates through pattern recognition

15 years 11 months ago

Download research.cs.tamu.edu

A surrogate is an object that stands for a document and enables navigation to that document. Hypermedia is often represented with textual surrogates, even though studies have show...

Eunyee Koh, Daniel Caruso, Andruid Kerne, Ricardo ...
