Search Sciweavers | Sciweavers

270 search results - page 21 / 54

» Extracting and Modeling the Semantic Information Content of ...

click to vote

WWW
2004
ACM

130views Internet Technology» more WWW 2004»

Managing versions of web documents in a transaction-time web server

14 years 8 months ago

Download www.iw3c2.org

This paper presents a transaction-time HTTP server, called ? Apache that supports document versioning. A document often consists of a main file formatted in HTML or XML and severa...

Curtis E. Dyreson, Hui-ling Lin, Yingxia Wang

claim paper

Read More »

click to vote

KDD
2004
ACM

160views Data Mining» more KDD 2004»

Boosting for Text Classification with Semantic Features

14 years 8 months ago

Download www.aifb.uni-karlsruhe.de

Abstract. Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...

Stephan Bloehdorn, Andreas Hotho

claim paper

Read More »

click to vote

KES
2006
Springer

205views Information Technology» more KES 2006»

Integrated Document Browsing and Data Acquisition for Building Large Ontologies

13 years 7 months ago

Download rewerse.net

Named entities (e.g., "Kofi Annan", "Coca-Cola", "Second World War") are ubiquitous in web pages and other types of document and often provide a simpl...

Felix Weigel, Klaus U. Schulz, Levin Brunner, Edua...

claim paper

Read More »

click to vote

CORR
2007
Springer

117views Education» more CORR 2007»

Dirac Notation, Fock Space and Riemann Metric Tensor in Information Retrieval Models

13 years 7 months ago

Download www.shermanlab.com

Using Dirac Notation as a powerful tool, we investigate the three classical Information Retrieval (IR) models and some their extensions. We show that almost all such models can be...

Xing M. Wang

claim paper

Read More »

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

14 years 2 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

« Prev « First page 21 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers