Search Sciweavers | Sciweavers

829 search results - page 152 / 166

» Minimal document set retrieval

161

click to vote

WWW
2008
ACM

168views Internet Technology» more WWW 2008»

Performance of compressed inverted list caching in search engines

16 years 6 months ago

Download www2008.org

Due to the rapid growth in the size of the web, web search engines are facing enormous performance challenges. The larger engines in particular have to be able to process tens of ...

Jiangong Zhang, Xiaohui Long, Torsten Suel

claim paper

Read More »

159

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

16 years 6 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

155

Voted

WWW
2010
ACM

220views Internet Technology» more WWW 2010»

Not so creepy crawler: easy crawler generation with standard xml queries

16 years 1 months ago

Download www2.pms.ifi.lmu.de

Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...

Franziska von dem Bussche, Klara A. Weiand, Benedi...

claim paper

Read More »

188

click to vote

DASFAA
2008
IEEE

188views Database» more DASFAA 2008»

Summarization Graph Indexing: Beyond Frequent Structure-Based Approach

16 years 16 days ago

Download www.leizou.net

Graph is an important data structure to model complex structural data, such as chemical compounds, proteins, and XML documents. Among many graph data-based applications, sub-graph ...

Lei Zou, Lei Chen 0002, Huaming Zhang, Yansheng Lu...

claim paper

Read More »

164

click to vote

PODS
2008
ACM

158views Database» more PODS 2008»

Local Hoare reasoning about DOM

16 years 6 months ago

Download www.doc.ic.ac.uk

The W3C Document Object Model (DOM) specifies an XML update library. DOM is written in English, and is therefore not compositional and not complete. We provide a first step toward...

Philippa Gardner, Gareth Smith, Mark J. Wheelhouse...

claim paper

Read More »

« Prev « First page 152 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers