Search Sciweavers | Sciweavers

2677 search results - page 20 / 536

» Extracting Structured Data from Web Pages

165

click to vote

DL
2000
Springer

351views Digital Library» more DL 2000»

Acrophile: an automated acronym extractor and server

15 years 11 months ago

Download ir.iit.edu

We implemented a web server for acronym and abbreviation lookup, containing a collection of acronyms and their expansions gathered from a large number of web pages by a heuristic ...

Leah S. Larkey, Paul Ogilvie, M. Andrew Price, Bre...

claim paper

Read More »

188

click to vote

WWW
2009
ACM

213views Internet Technology» more WWW 2009»

Extracting article text from the web with maximum subsequence segmentation

16 years 7 months ago

Download www2009.org

Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...

Jeff Pasternack, Dan Roth

claim paper

Read More »

175

click to vote

ICDM
2003
IEEE

225views Data Mining» more ICDM 2003»

Combining the web content and usage mining to understand the visitor behavior in a web site

16 years 2 days ago

Download wi.dii.uchile.cl

A web site is a semi structured collection of different kinds of data, whose motivation is show relevant information to visitor and by this way capture her/his attention. Understa...

Juan D. Velásquez, Hiroshi Yasuda, Terumasa...

claim paper

Read More »

196

click to vote

NAACL
2010

182views Computational Linguistics» more NAACL 2010»

Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment

15 years 4 months ago

Download research.microsoft.com

The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...

Jason R. Smith, Chris Quirk, Kristina Toutanova

claim paper

Read More »

202

click to vote

SOCIALCOM
2010

175views Security Privacy» more SOCIALCOM 2010»

Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph

15 years 4 months ago

Download www.cis.temple.edu

A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search...

Harish Sethu, Alexander Yates

claim paper

Read More »

« Prev « First page 20 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers