Sciweavers

» Evaluating methods to rediscover missing web pages from the ...
ICDAR
2003
IEEE
14 years 20 days ago
Lexical Postcorrection of OCR-Results: The Web as a Dynamic Secondary Dictionary?
Postcorrection of OCR-results for text documents is usually based on electronic dictionaries. When scanning texts from a specific thematic area, conventional dictionaries often m...
Christian M. Strohmaier, Christoph Ringlstetter, K...
ASSETS
2009
ACM
14 years 1 month ago
Validity and reliability of web accessibility guidelines
Although widely used, Web Content Accessibility Guidelines (WCAG) have not been studied from the viewpoint of their validity and reliability. WCAG 2.0 explicitly claim that they a...
Giorgio Brajnik
ICDM
2008
IEEE
14 years 1 month ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
ECAI
2006
Springer
13 years 11 months ago
Disambiguating Personal Names on the Web Using Automatically Extracted Key Phrases
When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. Ho...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
ICASSP
2011
IEEE
12 years 11 months ago
Leveraging the Web for automatically generating indexable and browsable keywords for speech files
This paper presents a method for generating indexable and browsable keyword metadata from ASR transcripts by leveraging the Web. Search engine queries are built from an ASR transc...
Kishan Thambiratnam, Gang Li, Sha Meng, Frank Seid...