Sciweavers

708 search results - page 53 / 142
» Identifying Content Blocks from Web Documents
Sort
View
ICDAR
2009
IEEE
14 years 3 months ago
Indian Multi-Script Full Pin-code String Recognition for Postal Automation
Under three-language formula, the destination address block of postal document of an Indian state is generally written in three languages: English, Hindi and the State official la...
Umapada Pal, Rami Kumar Roy, Kaushik Roy, Fumitaka...
WCW
2004
Springer
14 years 2 months ago
Overhaul: Extending HTTP to Combat Flash Crowds
The increasing use of the web for serving http content, for database transactions, etc. can place heavy stress on servers. Flash crowds can occur at a server when there is a burst ...
Jay A. Patel, Indranil Gupta
BMCBI
2005
122views more  BMCBI 2005»
13 years 8 months ago
Finding genomic ontology terms in text using evidence content
Background: The development of text mining systems that annotate biological entities with their properties using scientific literature is an important recent research topic. These...
Francisco M. Couto, Mário J. Silva, Pedro C...
AAAI
2008
13 years 11 months ago
Extracting Relevant Snippets for Web Navigation
Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extr...
Qing Li, K. Selçuk Candan, Qi Yan
DIS
2001
Springer
14 years 1 months ago
Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts
We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa