Sciweavers

3548 search results - page 64 / 710
» Searching the deep web
Sort
View
WWW
2001
ACM
14 years 10 months ago
Effective Web data extraction with standard XML technologies
We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping." An ideal data extraction proc...
Jussi Myllymaki
JCDL
2006
ACM
101views Education» more  JCDL 2006»
14 years 3 months ago
Search engine driven author disambiguation
In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the sam...
Yee Fan Tan, Min-Yen Kan, Dongwon Lee
ICIW
2009
IEEE
13 years 7 months ago
Using Propagation of Distrust to Find Untrustworthy Web Neighborhoods
Web spamming, the practice of introducing artificial text and links into web pages to affect the results of searches, has been recognized as a major problem for search engines. Bu...
Panagiotis Takis Metaxas
WWW
2006
ACM
14 years 10 months ago
WebKhoj: Indian language IR from multiple character encodings
Today web search engines provide the easiest way to reach information on the web. In this scenario, more than 95% of Indian language content on the web is not searchable due to mu...
Prasad Pingali, Jagadeesh Jagarlamudi, Vasudeva Va...
MIR
2005
ACM
137views Multimedia» more  MIR 2005»
14 years 3 months ago
Similarity space projection for web image search and annotation
Web image search has been explored and developed in academic as well as commercial areas for over a decade. To measure the similarity between Web images and user queries, most of ...
Ying Liu, Tao Qin, Tie-Yan Liu, Lei Zhang, Wei-Yin...