Sciweavers

2677 search results - page 105 / 536
» Extracting Structured Data from Web Pages
Sort
View
WWW
2005
ACM
14 years 3 months ago
An information extraction engine for web discussion forums
In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...
Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...
CSMR
2004
IEEE
14 years 1 months ago
Experimental Results on the Alignment of Multilingual Web Sites
Institutions and companies that are based in countries where the main language is not English typically publish Web sites that offer the same information at least in the local lan...
Filippo Ricca, Paolo Tonella, Emanuele Pianta, Chr...
HT
2005
ACM
14 years 3 months ago
As we may perceive: inferring logical documents from hypertext
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
AWIC
2003
Springer
14 years 3 months ago
Web Page Classification: A Soft Computing Approach
The Internet makes it possible to share and manipulate a vast quantity of information efficiently and effectively, but the rapid and chaotic growth experienced by the Net has gener...
Angela Ribeiro, Víctor Fresno, Maria C. Gar...
KES
2004
Springer
14 years 3 months ago
Intelligent Web Site: Understanding the Visitor Behavior
Abstract. Intelligent web site is a new portal generation, able to improve its structure and content based on the analysis of the user behavior. This paper focuses on modeling the ...
Juan D. Velásquez, Pablo A. Estévez,...