In this paper, we present InfoScent Evaluator, a tool that automatically evaluates the semantic appropriateness of the descriptions of hyperlinks in web pages. The tool is based o...
Christos Katsanos, Nikolaos K. Tselios, Nikolaos M...
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
This paper describes our efforts to factor in a user’s browsing behavior to automatically evaluate web pages that the user shows interest in, based on user browsing behaviors wh...
The design and reification of Web Information Systems is a complex task, for which many integrated development methods have been proposed. While all these methods ultimately lead ...
MathEdit [23] is a browser-based tool implemented in JavaScript that provides a convenient and intuitive graphical user interface for creating and editing mathematical expressions...
Anchor text has been shown to be effective in ranking[6] and a variety of information retrieval tasks on web pages. Some authors have expanded on anchor text by using the words ar...
This paper presents a grammar-induction based approach to partitioning a Web page into several small pages while each small page fits not only spatially but also logically for mob...
Web pages such as news and shopping sites often use modular layouts. When used effectively this practice allows authors to present clearly large amounts of information in a single...
— With the exponentially growing amount of information available on the Internet, retrieving web pages of interest has become increasingly difficult. While several web page recom...
Tao Zhang, Byungjeong Lee, Sooyong Kang, Hanjoon K...
We propose two methods for constructing automated programs for extraction of information from a class of web pages that are very common and of high practical significance - varia...