In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
In this paper, we describe how ORM is extended, and combined with Concurrent Task Trees (CTT) to model the content as well as the functionality of a web system in the web design me...
In this paper, we focus on the use of three different techniques that support automatic derivation of video content from raw video data, namely, a spatio-temporal rule-based metho...
Machine-understandable metadata is providing the foundation for next-generationframeworks that enable automated construction ofserver-side Java applications. Suchapplications are ...