Predicting Web Information Content



In this paper, we propose a novel method to infer the web user’s Information Content (IC), which is the information that the user must examine to complete her task. In particular, our method learns to predict which words (called IC-words) will be in these essential web pages (IC-pages). We first collected relevant training data using an empirical study, where users explicitly identified which pages were IC-pages. We then examined page-content information from these clickstreams to determine “browsing properties” of each individual word w, i.e., how often w was in the title of a page in each session, or in the anchor to a page that was followed, or a link that was skipped, etc. This training data also labeled each word as an IC-word or not. We used this to train a classifier to identify the browsing properties associated with IC-words. Notice this classifier can predict which words are IC given any page sequence, even if those pages are in web-sites that have not been v...
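As a rough illustration of the “browsing properties” described in the abstract, the Python sketch below computes, for each word in one session, the fraction of visited pages where the word appeared in the title, in a followed anchor, or in a skipped anchor, and then pairs those features with an IC-word label. The `PageVisit` record, the three property names, the whitespace tokenizer, and the per-session normalization are all assumptions made for illustration; the abstract does not specify the actual feature set or classifier, so this should be read as a sketch and not as the authors' implementation.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class PageVisit:
    """Hypothetical record of one page in a user's clickstream."""
    title: str
    followed_anchors: List[str] = field(default_factory=list)
    skipped_anchors: List[str] = field(default_factory=list)


def tokenize(text: str) -> Set[str]:
    """Lowercase, whitespace-split, keep alphabetic tokens (assumed tokenizer)."""
    return {w for w in text.lower().split() if w.isalpha()}


def browsing_properties(session: List[PageVisit]) -> Dict[str, Dict[str, float]]:
    """For each word, the fraction of pages in the session where it occurred
    in the title, in a followed anchor, or in a skipped anchor."""
    if not session:
        return {}
    counts = {
        "in_title": Counter(),
        "in_followed_anchor": Counter(),
        "in_skipped_anchor": Counter(),
    }
    for page in session:
        # Each word is counted at most once per page and property.
        counts["in_title"].update(tokenize(page.title))
        counts["in_followed_anchor"].update(tokenize(" ".join(page.followed_anchors)))
        counts["in_skipped_anchor"].update(tokenize(" ".join(page.skipped_anchors)))
    words = set().union(*counts.values())
    n = len(session)
    return {w: {name: c[w] / n for name, c in counts.items()} for w in words}


def training_rows(
    session: List[PageVisit], ic_page_text: str
) -> List[Tuple[str, Dict[str, float], bool]]:
    """Pair each word's features with a label: does it occur in the IC-page?"""
    ic_words = tokenize(ic_page_text)
    props = browsing_properties(session)
    return [(w, feats, w in ic_words) for w, feats in sorted(props.items())]


session = [
    PageVisit("Cheap flights Paris", ["Paris hotels"], ["car rental"]),
    PageVisit("Paris hotels guide", [], ["flights deals"]),
]
rows = training_rows(session, "Paris hotels near the Louvre")
```

Because the features describe how a word was encountered and not which word it is, rows like these could in principle be fed to any standard classifier, which is consistent with the abstract's remark that the learned model applies to arbitrary page sequences.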

Tingshao Zhu, Russell Greiner, Gerald Häubl, Robert Price


Browsing Properties | IJCAI 2003 | IJCAI 2007 | Pages | Web Users |


» Predicting escalations of medical queries based on web page structure and content

» Content-Based Methods for Predicting Web-Site Demographic Attributes

» Towards Informed Web Content Delivery

» Predicting Network Response Times Using Social Information

» Predicting quality flaws in user-generated content: the case of Wikipedia

» The Missing Link: A Probabilistic Model of Document Content and Hypertext Connectivity

» Models for User Access Patterns on the Web: Semantic Content versus Access History

» Integration of news content into web results

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	IJCAI
Authors	Tingshao Zhu, Russell Greiner, Gerald Häubl, Robert Price
