Selecting actions for resource-bounded information extraction using reinforcement learning

14 years 2 months ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and ﬁll the database by extracting speciﬁc information from a large corpus such as the Web, and to do so under resource limitations. We formulate the information gathering task as a series of choices among alternative, resource-consuming actions and use reinforcement learning to select the best action at each time step. We use temporal diﬀerence q-learning method to train the function that selects these actions, and compare it to an online, errordriven algorithm called SampleRank. We present a system that ﬁnds information such as email, job title and department aﬃliation for the faculty at our university, and show that the learning-based approach accomplishes this task eﬃciently under a limited action budget. Our evaluations show that we can obtain 92.4% of the ﬁnal F1, by only using 14.3% of all possible actions. Categories and Subject Descriptors I.2.6 [Computing Methodologies]: Artiﬁcial Inte...

Pallika H. Kanani, Andrew K. McCallum

Real-time Traffic