Sciweavers

2137 search results - page 147 / 428
» Extraction of Structural Information from the Web
Sort
View
ACL
2010
15 years 2 months ago
Fine-Grained Tree-to-String Translation Rule Extraction
Tree-to-string translation rules are widely used in linguistically syntax-based statistical machine translation systems. In this paper, we propose to use deep syntactic informatio...
Xianchao Wu, Takuya Matsuzaki, Jun-ichi Tsujii
127
Voted
UIST
2006
ACM
15 years 10 months ago
RecipeSheet: creating, combining and controlling information processors
Many tasks require users to extract information from diverse sources, to edit or process this information locally, and to explore how the end results are affected by changes in th...
Aran Lunzer, Kasper Hornbæk
158
Voted
AAAI
2007
15 years 7 months ago
Template-Independent News Extraction Based on Visual Consistency
Wrapper is a traditional method to extract useful information from Web pages. Most previous works rely on the similarity between HTML tag trees and induced template-dependent wrap...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen
125
Voted
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
15 years 10 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
138
Voted
KDD
2008
ACM
211views Data Mining» more  KDD 2008»
16 years 5 months ago
ArnetMiner: extraction and mining of academic social networks
This paper addresses several key issues in the ArnetMiner system, which aims at extracting and mining academic social networks. Specifically, the system focuses on: 1) Extracting ...
Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zha...