Sciweavers

52 search results - page 6 / 11
» Mining the Web for lists of Named Entities
SIGIR
2009
ACM
14 years 1 month ago
Web derived pronunciations for spoken term detection
Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of application...
Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan...
JCDL
2004
ACM
14 years 26 days ago
Finding authoritative people from the web
Today’s web is so huge and diverse that it arguably reflects the real world. For this reason, searching the web is a promising approach to find things in the real world. This ...
Masanori Harada, Shin-ya Sato, Kazuhiro Kazama
PAKDD
2009
ACM
14 years 2 months ago
Scalable Web Mining with Newistic
Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...
Ovidiu Dan, Horatiu Mocian
KDD
2002
ACM
14 years 7 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
ACL
2006
13 years 8 months ago
Novel Association Measures Using Web Search with Double Checking
A web search with double checking model is proposed to explore the web as a live corpus. Five association measures including variants of Dice, Overlap Ratio, Jaccard, and Cosine, ...
Hsin-Hsi Chen, Ming-Shun Lin, Yu-Chuan Wei