Sciweavers

1947 search results - page 345 / 390
» On the Automatic Extraction of Data from the Hidden Web
Sort
View
DILS
2006
Springer
14 years 14 days ago
Improving Text Mining with Controlled Natural Language: A Case Study for Protein Interactions
Linking the biomedical literature to other data resources is notoriously difficult and requires text mining. Text mining aims to automatically extract facts from literature. Since ...
Tobias Kuhn, Loïc Royer, Norbert E. Fuchs, Mi...
IJDLS
2010
131views more  IJDLS 2010»
13 years 6 months ago
Annotating Historical Archives of Images
Recent initiatives like the Million Book Project and Google Print Library Project have already archived several million books in digital format, and within a few years a significa...
Xiaoyue Wang, Lexiang Ye, Eamonn J. Keogh, Christi...
RAID
2009
Springer
14 years 3 months ago
PE-Miner: Mining Structural Information to Detect Malicious Executables in Realtime
In this paper, we present an accurate and realtime PE-Miner framework that automatically extracts distinguishing features from portable executables (PE) to detect zero-day (i.e. pr...
M. Zubair Shafiq, S. Momina Tabish, Fauzan Mirza, ...
VLDB
2005
ACM
177views Database» more  VLDB 2005»
14 years 2 months ago
Discovering Large Dense Subgraphs in Massive Graphs
We present a new algorithm for finding large, dense subgraphs in massive graphs. Our algorithm is based on a recursive application of fingerprinting via shingles, and is extreme...
David Gibson, Ravi Kumar, Andrew Tomkins
SIGIR
2005
ACM
14 years 2 months ago
Web-based acquisition of Japanese katakana variants
This paper describes a method of detecting Japanese Katakana variants from a large corpus. Katakana words, which are mainly used as loanwords, cause problems with information retr...
Takeshi Masuyama, Hiroshi Nakagawa