Sciweavers

259 search results - page 16 / 52
» Rules of Thumb in Data Engineering
INTERACT
2003
13 years 10 months ago
The Misapplication of Engineering Models to Business Decisions
The HCI community has long been accused of delivering ‘common sense’, ‘useless’ information, and of being ignorant of business needs. HCI experts are also criticized for fai...
Gitte Lindgaard
VLDB
1998
ACM
14 years 23 days ago
TOPAZ: a Cost-Based, Rule-Driven, Multi-Phase Parallelizer
Currently the key problems of query optimization are extensibility imposed by object-relational technology, as well as query complexity caused by forthcoming applications, such as...
Clara Nippl, Bernhard Mitschang
WWW
2010
ACM
14 years 3 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
SKG
2006
IEEE
14 years 2 months ago
Abox Inference for Large Scale OWL-Lite Data
Abox inference is an important part of OWL data management. When large-scale instance data is involved, it cannot be supported by existing inference engines. In this paper, we p...
Xiaofeng Wang, Jianbo Ou, Xiaofeng Meng, Yan Chen
KDD
2008
ACM
14 years 9 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar