Online monitoring of data streams poses a challenge in many data-centric applications, such as telecommunications networks, traffic management, trend-related analysis, webclick st...
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Processing data streams with Quality-ofService (QoS) guarantees is an emerging area in existing streaming applications. Although it is possible to negotiate the result quality and...
Researchers in the data mining area frequently have to spend significant portion of their time on preprocessing the data in order to apply their algorithms to real-world datasets...
Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotr...
: Documents such as spreadsheets are easy to create, edit, and exchange. However, their use causes a set of well known problems such as poor data quality, lack of multi user suppor...