dom trees | Sciweavers

ACL
2006

141 views | Computational Linguistics

A DOM Tree Alignment Model for Mining Parallel Data from the Web

15 years 8 months ago

This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...

Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao

ICONIP
2007

192 views | Information Technology

Classification of Documents Based on the Structure of Their DOM Trees

15 years 8 months ago

Download www.peter-geibel.de

In this paper, we discuss kernels that can be applied for the classification of XML documents based on their DOM trees. DOM trees are ordered trees in which every node might be la...

Peter Geibel, Olga Pustylnikov, Alexander Mehler, ...

WCRE
2000
IEEE

115 views | Software Engineering

Towards Portable Source Code Representations using XML

15 years 11 months ago

Download www.swen.uwaterloo.ca

One of the most important issues in source code analysis and software re-engineering is the representation of code text at an abstraction level and form suitable for algorithmic pro...

Evan Mamas, Kostas Kontogiannis

WEBDB
2005
Springer

97 views | Database

Towards a Query Language for Multihierarchical XML: Revisiting XPath

16 years 1 day ago

Download www.eppt.org

In recent years it has been argued that when XML encodings become complex, DOM trees are no longer adequate for query processing. Alternative representations of XML documents, suc...

Ionut Emil Iacob, Alex Dekhtyar

WWW
2010
ACM

201 views | Internet Technology

The paths more taken: matching DOM trees to search logs for accurate webpage clustering

16 years 1 months ago

Download www.cs.cmu.edu

An unsupervised clustering of the webpages on a website is a primary requirement for most wrapper induction and automated data extraction methods. Since page content can vary dras...

Deepayan Chakrabarti, Rupesh R. Mehta
