Sciweavers

122 search results - page 21 / 25
» Interpreting the layout of web pages
Sort
View
WCRE
2003
IEEE
14 years 24 days ago
Reconstructing Software Architecture for J2EE Web Applications
In this paper we describe our approach to reconstructing the software architecture of J2EE web applications. We use the Siemens Four Views approach, separating the architecture in...
Minmin Han, Christine Hofmeister, Robert L. Nord
LREC
2010
159views Education» more  LREC 2010»
13 years 6 months ago
The Web Library of Babel: evaluating genre collections
We present experiments in automatic genre classification on web corpora, comparing a wide variety of features on several different genreannotated datasets (HGC, I-EN, KI-04, KRYS...
Serge Sharoff, Zhili Wu, Katja Markert
HT
2005
ACM
14 years 1 months ago
From the writable web to global editability
The technical and competence requirements for writing content on the web is still one of the major factors that widens the gap between authors and readers. Although tools that sup...
Angelo Di Iorio, Fabio Vitali
KDD
1997
ACM
169views Data Mining» more  KDD 1997»
13 years 11 months ago
Learning to Extract Text-Based Information from the World Wide Web
Thereis a wealthof informationto be minedfromnarrative text on the WorldWideWeb.Unfortunately, standard natural language processing (NLP)extraction techniques expect full, grammat...
Stephen Soderland
LREC
2010
172views Education» more  LREC 2010»
13 years 9 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of so...
Ondrej Bojar, Adam Liska, Zdenek Zabokrtský