Sciweavers

684 search results - page 11 / 137
» Extracting semantic structure of web documents using content...
ITCC
2005
IEEE
14 years 2 months ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
SIGMOD
2000
ACM
14 years 1 months ago
XTRACT: A System for Extracting Document Type Descriptors from XML Documents
XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...
Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...
AAAI
1994
13 years 10 months ago
Visual Semantics: Extracting Visual Information from Text Accompanying Pictures
This research explores the interaction of textual and photographic information in document understanding. The problem of performing general-purpose vision without a priori knowledge...
Rohini K. Srihari, Debra T. Burhans
MKM
2004
Springer
14 years 2 months ago
A Graph-Based Approach Towards Discerning Inherent Structures in a Digital Library of Formal Mathematics
As the amount of online formal mathematical content grows, for example through active efforts such as the Mathweb [21], MOWGLI [4], Formal Digital Library, or FDL [1], and others, ...
Lori Lorigo, Jon M. Kleinberg, Richard Eaton, Robe...
PVLDB
2010
13 years 7 months ago
TimeTrails: A System for Exploring Spatio-Temporal Information in Documents
Spatial and temporal data have become ubiquitous in many application domains such as the geosciences or life sciences. Sophisticated database management systems are employed to ma...
Jannik Strötgen, Michael Gertz