Using SOFM to Improve Web Site Text Content

16 years 2 days ago

Download wi.dii.uchile.cl

We introduce a new method to improve web site text content by identifying the most relevant free text in the web pages. In order to understand the variations in web page text, we collect pages during a period. The page text content is then transformed into a feature vector and is used as input of a clustering algorithm (SOFM), which groups the vectors by common text content. In each cluster, a centroid and its neighbor vectors are extracted. Then using a reverse clustering analysis, the pages represented by each vector are reviewed in order to ﬁnd the similar. Furthermore, the proposed method was tested in a real web site, proving the eﬀectiveness of this approach.

Sebastián A. Ríos, Juan D. Vel&aacut

Real-time Traffic

ICNC 2005 | Page Text | Text Content | Web Page |

claim paper

» Web Site OffLine Structure Reconfiguration A Web User Browsing Analysis

» Using Semantic Information to Improve Transparent Query Caching for Dynamic Content Web Si...

» A hybrid system for conceptbased web usage mining

» Using anchor texts with their hyperlink structure for web search

» TopicBased Audience Metrics for Internet Marketing by Combining Ontologies and Output Page...

» Twostream indexing for spoken web search

» Bilingual web page and site readability assessment

» Do not crawl in the DUST different URLs with similar text

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICNC
Authors	Sebastián A. Ríos, Juan D. Velásquez, Eduardo S. Vera, Hiroshi Yasuda, Terumasa Aoki

Comments (0)

Sciweavers

Using SOFM to Improve Web Site Text Content

ICNC 2005 | Page Text | Text Content | Web Page |

Explore & Download

Productivity Tools

Sciweavers