Finding Thai Web Pages in Foreign Web Spaces



As the Web is increasingly recognized as a culturally valuable social artifact, many nations are working to create national Web archives for long-term preservation. However, because the Web is borderless, gathering information for a specific nation is challenging. This paper proposes language specific web crawling (LSWC) as a method of creating Web archives for countries with distinct linguistic identities, such as Thailand. The LSWC strategy for selectively gathering Thai web pages from virtually anywhere on the Web is derived from static analyses of the Thai Web graph. The strategy is then evaluated on a crawling simulator with a large dataset.

Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kitsuregawa


Database | ICDE 2006 | Language Specific Web | National Web Archives | Web Archives |


» Cultural differences on attention and perceived usability Investigating color combinations...

» Simulating culture An experiment using a multiuser virtual environment

» Products review page projection using the number of evaluating expressions

» Reverse mapping of referral links from storage hierarchy for Web documents

» Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents

» Traffic in Social Media II Modeling Bursty Popularity

» How do people find information on a familiar website

» P2P Authority Analysis for Social Communities

Post Info

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICDE
Authors	Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kitsuregawa
