JCDL
2011
ACM

How much of the web is archived?

The Memento Project’s archive access additions to HTTP have enabled the development of new user interfaces for web archive access. After experiencing this web time travel, the inevitable question that comes to mind is “How much of the Web is archived?” This question is studied by approximating the Web via sampling URIs from DMOZ, Delicious, Bitly, and search engine indexes and measuring the number of archived copies available in various public web archives. The results indicate that 35%–90% of URIs have at least one archived copy, 17%–49% have two to five copies, 1%–8% have six to ten copies, and 8%–63% have at least ten copies. The number of
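The bucketed percentages in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the bucket labels, and the sample URIs and counts are hypothetical. The abstract's "six to ten" and "at least ten" ranges both include ten, so this sketch assumes that exactly ten copies falls in the "6-10" bucket.

```python
from collections import Counter

# Bucket labels are an assumption modeled on the ranges quoted in the abstract.
BUCKETS = ["0", "1", "2-5", "6-10", ">10"]


def bucket(copies: int) -> str:
    """Map a count of archived copies to a bucket label."""
    if copies < 0:
        raise ValueError("copy count cannot be negative")
    if copies == 0:
        return "0"
    if copies == 1:
        return "1"
    if copies <= 5:
        return "2-5"
    if copies <= 10:  # assumption: exactly ten copies counts as "6-10"
        return "6-10"
    return ">10"


def bucket_shares(copy_counts: dict[str, int]) -> dict[str, float]:
    """Return the fraction of sampled URIs that falls in each bucket."""
    if not copy_counts:
        raise ValueError("sample is empty")
    tally = Counter(bucket(n) for n in copy_counts.values())
    total = len(copy_counts)
    return {label: tally.get(label, 0) / total for label in BUCKETS}


# Hypothetical sample: URI -> number of archived copies found.
sample = {
    "http://example.com/a": 0,
    "http://example.com/b": 1,
    "http://example.com/c": 3,
    "http://example.com/d": 12,
}
shares = bucket_shares(sample)
```

In a real measurement the counts would come from querying the archives for each sampled URI; here they are hard-coded so the sketch stays self-contained.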
Added 15 Sep 2011
Updated 15 Sep 2011
Type Conference
Year 2011
Where JCDL
Authors Scott Ainsworth, Ahmed Alsum, Hany SalahEldeen, Michele C. Weigle, Michael L. Nelson