A large time-aware web graph
- 30 November 2008
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGIR Forum
- Vol. 42 (2), 33-38
- https://doi.org/10.1145/1480506.1480511
Abstract
We describe the techniques developed to gather and distribute in a highly compressed, yet accessible, form a series of twelve snapshot of the .uk web domain. Ad hoc compression techniques made it possible to store the twelve snapshots using just 1:9 bits per link, with constant-time access to temporal information. Our collection makes it possible to study the temporal evolution link-based scores (e.g., PageRank), the growth of online communities, and in general time-dependent phenomena related to the link structure.Keywords
This publication has 4 references indexed in Scilit:
- The webgraph framework IPublished by Association for Computing Machinery (ACM) ,2004
- UbiCrawler: a scalable fully distributed Web crawlerSoftware: Practice and Experience, 2004
- Efficient decoding of prefix codesCommunications of the ACM, 1990
- Efficient Storage and Retrieval by Content and Address of Static FilesJournal of the ACM, 1974