To understand what is required to support new innovative Internet applications, a solid understanding of Internet content characteristics (size, distribution, form, structure, evolution, dynamic) is necessary. The LAWA project (LAWA - Longitudinal Analytics of Web Archive data) will build an Internet-based experimental testbed for large-scale data analytics. Its emphasis is on developing a sustain ...