There is a wealth of publicly available data in today's Internet (e.g., Web pages, government law texts, public statistics, media archives, etc.) that can be exploited by large and small companies in various business domains. Storing, processing, and querying such ever increasing amounts of data is becoming a major challenge, and having the capability to do so is a strong asset for the few big com ...