The explosive growth of Web content in volume, velocity and variety demands new approaches to content analytics that address the large-scale analysis and interpretation of heterogeneous data sets originating in different media, human languages, jurisdictions, etc. Among these, language diversity in particular has become a ubiquitous aspect of the Web in light of increasing globalizati ...