Ivannikov Institute for System Programming of the RAS

Harnessing Wikipedia for Smart Tags Clustering.


Grineva M., Grinev M., Turdakov D., Velikhov P., Boldakov A.


The quality of the current tagging services can be greatly improved if the service is able to cluster tags by their meaning. Tag clouds clustered by higher level topics enable the users to explore their tag space, which is especially needed when tag clouds become large. We demonstrate TagCluster - a tool for automated tag clustering that harnesses knowledge from Wikipedia about semantic relatedness between tags and names of categories to achieve smart clustering. Our approach shows much better quality of clusters compared to the existing techniques that rely on tag cooccurrence analysis in the tagging service.

Full text of the paper in pdf


In proceedings of International Workshop on “Knowledge Acquisition from the Social Web” KASW'08.

Research Group

Information Systems

All publications during 2008 All publications