Heuristics for semantic path search in Wikipedia

Valentina Franzoni, Marco Mencacci, Paolo Mengoni, Alfredo Milani

Research output: Chapter in book/report/conference proceeding › Conference proceeding › peer-review

23 Citations (Scopus)


This paper presents an approach based on Heuristic Semantic Walk (HSW), in which semantic proximity measures between concepts serve as heuristics to guide concept chain search in the collaborative network of Wikipedia, encoding problem-specific knowledge in a problem-independent way. Collaborative information and multimedia repositories on the Web are a domain of increasing relevance, since users cooperatively add tags, labels, comments and hyperlinks to objects, which reflect their semantic relationships, with or without an underlying structure. As with so-called Big Data, path finding in collaborative web repositories must cope with major issues such as large size, high degree of connectivity and the dynamic evolution of online networks, which make classical approaches ineffective. Experiments on a range of semantic measures show that HSW leads to better results than state-of-the-art search methods, and point out the relevant features of suitable proximity measures for the Wikipedia concept network. The extracted semantic paths have many relevant applications, such as query expansion, synthesis of explanatory arguments, and simulation of user navigation.
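As a rough illustration of the idea in the abstract, the sketch below runs a greedy best-first search over a tiny made-up link graph, expanding the concept with the highest proximity to the target first. It is not the authors' HSW algorithm, whose details the abstract does not give. The graph `LINKS`, the Jaccard-style `jaccard_proximity` function and the name `heuristic_walk` are all assumptions introduced here for illustration only.

```python
import heapq

# Toy link graph standing in for Wikipedia's hyperlink network (made-up data).
LINKS = {
    "Cat": ["Mammal", "Pet"],
    "Pet": ["Dog", "Cat"],
    "Mammal": ["Animal", "Dog"],
    "Dog": ["Wolf", "Pet"],
    "Animal": ["Wolf"],
    "Wolf": [],
}


def jaccard_proximity(a, b, links=LINKS):
    """Hypothetical proximity: Jaccard overlap of each concept's
    outgoing links plus the concept itself. A stand-in for whatever
    semantic measure is actually used."""
    sa = set(links.get(a, [])) | {a}
    sb = set(links.get(b, [])) | {b}
    return len(sa & sb) / len(sa | sb)


def heuristic_walk(start, goal, links=LINKS, proximity=jaccard_proximity):
    """Greedy best-first search: always expand the frontier concept
    with the highest proximity to the goal. Returns a path or None."""
    # heapq is a min-heap, so proximity is negated to pop the best first.
    frontier = [(-proximity(start, goal), start, [start])]
    visited = set()
    while frontier:
        _, node, path = heapq.heappop(frontier)
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        for nxt in links.get(node, []):
            if nxt not in visited:
                heapq.heappush(
                    frontier, (-proximity(nxt, goal), nxt, path + [nxt])
                )
    return None


if __name__ == "__main__":
    print(heuristic_walk("Cat", "Wolf"))
```

Swapping `proximity` for another measure changes only the order in which concepts are expanded, which is the sense in which the heuristic carries problem-specific knowledge while the search procedure stays generic.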

Original language: English
Title of host publication: Computational Science and Its Applications, ICCSA 2014
Subtitle of host publication: 14th International Conference, Guimarães, Portugal, June 30 – July 3, 2014, Proceedings, Part VI
Editors: Beniamino Murgante, Sanjay Misra, Ana Maria A. C. Rocha, Carmelo Torre, Jorge Gustavo Rocha, Maria Irene Falcão, David Taniar, Bernady O. Apduhan, Osvaldo Gervasi
Publisher: Springer Verlag
Number of pages: 14
ISBN (Electronic): 9783319091532
ISBN (Print): 9783319091525
Publication status: Published - 2014
Event: 14th International Conference on Computational Science and Its Applications, ICCSA 2014 - Guimarães, Portugal
Duration: 30 Jun 2014 to 3 Jul 2014

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349


Conference: 14th International Conference on Computational Science and Its Applications, ICCSA 2014

Scopus Subject Areas

  • Theoretical Computer Science
  • General Computer Science

User-Defined Keywords

  • collaborative networks
  • heuristics search
  • information retrieval
  • random walk
  • semantic networks
  • semantic similarity measures

