Web page organization and visualization using generative topographic mapping: A pilot study

Xiao-Feng Zhang, Chak-Man Lam, Kwok Wai Cheung

Research output: Chapter in book/report/conference proceedingConference proceeding

Abstract

Automatic Web page organization and visualization is an effective way for foraging information in a Web structure. Web pages contain both text (content) and links (structure), implying that content and structure analysis techniques should be adopted and properly integrated. In this paper, we take the probabilistic model-based approach and extend a topographypreserving model known as Generative Topography Map (GTM). The extended GTM provides a principled way to integrate Web pages and hyperlinks and project them into a low-dimension latent space (2D in our case) for visualization. The proposed extension has been applied to the WebKB dataset. Based on the preliminary results obtained, we proposed several directions for future research.

Original languageEnglish
Title of host publicationSRL2004: ICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields. Accepted Papers
Pages126-131
Number of pages6
Publication statusPublished - Jul 2004
EventICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL 2004) - Banff, Canada
Duration: 4 Jul 20048 Jul 2004

Conference

ConferenceICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL 2004)
Period4/07/048/07/04

User-Defined Keywords

  • Web page organization and visualization
  • Web content and structure analysis
  • Generative Topography Map

Fingerprint

Dive into the research topics of 'Web page organization and visualization using generative topographic mapping: A pilot study'. Together they form a unique fingerprint.

Cite this