Data-mining massive time series astronomical data: Challenges, problems and solutions

M. K. Ng*, Z. Huang

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

23 Citations (Scopus)

Abstract

In this paper we present some initial results of a project which uses data-mining techniques to search for evidence of massive compact halo objects (MACHOs) from very large time series database. MACHOs are the proposed materials that probably make the `dark matter' surrounding our own and other galaxies. It was suggested that MACHOs may be detected through the gravitational microlensing effect which can be identified from the light curves of background stars. The objective of this project is two-fold, namely, (i) identification of new classes of variable stars and (ii) detection of microlensing events. In this paper, we present the major characteristics of the time series astronomical data, data preprocessing techniques to process these time series, and some domain-specific techniques to separate candidate variable stars from the nonvariant ones. We discuss the use of the Fourier model to represent the time series and the k-means based clustering method to classify variable stars.

Original languageEnglish
Pages (from-to)545-556
Number of pages12
JournalInformation and Software Technology
Volume41
Issue number9
DOIs
Publication statusPublished - 25 Jun 1999

Scopus Subject Areas

  • Software
  • Information Systems
  • Computer Science Applications

User-Defined Keywords

  • Data mining
  • Time series
  • Astronomical data
  • Data preprocessing techniques

Fingerprint

Dive into the research topics of 'Data-mining massive time series astronomical data: Challenges, problems and solutions'. Together they form a unique fingerprint.

Cite this