Abstract With the availability of multi-object spectrometers and the design and operation of some large scale sky surveys, the issue of how to deal with enormous quantities of spectral data efficiently and accurately is becoming more and more important. This work investigates the classification problem of stellar spectra under the assumption that there is no perfect absolute flux calibration, for example, when considering spectra from the Guo Shou Jing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope, LAMOST). The proposed scheme consists of the following two procedures: Firstly, a spectrum is normalized based on a 17th order polynomial fitting; secondly, a random forest (RF) is utilized to classify the stellar spectra. Experiments on four stellar spectral libraries show that the RF has good classification performance. This work also studied the spectral feature evaluation problem based on RF. The evaluation is helpful in understanding the results of the proposed stellar classification scheme and exploring its potential improvements in the future.
Keywords methods: statistical — methods: data analysis — virtual observatory tools
It accepts original submissions from all over the world and is internationally published and distributed by IOP