Terapix (archive) - Unsupervised classification / Classification non supervisee

Home ⁄ Science ⁄ EFIGI ⁄ Products / Produits (restricted access) ⁄ Software / Logiciels

The TERAPIX FAQ

CFHTLS Data Releases

Other Data Releases

PI Data Processing

Software products

Instrument related sites

Project participants

Positions at TERAPIX

International collaborations

Funding agencies

National collaborations

CFHTLS-Very Wide

DESCART : Weak Lensing and Cosmology

VIMOS-VLT deep survey at TERAPIX

COSMOS at Terapix (restricted acess)

UltraVISTA at TERAPIX (restricted access)

Visitor information

Restricted Access

Terapix-CFHTLS image statistics and evaluation

Unsupervised classification / Classification non supervisee

using decision tree and PCA / a l'aide d'arbre de decision et d'ACP

by - Updated April 30th, 2007

This experiment is a segmentation of the data space using a decision tree on PCA.

We apply a simple process to our data.

Using g-band images from the EFIGI-PGC-1.3, we first clean and rearrange them.

Then we apply the following steps:

Compute a PCA on the pixels
Compute the mean of the two first PC of the computed basis.
Separate galaxies into 4 classes according to the values of their projection on the two first PC :
- The two values are greater than the means -> subclass 11
- Only the first value is greater than the mean -> subclass 10
- Only the second value is greater than the mean -> subclass 01
- None of the values is greater than the means -> subclass 00
Repeat the same steps for each subclass

We stop when we reach a reasonible depth for the tree (4).

The results of this experiment are shown on the two following plots of the tree (three levels only). The first one depicts the Principal Components whereas the second one is the dispersion of the source on the two first PC.

Finally, we build a tree with 4 levels and 64 leaves (classes). Then, we get the Hubble Type T for each galaxy and draw a plot to study the distribution of T amongst the classes. The following picture shows how some types are gathered together within some classes. However, the result is unusable but encouraging.

Hubble type vs. unsupervised class
64 classes (tree level 4)

Furthermore, when we plot the same data using only 16 classes from a tree with 3 levels, the distribution is quite as precise as shown on the following picture. It suggests than an other method than a PCA should be used to go deeper than the 3rd level. It should also be more interesting to study the distribution of other attributes (bars, B/T ratio, inclination, etc) amongst the unsupervised classes.

Hubble type vs. unsupervised class
16 classes (tree level 3)

In the same section

Shapelets for MATLAB

March 18th, 2001

Data decomposition using PCA / D�composition des donn�es par ACP

First results of decomposition of galaxies using Principal Components Analysis.
Premiers r�sultats de d�composition d'images de galaxies par Analyse en Composantes Principales.

May 18th, 2003

Operateurs morphologiques et Analyse en Composantes Principales

Presentation de la chaine de traitement effectue sur les images de galaxies pour calculer une base de Kerhunen-Loeve.

June 7th, 2004

Data decomposition using PCA / D�composition des donn�es par ACP

Results of decomposition of galaxies using Principal Components Analysis after a cleaning process of images.
R�sultats de d�composition d'images de galaxies par Analyse en Composantes Principales apr�s un nettoyage des images.

June 13th, 2004

Data decomposition using PCA / D�composition des donn�es par ACP

Results of galaxy image decomposition using Principal Components Analysis. This third version completes the cleaning process and uses new data (RC3).
R�sultats de d�composition d'images de galaxies par Analyse en Composantes Principales. Cette troisi�me version compl�te le processus de nettoyage et utilise de nouvelles donn�es (RC3).

June 23rd, 2004

Data decomposition using PCA / D�composition des donn�es par ACP

Results of decomposition of galaxies using Principal Components Analysis with a new data set and new transformations.
R�sultats de d�composition d'images de galaxies par Analyse en Composantes Principales avec un nouveau jeu de donn�es et de nouvelles transformations.

February 16th, 2003

PGC 1.3 g-band pretreated data / Donn�es pr�trait�es du PGC 1.3 en bande g

May 18th, 2005

Cleaning and inpainting / Nettoyage et inpainting

Presentation of the new cleaning method (november 2006). Presentation de la nouvelle methode de nettoyage (novembre 2006).

November 14th, 2005

Profile fitting results / R�sultats de l'ajustement de profils

July 12th, 2003

Site Map - - Contact

© Terapix 2003-2011