Kolmogorov-Arnold Networks (KANs) introduce a novel neural network architecture by replacing fixed activation functions on nodes with learnable activation functions on the edges between nodes. This approach offers:
- Enhanced performance relative to traditional neural networks (NNs).
- Interpretability: KAN models can be converted into analytic expressions, similar to symbolic regression (SR).
This project explores KANs for photometric redshift extraction, a critical challenge in astronomy: inferring the redshift of astronomical objects from broad-band photometry alone. The mapping from photometric data to redshift is highly complex and non-linear, and traditional approaches such as Multi-Layer Perceptrons (MLPs) can struggle with this complexity. The flexibility and interpretability of KANs make them a promising alternative.
The key question: are KANs more efficient, interpretable, and accurate than traditional MLPs or Symbolic Regression?
The main objective of this project is to evaluate the practicality and utility of KANs for photometric redshift prediction by comparing three models:
- Multi-Layer Perceptron (MLP)
- Symbolic Regression (SR)
- Kolmogorov-Arnold Network (KAN)
Performance is measured using two key metrics: R² (coefficient of determination) and Mean Squared Error (MSE).
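For reference, these are the standard definitions, with $z_i$ the true redshift, $\hat{z}_i$ the model prediction, and $\bar{z}$ the mean true redshift:

$$\mathrm{MSE} = \frac{1}{N}\sum_{i=1}^{N}\left(z_i - \hat{z}_i\right)^2, \qquad R^2 = 1 - \frac{\sum_{i=1}^{N}\left(z_i - \hat{z}_i\right)^2}{\sum_{i=1}^{N}\left(z_i - \bar{z}\right)^2}.$$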
The data used for this project come from the Sloan Digital Sky Survey, Data Release 18 (SDSS DR18):
- A total of 19,870 observations.
- Each feature's distribution is approximately Gaussian (normal).
- The data are strongly concentrated at low redshift (0 ≤ z ≤ 0.2), leaving the dataset unbalanced.
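A minimal sketch of how such a catalog might be loaded and inspected; the file name and column names are assumptions (the five features are assumed to be the SDSS u, g, r, i, z broad-band magnitudes), not the project's actual ones:

```python
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical catalog file and column names.
df = pd.read_csv("sdss_dr18_photometry.csv")
features = ["u", "g", "r", "i", "z_mag"]

print(len(df))                    # expected: 19,870 observations
print(df["redshift"].describe())  # most objects should fall in 0 <= z <= 0.2
df["redshift"].hist(bins=50)      # visualize the unbalanced distribution
plt.show()
```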
MLP:
- Structure:
  - Input: 5 features
  - Hidden layers: 1st layer with 64 neurons, 2nd layer with 32 neurons
  - Output: 1 neuron (redshift prediction)
  - Activation function: ReLU (hidden layers)
- Training:
  - Loss function: Mean Squared Error (MSE)
  - Optimizer: Adam (learning rate: 0.001)
  - Hardware: GPU (if available)
- Results:
  - R²: 0.44
  - MSE: 0.0024
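A minimal PyTorch sketch of this architecture; class and variable names, and any training-loop details beyond those listed above, are assumptions:

```python
import torch
import torch.nn as nn

# The MLP described above: 5 photometric features -> 64 -> 32 -> 1 redshift.
class RedshiftMLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(5, 64), nn.ReLU(),
            nn.Linear(64, 32), nn.ReLU(),
            nn.Linear(32, 1),
        )

    def forward(self, x):
        return self.net(x)

device = "cuda" if torch.cuda.is_available() else "cpu"   # GPU if available
model = RedshiftMLP().to(device)
criterion = nn.MSELoss()                                   # MSE loss
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # Adam, lr = 0.001
```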
Symbolic Regression (SR):
- Tool: PySR (symbolic regression for Python/Julia)
- Data handling:
  - Redshift binning to mitigate unbalanced data
  - Uniform normalization of features to [-1, 1]
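A sketch of what this preprocessing might look like; the bin count, samples per bin, and array names are assumptions:

```python
import numpy as np

def normalize_features(X):
    # Min-max scale each feature column to [-1, 1].
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    return 2 * (X - x_min) / (x_max - x_min) - 1

def rebalance_by_redshift(X, z, n_bins=20, per_bin=500, seed=0):
    # Bin the redshifts and draw at most `per_bin` objects per bin,
    # flattening the excess of low-redshift sources into a more
    # uniform distribution.
    rng = np.random.default_rng(seed)
    edges = np.linspace(z.min(), z.max(), n_bins + 1)
    keep = []
    for b in range(n_bins):
        idx = np.where((z >= edges[b]) & (z < edges[b + 1]))[0]
        if idx.size:
            keep.append(rng.choice(idx, size=min(per_bin, idx.size), replace=False))
    keep = np.concatenate(keep)
    return X[keep], z[keep]
```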
- Results:
  - Non-uniform data:
    - R²: 0.41
    - MSE: 0.0022
  - Uniform data:
    - R²: 0.58
    - MSE: 0.0130
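A minimal PySR sketch; the operator set and iteration count are illustrative assumptions, not the project's actual configuration (PySR's default loss is already the squared error used here):

```python
from pysr import PySRRegressor

# Illustrative PySR configuration.
model = PySRRegressor(
    niterations=100,
    binary_operators=["+", "-", "*", "/"],
    unary_operators=["exp", "log", "square"],
)
model.fit(X_train, z_train)  # X_train: features scaled to [-1, 1]
print(model.sympy())         # best analytic expression found
```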
KAN:
- Data: non-uniform redshift distribution
- Structure:
  - Width: [5, 2, 1] (input, hidden, and output layers)
  - Grid size: 3, k-value: 3
  - Trainable parameters: 144
- Training:
  - Loss function: Mean Squared Error (MSE)
  - Optimizer: AdamW (learning rate: 0.001)
  - Learning-rate scheduler
- Results:
  - R²: 0.42
  - MSE: 0.0022
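A minimal sketch of this setup, assuming the pykan package (the KAN implementation is not named above) and an illustrative step-decay scheduler and epoch count:

```python
import torch
from kan import KAN  # pykan; assumed implementation

# KAN with width [5, 2, 1], grid size 3, spline order k = 3, as above.
model = KAN(width=[5, 2, 1], grid=3, k=3)
criterion = torch.nn.MSELoss()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)  # AdamW, lr = 0.001
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=50, gamma=0.5)

for epoch in range(200):                # epoch count is illustrative
    optimizer.zero_grad()
    pred = model(X_train)               # X_train: (N, 5) tensor
    loss = criterion(pred, z_train)     # z_train: (N, 1) tensor
    loss.backward()
    optimizer.step()
    scheduler.step()
```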
| Model | R² | Best MSE |
|-------|------|----------|
| MLP | 0.44 | 0.0024 |
| SR | 0.41 | 0.0022 |
| KAN | 0.42 | 0.0022 |
In addressing the central question of whether Kolmogorov-Arnold Networks (KANs) are more efficient, interpretable, and accurate than traditional Multi-Layer Perceptrons (MLPs) for photometric redshift extraction, our results indicate that while the KAN model shows reasonable predictive capability, there is room for improvement. The KAN, SR, and MLP models perform similarly: all achieve validation MSE values below 10⁻², with R² scores around 0.4.
This work opens up multiple avenues for future exploration:
- Address the challenges posed by unbalanced data.
- Incorporate higher-quality data from SDSS DR18 and beyond.
- Optimize hyperparameters.
- Investigate deeper and wider KAN architectures.
- Explore multi-KAN models to improve generalization.
Advisor: Matthew Graham