The provided implementation includes a customizable neural network architecture based on Kolmogorov-Arnold Networks (KANs), built with TensorFlow's API. KANs aim to approximate multivariate functions efficiently by employing learnable nonlinear transformations, often with fewer parameters than traditional deep neural networks.
- TensorFlow: Main library providing tools for machine learning and neural network construction.
- Description: Custom TensorFlow layer implementing a base linear transformation combined with a B-spline transformation, as part of a KAN.
- Parameters:
  - `in_features`: Integer, number of input features.
  - `out_features`: Integer, number of output features.
  - `grid_size`: Integer, number of grid points for the B-spline basis.
  - `spline_order`: Integer, order of the spline (degree is `spline_order - 1`).
  - `activation`: String, activation function applied after summing the base and spline outputs.
  - `regularization_factor`: Float, factor for L2 regularization.
  - `grid_range`: Tuple, range of the grid used in the B-spline transformation.
- Methods (a sketch of the full layer follows this list):
  - `build_grid`: Initializes the grid used for B-spline transformations.
  - `call`: Computes the output of the layer using both the linear and spline transformations.
  - `compute_spline_output`: Calculates the output of the spline transformation.
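The snippet below is a minimal sketch of how such a layer could be assembled from the parameters and methods listed above. It is not the repository's exact code: the helpers `b_spline_basis` and `extend_grid` are the components documented further below, and details such as weight initialization and shapes are assumptions.

```python
import tensorflow as tf

class KANLinear(tf.keras.layers.Layer):
    """Illustrative sketch only; the real layer may differ in detail."""

    def __init__(self, in_features, out_features, grid_size=5, spline_order=3,
                 activation="swish", regularization_factor=0.01,
                 grid_range=(-1.0, 1.0), **kwargs):
        super().__init__(**kwargs)
        self.in_features = in_features
        self.out_features = out_features
        self.grid_size = grid_size
        self.spline_order = spline_order
        self.activation = tf.keras.activations.get(activation)
        self.regularizer = tf.keras.regularizers.l2(regularization_factor)
        self.grid_range = grid_range

    def build_grid(self):
        # Uniform knots over grid_range, extended at both ends so every input
        # inside the range is covered by a full set of spline supports.
        lo, hi = self.grid_range
        grid = tf.linspace(float(lo), float(hi), self.grid_size)
        return extend_grid(grid, self.spline_order - 1)  # helper sketched below

    def build(self, input_shape):
        self.grid = self.build_grid()
        n_basis = int(self.grid.shape[0]) - self.spline_order  # knots - 1 - degree
        # Base (residual) linear path.
        self.base_weight = self.add_weight(
            name="base_weight", shape=(self.in_features, self.out_features),
            regularizer=self.regularizer)
        # One learnable coefficient per basis function per (input, output) edge.
        self.spline_weight = self.add_weight(
            name="spline_weight",
            shape=(self.in_features, n_basis, self.out_features),
            regularizer=self.regularizer)

    def compute_spline_output(self, x):
        # bases: (batch, in_features, n_basis); self.grid is already extended.
        bases = b_spline_basis(x, self.grid, self.spline_order, extend=False)
        return tf.einsum("bin,ino->bo", bases, self.spline_weight)

    def call(self, x):
        base = tf.matmul(x, self.base_weight)
        # Activation applied after summing base and spline outputs, as documented.
        return self.activation(base + self.compute_spline_output(x))
```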
- Description: Computes B-spline basis values for input values using a specified grid and order.
- Parameters:
  - `x`: TensorFlow Tensor, input values.
  - `grid`: TensorFlow Tensor, grid points for the splines.
  - `k`: Integer, order of the B-spline.
  - `extend`: Boolean, whether to extend the grid to handle boundaries.
- Returns: TensorFlow Tensor of B-spline basis values.
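A common way to compute these values is the Cox-de Boor recursion. The sketch below is one possible implementation under the conventions stated above (notably, order `k` implying degree `k - 1`); the repository's actual function may differ:

```python
import tensorflow as tf

def b_spline_basis(x, grid, k, extend=True):
    """Cox-de Boor recursion. x: (batch, in_features); grid: 1-D knot tensor.
    Follows the convention above: order k means polynomial degree k - 1."""
    degree = k - 1
    if extend:
        grid = extend_grid(grid, degree)  # helper sketched below
    x = tf.expand_dims(x, -1)  # (batch, in_features, 1) for broadcasting
    # Degree-0 bases: indicator of each half-open knot interval.
    bases = tf.cast(tf.logical_and(x >= grid[:-1], x < grid[1:]), x.dtype)
    # Raise the degree one step at a time; knots must be strictly increasing.
    for p in range(1, degree + 1):
        left = (x - grid[:-(p + 1)]) / (grid[p:-1] - grid[:-(p + 1)])
        right = (grid[p + 1:] - x) / (grid[p + 1:] - grid[1:-p])
        bases = left * bases[..., :-1] + right * bases[..., 1:]
    return bases  # (batch, in_features, n_knots - k)
```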
- Description: Extends a given grid by a specified number of points on both ends.
- Parameters:
  - `grid`: TensorFlow Tensor, original grid points.
  - `k_extend`: Integer, number of points to extend on each side.
- Returns: Extended grid.
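A minimal version of such a helper might look as follows, assuming the original grid is uniformly spaced:

```python
import tensorflow as tf

def extend_grid(grid, k_extend):
    """Pad the knot vector with k_extend extra points on each side, continuing
    the (assumed uniform) spacing of the original grid."""
    h = (grid[-1] - grid[0]) / (tf.cast(tf.size(grid), grid.dtype) - 1.0)
    left = grid[0] - h * tf.range(k_extend, 0, -1, dtype=grid.dtype)
    right = grid[-1] + h * tf.range(1, k_extend + 1, dtype=grid.dtype)
    return tf.concat([left, grid, right], axis=0)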
- Description: Sequential model that aggregates multiple `KANLinear` layers to form a complete KAN.
- Parameters:
  - `layers_configurations`: List of dictionaries, configurations for each `KANLinear` layer.
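Assuming `KANLinear` accepts the parameters listed earlier, building the full model could be as simple as:

```python
def build_kan(layers_configurations):
    """Hypothetical builder: stacks one KANLinear layer per config dict."""
    return tf.keras.Sequential(
        [KANLinear(**config) for config in layers_configurations])

# Example: a 4 -> 16 -> 1 KAN (parameter names assumed from the list above).
kan = build_kan([
    {"in_features": 4, "out_features": 16},
    {"in_features": 16, "out_features": 1, "activation": "linear"},
])
```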
- Description: Utility function to fetch activations from a specified layer in the model.
- Parameters:
  - `model`: TensorFlow model from which to fetch activations.
  - `model_inputs`: Input data to the model.
  - `layer_name`: Optional name of the layer from which to fetch activations.
- Returns: Activations from the specified layer or all layers if none specified.
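One standard Keras pattern for this (a sketch, not necessarily the repository's exact approach) is to re-wire the model so the requested layer's outputs become model outputs:

```python
def get_activations(model, model_inputs, layer_name=None):
    """Sketch: expose one (or every) layer's output as a model output and
    run a forward pass. Assumes the model has been built with known inputs."""
    layers = model.layers if layer_name is None else [model.get_layer(layer_name)]
    probe = tf.keras.Model(inputs=model.inputs,
                           outputs=[layer.output for layer in layers])
    return probe(model_inputs)
```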
- Error Handling: Consider adding error handling for potential issues with input types and values (see the sketch after this list).
- Efficiency: Analyze and optimize the computation of B-spline basis, which can be critical for performance.
- Documentation: Ensure each method and function is accompanied by comprehensive docstrings in the code.
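For example, the error-handling recommendation could start with simple constructor checks like these (illustrative only):

```python
def validate_kan_config(in_features, out_features, grid_size, spline_order):
    """Illustrative checks only; parameter names mirror KANLinear above."""
    for name, value in (("in_features", in_features),
                        ("out_features", out_features),
                        ("grid_size", grid_size),
                        ("spline_order", spline_order)):
        if not isinstance(value, int) or value <= 0:
            raise ValueError(f"{name} must be a positive integer, got {value!r}")
    if grid_size < 2:
        raise ValueError("grid_size must be at least 2 to define an interval")
```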
This documentation provides an overview and detailed explanation of each component in the TensorFlow implementation of KAN. For practical use, ensure proper testing and validation of the functions, especially around the numerical stability of the B-spline calculations.
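As one example of such a check, B-spline bases should form a partition of unity inside the original grid range; the hypothetical helpers sketched above can be sanity-checked like this:

```python
import numpy as np
import tensorflow as tf

# Partition-of-unity check: inside the original grid range, the B-spline
# bases at any point should sum to 1 (up to floating-point error).
x = tf.random.uniform((256, 4), minval=-0.9, maxval=0.9)
grid = tf.linspace(-1.0, 1.0, 6)          # a small uniform 6-point grid
bases = b_spline_basis(x, grid, k=3)      # helper sketched earlier
np.testing.assert_allclose(tf.reduce_sum(bases, axis=-1).numpy(), 1.0,
                           rtol=0, atol=1e-5)
```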
- Kolmogorov-Arnold Networks (KANs) represent a novel neural network architecture inspired by the Kolmogorov-Arnold representation theorem.
- They differ from traditional Multi-Layer Perceptrons (MLPs) by featuring learnable activation functions on edges instead of fixed activation functions on nodes.
- Node Functionality: Nodes in KANs sum incoming signals without applying non-linearities.
- Edge Functionality: Edges contain spline-based learnable activation functions, allowing for precise local adjustments and optimization of univariate functions.
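In symbols (generic KAN notation from the literature, not tied to this implementation's variable names), a KAN layer computes each output coordinate as a sum of learnable univariate functions applied to its inputs:

```latex
x_{l+1,\,j} = \sum_{i=1}^{n_l} \phi_{l,j,i}\left(x_{l,i}\right)
```

where each \(\phi_{l,j,i}\) is a spline-parameterized function living on the edge from node \(i\) of layer \(l\) to node \(j\) of layer \(l+1\).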
- Accuracy and Interpretability: KANs can optimize both compositional structures and univariate functions, leading to improved accuracy and interpretability.
- Flexibility with Functions: They are particularly adept at modeling complex, low-dimensional functions accurately.
- Training Speed: KANs currently train significantly slower than MLPs; this is regarded as an engineering challenge that future work may address.
- Efficiency: KANs could yield more compact and efficient models, reducing computational expense.
- Interpretability: The learnable activation functions enhance the interpretability of the models, crucial for applications requiring transparency, like healthcare.
- Few-shot Learning: KANs might outperform existing architectures in learning from fewer examples.
- Knowledge Representation and Reasoning: They could potentially enhance the ability of models to represent and manipulate complex, structured knowledge.
- Multimodal Learning: KANs could lead to more effective and efficient multimodal models by leveraging their ability to learn and optimize compositional structures.
- Significance: Kolmogorov-Arnold Networks mark a significant step forward in neural network design, promising to advance the capabilities and applications of machine learning models.
- Future Research: Ongoing research will likely focus on overcoming the current limitations and expanding the practical applications of KANs.
Feel free to open an issue in the Issues tab if you need clarification or find a bug! Any bug fixes would be greatly appreciated.