A PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
An implementation showing how to use Kolmogorov-Arnold Networks (KANs) for classification and regression tasks.
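For orientation, here is a minimal sketch of a KAN-style layer in plain PyTorch. It is not taken from any of the repositories listed here; it parameterizes each edge's learnable 1-D activation as a linear combination of Gaussian radial basis functions over a fixed grid, a common simplification of the B-spline parameterization in the original KAN paper. All names, widths, and the basis choice are illustrative assumptions.

```python
# A minimal KAN-style layer sketch (assumptions: RBF basis instead of
# B-splines, fixed centers on [-2, 2], illustrative widths).
import torch
import torch.nn as nn


class KANLayer(nn.Module):
    def __init__(self, in_features: int, out_features: int, num_basis: int = 8):
        super().__init__()
        # Fixed RBF centers spread over the expected input range [-2, 2].
        self.register_buffer("centers", torch.linspace(-2.0, 2.0, num_basis))
        # One coefficient per (input, output, basis) triple: these weights
        # define the learnable activation on every edge of the layer.
        self.coeffs = nn.Parameter(torch.randn(in_features, out_features, num_basis) * 0.1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, in) -> RBF features: (batch, in, num_basis)
        basis = torch.exp(-((x.unsqueeze(-1) - self.centers) ** 2))
        # Sum phi_ij(x_i) over inputs i for every output j.
        return torch.einsum("bik,iok->bo", basis, self.coeffs)


# A two-layer KAN for a toy classification task.
model = nn.Sequential(KANLayer(4, 16), KANLayer(16, 3))
logits = model(torch.randn(32, 4))  # (32, 3) class scores
print(logits.shape)
```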
Improved LBFGS and LBFGS-B optimizers in PyTorch.
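As context for the optimizer repo, the sketch below shows how an LBFGS optimizer is driven in PyTorch. It uses the built-in torch.optim.LBFGS as a stand-in for the improved variants mentioned above, whose exact APIs may differ; the toy objective and hyperparameters are assumptions.

```python
# Driving LBFGS in PyTorch: the optimizer may evaluate the objective
# several times per step, so loss and gradients live in a closure.
import torch

x = torch.randn(10, requires_grad=True)
optimizer = torch.optim.LBFGS([x], lr=0.1, max_iter=20)

def closure():
    optimizer.zero_grad()
    loss = ((x - 3.0) ** 2).sum()  # toy quadratic objective (assumption)
    loss.backward()
    return loss

for _ in range(5):
    loss = optimizer.step(closure)
print(float(loss))
```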
Testing KAN-based GPT models for text generation
KANs for text classification on GLUE tasks
KAN meets Gram Polynomials
An implementation of the KAN architecture using learnable activation functions for knowledge distillation on the MNIST handwritten digits dataset. The project distills a three-layer teacher KAN into a more compact two-layer student and compares performance with and without distillation, as sketched below.
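A hedged sketch of that distillation setup, reusing the KANLayer class from the earlier sketch. The layer widths, temperature, and loss weighting are illustrative assumptions, not the project's actual hyperparameters; the loss is the standard Hinton-style soft/hard target combination.

```python
# Distilling a three-layer teacher KAN into a two-layer student
# (widths, T, and alpha are assumptions; KANLayer is defined above).
import torch
import torch.nn as nn
import torch.nn.functional as F

teacher = nn.Sequential(KANLayer(784, 64), KANLayer(64, 32), KANLayer(32, 10))
student = nn.Sequential(KANLayer(784, 32), KANLayer(32, 10))

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    # Soft targets: match the teacher's temperature-softened distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: ordinary cross-entropy against the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

x = torch.randn(32, 784)             # stand-in for flattened MNIST digits
labels = torch.randint(0, 10, (32,))
with torch.no_grad():
    t_logits = teacher(x)
loss = distillation_loss(student(x), t_logits, labels)
loss.backward()
```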
The repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks): an attempt to first adapt it for training on text, and later adjust it for other modalities.