
Kolmogorov–Arnold Networks, a new architecture for deep learning.

anon_quvo said in #1506 1y ago:

https://github.com/K

anon_quvo said in #1507 1y ago:

This is neat. Instead of having fixed nonlinear activation functions on the nodes, they put trainable nonlinear functions on the edges in place of weights. They claim it has better scaling properties and performance for many problems, and is more interpretable (because there are fewer nodes and each "weight" can be visualized as a graph of its function). They basically trade width parameters for activation parameters and apparently come out ahead.

Some of the tricks they use to make it work I still don't understand, but maybe it's a good idea.
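To make the "trainable functions on edges" idea concrete, here's a minimal NumPy sketch of one KAN-style layer. This is not the paper's implementation: the actual KAN paper parameterizes each edge function as a B-spline plus a SiLU base; here each edge is a learnable weighted sum of Gaussian bumps as a simpler stand-in, and all names (`KANLayer`, `n_basis`, etc.) are made up for illustration.

```python
import numpy as np

class KANLayer:
    """One KAN-style layer: n_in -> n_out, with a separate learnable
    1-D function on every (input, output) edge.

    Each edge function is a weighted sum of n_basis fixed Gaussian
    bumps (a stand-in for the B-spline basis in the actual paper);
    the per-edge coefficients are the trainable parameters.
    """

    def __init__(self, n_in, n_out, n_basis=8, rng=None):
        rng = rng or np.random.default_rng(0)
        # fixed basis-function centers on a grid over the expected input range
        self.centers = np.linspace(-2.0, 2.0, n_basis)
        # one coefficient vector per edge: shape (n_in, n_out, n_basis)
        self.coeffs = rng.normal(scale=0.1, size=(n_in, n_out, n_basis))

    def __call__(self, x):
        # x: (batch, n_in)
        # evaluate every basis bump at every input -> (batch, n_in, n_basis)
        basis = np.exp(-0.5 * (x[..., None] - self.centers) ** 2)
        # apply each edge's function to its input, then sum over inputs
        # (the node just sums; all the nonlinearity lives on the edges)
        return np.einsum('bik,iok->bo', basis, self.coeffs)

layer = KANLayer(n_in=3, n_out=2)
out = layer(np.zeros((4, 3)))
print(out.shape)  # (4, 2)
```

The parameter-count trade-off is visible here: a linear layer would have n_in * n_out scalar weights, while this has n_in * n_out * n_basis coefficients, i.e. each scalar weight becomes a whole function. Visualizing one edge just means plotting `basis @ coeffs[i, o]` over the input range.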
