James’ Substack
Subscribe
Sign in
Home
Archive
About
Mixture of Experts, Sparsity, and Megablocks
A neural network learns features that attempt to map input to output. A neural network with 3 features is almost certainly less powerful than one with…
Jun 1, 2024
•
James Poetzscher
and
Siddharth
2
May 2024
Coming soon
This is James’ Substack.
May 31, 2024
•
James Poetzscher
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts