They were trying to teach it addition modulo 113. Apparently this was the approach that minimized the network's weights?
Favorites May 15th 2023