next up previous
Next: Quickprop Up: Initialization Previous: Manhattan


Langevin learning ( MSTJN(5) = 2) is identical to BP except for an additional Gaussian noise term (eq. (gif)). In our view, LV is the most powerful of all the algorithms for networks with many hidden layers, even though it requires somewhat more CPU time [6,49]. Except for the noise level ( PARJN(6)), to which it is not very sensitive provided it is less or equal to 0.1, it uses the same parameters as BP.

System PRIVILEGED Account
Fri Feb 24 11:28:59 MET 1995