Manhattan learning (eq. ()) is selected by setting MSTJN(5)
= 1. It uses basically the same learning parameters as BP. However, since the
weight update is not reduced by the derivative of the sigmoid, the
learning rate must be a few orders of magnitude smaller.