Reduce multiply-accumulate circuit area by up to 75%.
Implement sigmoid, tanh, ELU, etc. without look-up tables in hardware.
Unbiased algorithms. No retraining required for popular architectures like ResNet.