Inference: Computation Reduction

Techniques to reduce computational requirements: Early Exiting and Cascade Inference.