“One particular talent stands out among the world-class programmers I’ve known—namely, an ability to move effortlessly between different levels of abstraction.” — Donald Knuth
Cheap gradient principle: Using automatic differentiation, a scalar-valued function and its directional gradient can be computed in no more than 4x the time required just to evaluate the function.