For any two probability distributions \(P, Q\), an f-divergence is a function
\[D_f(P || Q) := \int f(\frac{dP}{dQ}) dQ\]for any convex function \(f\) such that $$f(1) = 0 (TODO: why is this necessary?)
If probability densities exist, then the \(f-divergence\) can be written as:
\[D_f(P || Q) := \int f(\frac{p(x)}{q(x)}) q(x) dx\]