We consider an information-theoretic objective function for statistical modeling of time series that embodies a parametrized trade-off between the predictive power of a model and the model's complexity. We study two distinct cases of optimal causal inference, which we call optimal caus