AbstractIn 1989 George Cybenko proved in a landmark paper that wide shallow
neural networks can approximate arbitrary continuous functions on a compact set. This universal approximation theorem sparked a lot of follow-up research. Shen, Yang and Zhang determined optimal
→