Regression by dependence minimization and its application to causal inference in additive noise models