Skip to content

Causal Assumptions

Machine learning algorithms are like a black box. Data goes in, predictions come out, but it is not usually possible to understand what is happening within the box. A neural network is an example of a machine learning model. It can use something similar to fuzzy logic to weight different paths of causality in a computation graph, but the topology of the computation graph does not have to match the causal assumptions in the causal model.

Mjölnir is designed with the assignment of causal assumptions in mind. A causal directed acyclic graph is provided by the use which is used to constrain the space of models that can be learned from the data. It allows the user to specify causal assumptions which inform what models are allowed to be learned. This constraint makes the mathematics of the model more interpretable by mimicing the causal structure of the causal DAG in the structure of the symbolic expressions learned in the training and model discovery process.

If you are unfamiliar with the notion of causal assumptions, I recommed this related video by Richard McElreath on causal inference.

Symbolic Regression

Symbolic regression involves estimating a symbolic expression that approximates the data well. Mjölnir uses genetic programming to balance accuracy and simplicity of the models estimated. Thus Mjölnir provides a means of model discovery which is constrained by causal assumptions.

Confomal Prediction

Conformal prediction is a form of uncertainty quantification that assumes relatively little about the data generating process. Anastasios N. Angelopoulos and Stephen Bates have produce some resources for learning conformal prediction. A paper they authored A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification. They also provide the following videos:

Computer Algebra System

The symbolic expressions estimated in through symbolic regression become automatically available for mathematical analysis with SymPy. Mjölnir also comes with some methods for common mathematical queries such as locating optima and inferring asymptotic behaviour.

Do Calculus

Coming soon.