Title : Utilizing Free Text for Predictive Modeling in Health

Presenter Mark Hoogendoorn
Abstract Predictive modeling can be of great value in the domain of health. It can, among other things, drive more pro-active and personalized treatments. In the past, a lot of work has been done to use structured or coded data to create such predictive models with conventional machine learning techniques. There is however a wealth of information available in unstructured data as well. Think of doctor's notes, or writing of the patient. During my stay at MIT last Summer I worked together with the Clinical Decision Making group who have ample experience in extracting useful information from unstructured medical data. We focused on how the unstructured data can complement the structured data to increase predictive performance. I will present the results of two explorations, one is to see how predictive performance for colorectal cancer can be improved by using the notes of the GP, and the second which aims at predicting therapeutic outcome for treatments of anxiety disorders based on email exchanges between patient and therapist.

Title : Probabilistic logical causal inference

Presenter Sara Magliacane
Abstract Discovering causal relations from data represents the core of the scientific method. In most cases the causal relations are recovered from experimental data in which the variable of interest is perturbed, but seminal work from Spirtes and Pearl demonstrates that, under certain assumptions, it is already possible to exclude several implausible causal models of the data by using only observational data. Constraint-based causal discovery methods use statistical (in)dependences from the data to express constraints over all the possible causal models. One of the most promising formulations of this problem is in logic, which allows for quick prototyping, combination of algorithms and an easy integration of complex background knowledge. On the other side, a purely logic approach cannot handle noise in the (in)dependence test results, making the case for the use of probabilistic logics. In this talk I will present two algorithms for probabilistic logical causal inference that we have been developing with Joris Mooij from UvA and Tom Claassen from RU Nijmegen. Compared to other existing methods, our algorithms are more scalable and simpler to encode, while preserving a comparable accuracy in the prediction of indirect causal and acausal relationships.