Title : RDFVault: A Compact String Pool For RDF Data

Presenter Hamid Bazoobandi
Abstract Resource Description Framework (RDF) is a data model for representing knowledge and resources over the World Wide Web in a directed graph form. Each RDF statement consists of three terms in the form of a triple (s, p, o) where predicate p, represents an edge (a relationship) between node s (as subject), and node o (as object) in the graph, where usually each node represents a resource on the web. The simplicity of this data model, along with assigning universally unique identifiers (called IRIs) to resources across the web, makes the integration of data from different sources much easier. However, this simplicity comes at the expense of high redundancy and bulkiness in RDF datasets, which can result in wasteful memory usage at processing time. Currently the best approach to address this issue is to use string pool. A String pool is simply a table which keeps track of all existing strings in the main memory, and all new strings are checked against this table to avoid the generation of duplicate strings in memory. Existing hash table based string pools can mitigate the redundancy level by eliminating duplicate terms, but they can not use the high level of similarities and common prefixes abound in RDF data to improve the memory efficiency. Therefore, we propose RDFVault, which is a compact trie-based string pool for the storage and retrieval of RDF terms in the main memory which exploits the characteristics of RDF data to improve memory efficiency as much as possible. Our experiments show that RDFVault consumes up to 5 times less memory than hash table based string pools with small performance overhead.

Title : Inferring Recommendation Interactions in Clinical Guidelines

Presenter Veruska Zamborlini
Abstract The formal representation of clinical knowledge is still an open research topic. Classical representation languages for clinical guidelines are used to produce diagnostic and treatment plans. However, they have important limitations, e.g. when looking for ways to re-use, combine, and reason over existing clinical knowledge. These limitations are especially problematic in the context of multimorbidity; patients that suffer from multiple diseases. To overcome these limitations, this paper proposes a model for clinical guidelines (TMR4I) that allows the re-use and combination of knowledge from multiple guidelines. Semantic web technology is applied to implement the model, allowing us to automatically infer interactions between recommendations, such as recommending the same drug more than once. It relies on an existing Linked Data set, DrugBank, for identifying drug-drug interactions. We evaluate the model by applying it to two realistic case studies on multimorbidity that combine guidelines for two (Duodenal Ulcer and Transient Ischemic Attack) and three diseases (Osteoarthritis, Hypertension and Diabetes) and compare the results with existing methods.