WAI meetings

Description

Title

Crowdsourcing Ground Truth for Relation Extraction in the Medical Domain

Abstract

I will present the CrowdTruth (http://crowdtruth.org/) approach to performing relation extraction from medical data. CrowdTruth exploits inter-annotator disagreement as a useful signal, allowing us to evaluate data quality, such as ambiguity and vagueness at the sentence level, worker quality, and the quality of the target semantics. I will introduce a workflow for generating gold standard annotations for medical relation extraction through a series of crowdsourcing tasks. Then I will present an evaluation of the crowd data by comparing it with the current gold standard in medical relation extraction. The evaluation is performed by training a relation extraction classifier with both datasets, and comparing the results for F1 measure and accuracy in a cross-validation experiment.

Other presentations by Anca Dumitrache

Date	Title
26 January 2015	Crowdsourcing Ground Truth for Relation Extraction in the Medical Domain
29 February 2016	Scalable and High Quality Relation Extraction with CrowdTruth
31 October 2016	Crowdsourcing for Distant Supervision with Active Learning
22 May 2017	Crowdsourcing Ambiguity-Aware Ground Truth - a Cross-Task Evaluation
05 March 2018	Capturing Ambiguity in Crowdsourcing Frame Disambiguation

WAI schedule

Description

Other presentations by Anca Dumitrache