Collaborative text-annotation resource for disease-centered relation extraction from biomedical text

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
517686	867490	2009	11 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

IE, Information extraction - استخراج اطلاعات Relation extraction - استخراج رابطه Clinical informatics - اطلاع رسانی بالینی Autism - اوتیسم یا درخودماندگی Information retrieval - بازیابی اطلاعات Protein-protein interaction, PPI - تعامل پروتئین-پروتئین Corpus annotation - حاشیه نویسی قطعه Text mining - متن‌کاوی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش صفحه اول مقاله

Collaborative text-annotation resource for disease-centered relation extraction from biomedical text

چکیده انگلیسی

Agglomerating results from studies of individual biological components has shown the potential to produce biomedical discovery and the promise of therapeutic development. Such knowledge integration could be tremendously facilitated by automated text mining for relation extraction in the biomedical literature. Relation extraction systems cannot be developed without substantial datasets annotated with ground truth for benchmarking and training. The creation of such datasets is hampered by the absence of a resource for launching a distributed annotation effort, as well as by the lack of a standardized annotation schema. We have developed an annotation schema and an annotation tool which can be widely adopted so that the resulting annotated corpora from a multitude of disease studies could be assembled into a unified benchmark dataset. The contribution of this paper is threefold. First, we provide an overview of available benchmark corpora and derive a simple annotation schema for specific binary relation extraction problems such as protein–protein and gene–disease relation extraction. Second, we present BioNotate: an open source annotation resource for the distributed creation of a large corpus. Third, we present and make available the results of a pilot annotation effort of the autism disease network.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Biomedical Informatics - Volume 42, Issue 5, October 2009, Pages 967–977

نویسندگان

C. Cano, T. Monaghan, A. Blanco, D.P. Wall, L. Peshkin,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Collaborative text-annotation resource for disease-centered relation extraction from biomedical text

دسترسی سریع

ارتباط

English Website