دانلود رایگان مقاله: نشانگر بخشی از سخنرانی نشانه برای متن زیست پزشکی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
377634	658806	2014	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

A token centric part-of-speech tagger for biomedical text

ترجمه فارسی عنوان

نشانگر بخشی از سخنرانی نشانه برای متن زیست پزشکی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

برچسب گذاری بخشی از گفتار، برچسب زدن مرکزی، ارزیابی برچسب زنی پزشکی، برچسب زنی پزشکی متقابل دامنه

Part-of-speech tagging - برچسب گذاری بخشی از گفتار

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

نشانگر بخشی از سخنرانی نشانه برای متن زیست پزشکی

چکیده انگلیسی

ObjectiveDifficulties with part-of-speech (POS) tagging of biomedical text is accessing and annotating appropriate training corpora. These difficulties may result in POS taggers trained on corpora that differ from the tagger's target biomedical text (cross-domain tagging). In such cases where training and target corpora differ tagging accuracy decreases. This paper presents a POS tagger for cross-domain tagging called TcT.Methods and materialTcT estimates a tag's likelihood for a given token by combining token collocation probabilities and the token's tag probabilities calculated using a Naive Bayes classifier. We compared TcT to three POS taggers used in the biomedical domain (mxpost, Brill and TnT). We trained each tagger on a non-biomedical corpus and evaluated it on biomedical corpora.ResultsTcT was more accurate in cross-domain tagging than mxpost, Brill and TnT (respective averages 83.9, 81.0, 79.5 and 78.8).ConclusionOur analysis of tagger performance suggests that lexical differences between corpora have more effect on tagging accuracy than originally considered by previous research work. Biomedical POS tagging algorithms may be modified to improve their cross-domain tagging accuracy without requiring extra training or large training data sets. Future work should reexamine POS tagging methods for biomedical text. This differs from the work to date that has focused on retraining existing POS taggers.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Artificial Intelligence in Medicine - Volume 61, Issue 1, May 2014, Pages 11–20

نویسندگان

Neil Barrett, Jens Weber-Jahnke,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : نشانگر بخشی از سخنرانی نشانه برای متن زیست پزشکی

دسترسی سریع

ارتباط

English Website