Article code: 504968
Journal code: 864455
Publication year: 2016
English article: 13 pages, PDF
Full-text version: free download
English title of the ISI article
Text mining, a race against time? An attempt to quantify possible variations in text corpora of medical publications throughout the years
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
English abstract


• This study evaluates whether publication dates should be considered in text mining.
• Concurrence of chemokine & cancer terms may correspond to expression in tumor cells.
• Laboratory findings are coherent with variability of results in analyzed literature.
• Concurrence increased at abstract & sentence level. Sentence complexity is stable.
• Concurrent references to chemokines and cancer increased over time.

Background
The continuous growth of the medical sciences literature indicates the need for automated text analysis. Scientific writing, which is neither unitary, transcending social situation, nor defined by a timeless idea, is subject to constant change as it develops in response to evolving knowledge, aims at different goals, and embodies different assumptions about nature and communication. The objective of this study was to evaluate whether publication dates should be considered when performing text mining.

Methods
A search of PUBMED for combined references to chemokine identifiers and particular cancer-related terms was conducted to detect changes over the past 36 years. Text analyses were performed using freeware available from the World Wide Web. TOEFL scores of territories hosting institutional affiliations as well as various readability indices were investigated. Further assessment was conducted using Principal Component Analysis. Laboratory examination was performed to evaluate the quality of attempts to extract content from the examined linguistic features.

Results
The PUBMED search yielded a total of 14,420 abstracts (3,190,219 words). The range of findings in laboratory experimentation was coherent with the variability of the results described in the analyzed body of literature. Increased concurrence of chemokine identifiers together with cancer-related terms was found at the abstract and sentence level, whereas complexity of sentences remained fairly stable.

Conclusions
The findings of the present study indicate that concurrent references to chemokines and cancer increased over time, whereas text complexity remained stable.
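
The concurrence measure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline, which relied on freeware tools not named here. The term lists, the naive sentence splitter, and the function names are all assumptions made for illustration. For each publication year, the sketch computes the fraction of abstracts and the fraction of sentences that mention both a chemokine term and a cancer term.

```python
import re
from collections import defaultdict

# Hypothetical term lists; the identifiers used in the study are not given here.
CHEMOKINE_TERMS = ("cxcl12", "ccl2", "cxcr4")
CANCER_TERMS = ("cancer", "tumor", "carcinoma")


def contains_any(text, terms):
    """Return True if any term occurs as a whole word in text (case-insensitive)."""
    lowered = text.lower()
    return any(re.search(r"\b" + re.escape(t) + r"\b", lowered) for t in terms)


def split_sentences(text):
    """Naive splitter (an assumption): break after '.', '!' or '?' plus whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def has_both(text):
    """True if text mentions at least one chemokine term and one cancer term."""
    return contains_any(text, CHEMOKINE_TERMS) and contains_any(text, CANCER_TERMS)


def cooccurrence_by_year(records):
    """records: iterable of (year, abstract_text) pairs.

    Returns {year: (abstract_fraction, sentence_fraction)}, where each
    fraction is the share of abstracts or sentences containing both term types.
    """
    # Per year: [abstracts, abstract hits, sentences, sentence hits]
    counts = defaultdict(lambda: [0, 0, 0, 0])
    for year, text in records:
        c = counts[year]
        c[0] += 1
        if has_both(text):
            c[1] += 1
        for sentence in split_sentences(text):
            c[2] += 1
            if has_both(sentence):
                c[3] += 1
    return {
        year: (c[1] / c[0], c[3] / c[2] if c[2] else 0.0)
        for year, c in sorted(counts.items())
    }
```

An abstract can count as a hit at the abstract level without any single sentence containing both term types, which is why the two levels are tracked separately.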

Publisher
Database: Elsevier - ScienceDirect
Journal: Computers in Biology and Medicine - Volume 73, 1 June 2016, Pages 173–185