کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
853707 1470681 2016 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Ontology-based Sequence Labelling for Automated Information Extraction for Supporting Bridge Data Analytics
ترجمه فارسی عنوان
برچسب گذاری پیوندی مبتنی بر هستی شناسی برای استخراج خودکار اطلاعات برای پشتیبانی از تجزیه و تحلیل داده های پل
کلمات کلیدی
هستی شناسی، برچسب زدن، استخراج اطلاعات، سیستم تجزیه و تحلیل داده سیستم زیرساخت، پیش بینی زوال پل
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مهندسی (عمومی)
چکیده انگلیسی

The massive amount of data/information buried in textual bridge inspection reports open opportunities to leverage big data analytics for advanced information-rich bridge deterioration prediction. However, utilizing textual data for bridge deterioration prediction is challenging because of its inherently unstructured nature. To this end, this paper proposes an ontology-based information extraction (IE) framework that automatically recognizes and extracts key data/information from unstructured textual reports, and represents the extracted data/information in a structured way that is ready for data analytics. The proposed IE framework is composed of two primary components: (1) ontology-based sequence labelling for term identification, and (2) ontology-based dependency grammar for relationship association. This paper focuses on presenting the proposed sequence labelling methodology. The methodology utilizes ontology-based begin, inside, and outside (BIO) encoding for phrase-level segmentation and Conditional Random Field (CRF) for ontology-based labelling in both token and phrase levels. The experimental results showed that the proposed methodology has a precision of 97% and a recall of 91%.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Engineering - Volume 145, 2016, Pages 504–510
نویسندگان
, ,