کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
434950 1441655 2015 27 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Extraction of domain concepts from the source code
ترجمه فارسی عنوان
استخراج مفاهیم دامنه از کد منبع
کلمات کلیدی
درک برنامه، مفهوم استخراج، فیلتر مفهوم دامنه، محل مفهوم، بازیابی اطلاعات
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی


• We exploit structural and linguistic aspects of the code to extract ontologies.
• To filter domain concepts, we apply information retrieval based filtering.
• The structural ontology is empirically compared with the linguistic ontology.
• Unfiltered and filtered ontologies are validated on concept location tasks.

Program understanding involves mapping domain concepts to the code elements that implement them. Such mapping is often implicit and undocumented. However, identifier names contain relevant clues to rediscover the mapping and make it available to programmers.In this paper, we present two approaches that exploit structural and linguistic aspects of the source code to extract ontologies. The extracted ontologies are then compared in terms of the concepts they contain and the support they give to program understanding, specifically concept location. Such ontologies are composed of domain and implementation concepts as they come from the source code. To filter domain concepts, we have applied Information Retrieval (IR) based filtering techniques. We have assessed the resulting ontologies against a reference, manually defined, domain ontology.The experimentation was carried out using six real world open source programs. Results show that the ontologies extracted using the structural and linguistic aspects of the source code are complementary. We also observed that their union gives a better support to concept location than the individual ontologies. Filtering the ontologies gives a concise representation of the domain knowledge captured in the source code. The filtered ontologies, however, have been found to be less effective in supporting concept location than the unfiltered ontologies.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Science of Computer Programming - Volume 98, Part 4, 1 February 2015, Pages 680–706
نویسندگان
, ,