دانلود رایگان مقاله: از شناسه کد منبع به اصطلاح زبان طبیعی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
458402	696150	2015	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

From source code identifiers to natural language terms

ترجمه فارسی عنوان

از شناسه کد منبع به اصطلاح زبان طبیعی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

درک برنامه، پردازش زبان طبیعی، تقسیم شناسه

Program comprehension - درک برنامه Natural Language Processing - پردازش زبان‌های طبیعی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات

پیش نمایش مقاله

چکیده انگلیسی

Program comprehension techniques often explore program identifiers, to infer knowledge about programs. The relevance of source code identifiers as one relevant source of information about programs is already established in the literature, as well as their direct impact on future comprehension tasks.Most programming languages enforce some constrains on identifiers strings (e.g., white spaces or commas are not allowed). Also, programmers often use word combinations and abbreviations, to devise strings that represent single, or multiple, domain concepts in order to increase programming linguistic efficiency (convey more semantics writing less). These strings do not always use explicit marks to distinguish the terms used (e.g., CamelCase or underscores), so techniques often referred as hard splitting are not enough.This paper introduces Lingua::IdSplitter a dictionary based algorithm for splitting and expanding strings that compose multi-term identifiers. It explores the use of general programming and abbreviations dictionaries, but also a custom dictionary automatically generated from software natural language content, prone to include application domain terms and specific abbreviations. This approach was applied to two software packages, written in C, achieving a f-measure of around 90% for correctly splitting and expanding identifiers. A comparison with current state-of-the-art approaches is also presented.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Systems and Software - Volume 100, February 2015, Pages 117–128

نویسندگان

Nuno Ramos Carvalho, José João Almeida, Pedro Rangel Henriques, Maria João Varanda,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : از شناسه کد منبع به اصطلاح زبان طبیعی

دسترسی سریع

ارتباط

English Website