Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
1123308 | Procedia - Social and Behavioral Sciences | 2011 | 9 Pages |
Abstract
The language component plays an important role in data and information retrieval. In software engineering, data retrieval is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed tremendously to the collection of software data. This paper highlights a data retrieval method for mining software data from a vast open source software repository, SourceForge. To automate data retrieval from the repository, a parser based on a pattern-matching algorithm was written in the Python programming language. The retrieved data were later used to estimate the quality of the open source software.
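The abstract does not show the parser itself, so the following is only a minimal sketch of what pattern-matching retrieval from a project page might look like, using Python's standard `re` module. The page fragment, the field names (`name`, `downloads`, `open_bugs`), and the helpers `extract_fields` and `to_int` are all hypothetical and chosen for illustration. They do not reflect the actual SourceForge markup or the authors' implementation.

```python
import re

# Hypothetical page fragment: the real SourceForge markup is not given in the abstract.
SAMPLE_PAGE = """
<h1 class="project-name">ExampleTool</h1>
<span class="downloads">12,345</span>
<span class="open-bugs">7</span>
"""

# One illustrative regular expression per field (field names are assumptions).
PATTERNS = {
    "name": re.compile(r'<h1 class="project-name">(.*?)</h1>'),
    "downloads": re.compile(r'<span class="downloads">([\d,]+)</span>'),
    "open_bugs": re.compile(r'<span class="open-bugs">(\d+)</span>'),
}


def extract_fields(page_text):
    """Return a dict mapping each field to its matched text, or None if absent."""
    record = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(page_text)
        record[field] = match.group(1) if match else None
    return record


def to_int(text):
    """Convert a matched count such as '12,345' to an int; pass None through."""
    return int(text.replace(",", "")) if text is not None else None


record = extract_fields(SAMPLE_PAGE)
print(record["name"], to_int(record["downloads"]), to_int(record["open_bugs"]))
```

Returning `None` for a field whose pattern does not match, rather than raising an error, lets a batch run over many project pages continue when one page lacks a field. Missing values can then be handled when the data are later used for quality estimation.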