Article code: 1099720
Journal code: 953218
Publication year: 2008
English article: 8 pages, PDF
Full-text version: free download
English title of the ISI article
A generic lexical URL segmentation framework for counting links, colinks or URLs
Related topics
Humanities and Social Sciences, Social Sciences, Library and Information Science
English abstract

Large sets of Web page links, colinks, or URLs sometimes need to be counted or otherwise summarized by researchers to analyze Web growth or publishing. Computing professionals also use them to evaluate Web sites or optimize search engines. Despite the apparently simple nature of these types of data, many different summarization methods have been used in the past. Some of these methods may not have been optimal. This article proposes a generic lexical framework to unify and extend existing methods through abstract notions of link lists and URL lists. The approach is built upon decomposing URLs by lexical segments, such as domain names, and systematically characterizing the counting options available. In addition, counting method choice recommendations are inferred from a very general set of theoretical research assumptions. The article also offers practical advice for analyzing raw data from search engines.
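
To make the idea of decomposing URLs by lexical segments more concrete, here is a minimal Python sketch. It is not the framework from the article: the segment levels ("page", "directory", "domain", "tld"), the function names `segment` and `count_urls`, and the `unique` switch are assumptions chosen only for illustration, and the TLD rule is deliberately naive.

```python
from collections import Counter
from urllib.parse import urlsplit


def segment(url: str, level: str) -> str:
    """Truncate a URL to one lexical segment level (levels are illustrative)."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    path = parts.path or "/"
    if level == "page":
        # Host plus path, ignoring scheme, query string and fragment.
        return f"{host}{path}"
    if level == "directory":
        # Host plus the path up to and including its last slash.
        return f"{host}{path[: path.rfind('/') + 1]}"
    if level == "domain":
        return host
    if level == "tld":
        # Naive assumption: the TLD is the last dot-separated label.
        return host.rsplit(".", 1)[-1]
    raise ValueError(f"unknown level: {level}")


def count_urls(urls, level, unique=True):
    """Count URLs per segment.

    With unique=True each distinct URL string is counted once;
    with unique=False every occurrence in the list is counted.
    """
    items = set(urls) if unique else list(urls)
    return Counter(segment(u, level) for u in items)


urls = [
    "http://www.example.ac.uk/a/b.html",
    "http://www.example.ac.uk/a/c.html",
    "http://www.example.ac.uk/a/b.html",  # repeated URL
    "http://other.example.org/",
]
by_domain = count_urls(urls, "domain")
by_domain_all = count_urls(urls, "domain", unique=False)
by_directory = count_urls(urls, "directory")
```

The same URL list gives different totals depending on the segment level and on whether repeated URLs are collapsed, which is the kind of counting choice the abstract says needs to be characterized systematically.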

Publisher
Database: Elsevier - ScienceDirect
Journal: Library & Information Science Research - Volume 30, Issue 2, June 2008, Pages 94–101