Article code: 7256663
Journal code: 1472406
Publication year: 2015
Full text: English article, 15-page PDF
English title of the ISI article
Topic based classification and pattern identification in patents
Persian translation of the title
طبقه بندی مبتنی بر موضوع و شناسایی الگو در اختراعات
Related topics
Social Sciences and Humanities; Business, Management and Accounting; Business and International Management
Abstract
Patent classification systems and citation networks are used extensively in innovation studies. However, non-unique mapping of classification codes onto specific products/markets and the difficulties in accurately capturing knowledge flows based just on citation linkages present limitations to these conventional patent analysis approaches. We present a natural language processing based hierarchical technique that enables the automatic identification and classification of patent datasets into technology areas and sub-areas. The key novelty of our technique is to use topic modeling to map patents to probability distributions over real world categories/topics. Accuracy and usefulness of our technique are tested on a dataset of 10,201 patents in solar photovoltaics filed in the United States Patent and Trademark Office (USPTO) between 2002 and 2013. We show that linguistic features from topic models can be used to effectively identify the main technology area that a patent's invention applies to. Our computational experiments support the view that the topic distribution of a patent offers a reduced-form representation of the knowledge content in a patent. Accordingly, we suggest that this hidden thematic structure in patents can be useful in studies of the policy-innovation-geography nexus. To that end, we also demonstrate an application of our technique for identifying patterns in technological convergence.
Publisher
Database: Elsevier - ScienceDirect
Journal: Technological Forecasting and Social Change - Volume 94, May 2015, Pages 236-250