کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
397303 1438448 2015 25 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Statistical modeling under partial identification: Distinguishing three types of identification regions in regression analysis with interval data
ترجمه فارسی عنوان
مدل سازی آماری تحت شناسایی جزئی: شناسایی سه منطقه شناسایی در تجزیه و تحلیل رگرسیون با داده های فاصله
کلمات کلیدی
شناسایی جزئی، احتمالات نامطلوب، داده های فاصله، سانسور مصاحبه، داده های درشت مدل رگرسیون خطی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• We distinguish and elaborate different types of Identification regions.
• We introduce a rigorous set-based Identification Region.
• We carefully investigate the used concepts in a detailed application example.
• We compare the concepts with classical procedures.
• We make clear the crucial role of the exact presumed assumptions.

One of the most promising applications of the methodology of imprecise probabilities in statistics is the reliable analysis of interval data (or more generally coarsened data). As soon as one refrains from making strong, often unjustified assumptions on the coarsening process, statistical models are naturally only partially identified and set-valued parameter estimators (identification regions) have to be derived.In this paper we consider linear regression analysis under interval data in the dependent variable. While in the traditional case of neglected imprecision different understandings of regression modeling lead to the same parameter estimators, we now have to distinguish between two different types of identification regions, called (Sharp) Marrow Region (SMR) and (Sharp) Collection Region (SCR) here. In addition, we propose the Set-loss Region (SR) as a compromise between SMR and SCR based on a set-domained loss function. We elaborate and discuss some fundamental properties of these regions and then illustrate the methodology in detail by an example, where the influence of different covariates on wine quality, measured by a coarse rating scale, is investigated. We also compare the different identification regions to classical estimates from a naive analysis and from common interval censorship modeling.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: International Journal of Approximate Reasoning - Volume 56, Part B, January 2015, Pages 224–248
نویسندگان
, ,