کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1134784 956078 2011 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Combining a new data classification technique and regression analysis to predict the Cost-To-Serve new customers
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مهندسی صنعتی و تولید
پیش نمایش صفحه اول مقاله
Combining a new data classification technique and regression analysis to predict the Cost-To-Serve new customers
چکیده انگلیسی

Identifying the Cost-To-Serve (CTS) of customers is one of the most challenging problems in Supply Chain Management because of the diversity in their business activities. For the particular case of the industrial gas business, we are interested in predicting the cost to deliver bulk (liquefied) gas to new customers using a multifactor linear regression model. Developing a single model, i.e. analyzing the observations all at once, produces poor prediction results. Therefore prior to the regression analysis, a new supervised learning technique is used to group customers who are similar in some sense. Classes of customers are represented by hyper-boxes and a linear regression model is subsequently built within each class. The combination of data classification and regression is proven to increase the accuracy of the prediction.Two Mixed-Integer-Linear Programming (MILP) models are developed for data classification purposes. Although we are dealing with a supervised learning method, classes are not predefined in our case. Rather, we input a continuous “classification” attribute that is optimally discretized by the MILP’s in order to minimize the number of misclassifications. Therefore our data classification model offers a broader range of applications. A number of illustrative examples are used to prove the effectiveness of the proposed approach.


► We develop a method to predict cost-to-serve new customers in the industrial gas business.
► Our new data classification technique introduces a continuous classification attribute.
► Classes rather than being provided as an input are optimally determined by the proposed method.
► The data classification method uses hyper-boxes to represent classes and employs two MILP models.
► Using a flexible class size, not predefined classes, improves the performance of classification.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Industrial Engineering - Volume 61, Issue 1, August 2011, Pages 184–197
نویسندگان
, ,