A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

Article code	Journal code	Publication year	English article	Full text
425482	685750	2008	7-page PDF	Free download

Keywords

Grid; Policy gradient; Dynamic pricing; Reinforcement learning

Related topics

Engineering and Basic Sciences, Computer Engineering, Computational Theory and Mathematics

Abstract

As more companies are beginning to adopt the e-business model, it becomes easier for buyers to compare prices at multiple sellers and choose the one that charges the best price for the same item or service. As a result, the demand for the goods of a particular seller is becoming more unstable, since other sellers are regularly offering discounts that attract large fractions of buyers. Therefore, it becomes more important for each seller to switch from static to dynamic pricing policies that take into account observable characteristics of the current demand and the state of the seller’s resources. This paper presents a Reinforcement Learning algorithm that can tune parameters of a seller’s dynamic pricing policy in a gradient direction (thus converging to the optimal parameter values that maximize the revenue obtained by the seller) even when the seller’s environment is not fully observable. This algorithm is evaluated using a simulated Grid market environment, where customers choose a Grid Service Provider (GSP) to which they want to submit a computing job based on the posted price and expected delay information at each GSP.
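The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it describes: tuning the parameters of a pricing policy along a gradient estimate when the seller sees only a noisy signal of demand. It uses plain REINFORCE with a softmax policy, which is a stand-in and not necessarily the method of the paper. The price menu, the two-level hidden demand, the buyer's willingness to pay, and all function names are assumptions made up for illustration; the Grid market simulation from the paper is not reproduced.

```python
import math
import random

PRICES = [1.0, 2.0, 3.0]  # hypothetical discrete price menu (assumption)


def features(obs):
    """Feature vector built from the seller's noisy demand observation."""
    return [1.0, obs]


def action_probs(theta, obs):
    """Softmax pricing policy: one row of weights per candidate price."""
    phi = features(obs)
    logits = [sum(w * f for w, f in zip(row, phi)) for row in theta]
    top = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def step(rng, theta):
    """Simulate one sale; return (observation, price index, revenue, probs)."""
    high_demand = rng.random() < 0.5  # hidden state, never shown to the seller
    obs = (1.0 if high_demand else 0.0) + rng.gauss(0.0, 0.5)  # noisy signal
    probs = action_probs(theta, obs)
    k = rng.choices(range(len(PRICES)), weights=probs)[0]
    willingness = 3.5 if high_demand else 1.5  # toy buyer model (assumption)
    revenue = PRICES[k] if PRICES[k] <= willingness else 0.0
    return obs, k, revenue, probs


def train(episodes=20000, lr=0.05, seed=0):
    """REINFORCE: theta += lr * (revenue - baseline) * grad log pi(price | obs)."""
    rng = random.Random(seed)
    theta = [[0.0, 0.0] for _ in PRICES]
    baseline = 0.0
    for _ in range(episodes):
        obs, k, revenue, probs = step(rng, theta)
        phi = features(obs)
        advantage = revenue - baseline
        for j, row in enumerate(theta):
            indicator = 1.0 if j == k else 0.0
            for i, f in enumerate(phi):
                # Gradient of log softmax w.r.t. theta[j][i] is (1[j == k] - p_j) * phi_i.
                row[i] += lr * advantage * (indicator - probs[j]) * f
        baseline += 0.01 * (revenue - baseline)  # running-average baseline
    return theta


def average_revenue(theta, episodes=5000, seed=1):
    """Monte Carlo estimate of revenue per sale under the policy theta."""
    rng = random.Random(seed)
    return sum(step(rng, theta)[2] for _ in range(episodes)) / episodes


if __name__ == "__main__":
    learned = train()
    uniform = [[0.0, 0.0] for _ in PRICES]
    print("uniform policy revenue:", average_revenue(uniform))
    print("learned policy revenue:", average_revenue(learned))
```

In this toy setting the seller never observes whether demand is high or low, only a noisy reading of it, so the learned policy can only shift probability toward higher prices as the reading grows. The running-average baseline is a common variance-reduction choice and is likewise an assumption here.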

Publisher

Database: Elsevier - ScienceDirect
Journal: Future Generation Computer Systems - Volume 24, Issue 7, July 2008, Pages 687–693

Authors

David Vengerov
