Keywords: Automatic voltage regulator; Adaptive optimal control; Optimal tracking problem; Policy iteration; Integral control
ISI articles on policy iteration (untranslated)
The following articles have not yet been translated into Persian.
If you need a finished translation of any of the articles below, you can place an order and the experienced translators of this collection will translate it for you as soon as possible.
Keywords: Direct-comparison based optimization; Discrete event systems; Perturbation analysis; Financial network; Markov decision problems; Risk contagion; Policy iteration; Sensitivity
Keywords: Optimal control; Switched systems; Policy iteration; Continuous-time dynamics
Keywords: Zero-sum games; Ergodic control; Nonexpansive mappings; Fixed point sets; Policy iteration
Keywords: Nonlinear systems; Policy iteration; Co-design; State constraints; Uncertainties; Neural network
Keywords: Adaptive dynamic programming; Fault tolerant control; Policy iteration; Nonlinear systems; Fault observer; Neural network
Keywords: Anticipation; Intention-driven dynamics model; Partially observable Markov decision process; Policy iteration; Monte-Carlo planning
Keywords: Markov decision process; Variance criterion; Sensitivity-based optimization; Policy iteration; Policy gradient
Keywords: Markov decision process; MDP; Linear programming; Policy iteration; Total expected discounted reward; Treatment optimization
Keywords: Markov decision process; Discrete event systems; Parameterized policy; Policy iteration; Service rate control
Keywords: Adaptive dynamic programming; Decentralized control; Optimal control; Policy iteration; Neural networks
Keywords: Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Heterogeneous multi-agents; Graphical games; Policy iteration
Keywords: Economic speed; Dynamic programming; Markov and semi Markov decision processes; Policy iteration; Value iteration; Stochastic approximation
Keywords: Artificial pancreas; Diabetes; Gaussian processes; Policy iteration; Reinforcement learning; Stochastic optimal control
Keywords: Markov decision processes; Policy iteration; Dynamic programming; Constrained optimization
Keywords: C63; E21; Coleman operator; Policy iteration; Time iteration; Global convergence
Keywords: Optimal control; Reinforcement learning; Policy iteration; Neural networks; Input constraints
Keywords: Policy Iteration; Complexity; Markov Decision Process; Acyclic Unique Sink Orientation
Value set iteration for two-person zero-sum Markov games
Keywords: Two-person zero-sum Markov game; Value iteration; Policy iteration; Stochastic game
A complexity analysis of Policy Iteration through combinatorial matrices arising from Unique Sink Orientations
Keywords: Policy Iteration; Unique Sink Orientations; Complexity bounds; Combinatorial matrices
Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
Keywords: Linear quadratic tracker; Reinforcement learning; Policy iteration; Algebraic Riccati equation
Emergency orders in the periodic-review inventory system with fixed ordering costs and compound Poisson demand
Keywords: Backordering; Emergency order; Inventory control; Markov decision model; Policy iteration; Reorder-point policy
Partnership formation with age-dependent preferences
Keywords: Game theory; Partnership formation; Policy iteration; Equilibrium profile
Online policy iteration algorithm for optimal control of linear hyperbolic PDE systems
Keywords: Hyperbolic PDE systems; Distributed optimal control; Policy iteration; Space-dependent Riccati differential equation; Convergence
Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
Keywords: Q-learning; Adaptive control; LQR; Policy iteration; Optimization under uncertainties
Average control of Markov decision processes with Feller transition probabilities and general action spaces
Keywords: Markov Decision Processes; Average cost; General Borel spaces; Feller transition probabilities; Non-compact action set; Policy iteration
Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
Keywords: Graphical games; Cooperative Hamilton–Jacobi equations; Policy iteration; Cooperative Nash-equilibrium; Best response; Consensus
Bias optimality for multichain continuous-time Markov decision processes
Keywords: Bias optimality; Continuous-time Markov decision process; Difference formula; Multichain model; Policy iteration
Valuing programs with deterministic and stochastic cycles
Keywords: C13; C14; C15; Dynamic programming; Policy iteration; Deterministic cycles; Stochastic cycles; Circulant matrix; Cyclic inversion algorithm
Policy iteration for customer-average performance optimization of closed queueing systems
Keywords: Perturbation analysis; Customer-average performance; Policy iteration
Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
Keywords: Direct adaptive optimal control; Policy iteration; Neural networks; Online control
Adaptive importance sampling for value function approximation in off-policy reinforcement learning
Keywords: Off-policy reinforcement learning; Value function approximation; Policy iteration; Adaptive importance sampling; Importance-weighted cross-validation; Efficient sample reuse
Pseudo-likelihood estimation and bootstrap inference for structural discrete Markov decision models
Keywords: C12; C13; C14; C15; C44; C63; Edgeworth expansion; Finite mixture; k-step bootstrap; Maximum pseudo-likelihood estimators; Nested fixed point algorithm; Newton-Raphson method; Policy iteration
A policy improvement method for constrained average Markov decision processes
Keywords: Constrained Markov decision process; Policy improvement; Policy iteration
The control of a two-level Markov decision process by time aggregation
Keywords: Time aggregation; Markov decision processes; Two-level systems; Coupled decisions; Policy iteration; Performance potentials
PIRANHA: Policy iteration for recurrent artificial neural networks with hidden activities
Keywords: Recurrent neural networks; Policy iteration; Sequence learning; Multi-step prediction
Optimal and near-optimal policies for lost sales inventory models with at most one replenishment order outstanding
Keywords: Inventory; Lost sales; Semi-Markov decision processes; Policy iteration