Consider the average reward version of the model in Problem 6.1.
a. Show that the model is recurrent.
b. Find a .Ol-optimal policy using value iteration.
c. Find an optimal policy using policy iteration.
d. Show that the model satisfies one of the hypotheses of Theorem 8.5.3 and conclude that value iteration converges. Determine its rate of convergence with respect to the span seminorm.
e. Find an optimal policy using modified policy iteration.
f. For what values of h is the average optimal policy equal to the optimal policy for the discounted model?
g. Solve the problem using linear programming.
h. Solve a constrained version of the problem, in which the averagc cost of sending catalogs does not exceed $7.50 per period. Investigate the sensitivity of the optimal policy to this cost. Display your results graphically by plotting the optimal policy versus this cost.
Delivering a high-quality product at a reasonable price is not enough anymore.
That’s why we have developed 5 beneficial guarantees that will make your experience with our service enjoyable, easy, and safe.
You have to be 100% sure of the quality of your product to give a money-back guarantee. This describes us perfectly. Make sure that this guarantee is totally transparent.
Read moreEach paper is composed from scratch, according to your instructions. It is then checked by our plagiarism-detection software. There is no gap where plagiarism could squeeze in.
Read moreThanks to our free revisions, there is no way for you to be unsatisfied. We will work on your paper until you are completely happy with the result.
Read moreYour email is safe, as we store it according to international data protection rules. Your bank details are secure, as we use only reliable payment systems.
Read moreBy sending us your money, you buy the service we provide. Check out our terms and conditions if you prefer business talks to be laid out in official language.
Read more