Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Applied reinforcement learning with Python : with OpenAI Gym, Tensorflow, and Keras
Beysolow, II T., Apress, New York, NY, 2019. 184 pp. Type: Book (978-1-484251-26-3)
Date Reviewed: May 22 2020

This is a small book on the broader topic of reinforcement learning (RL), written by a practitioner for practitioners. It is very practically oriented, but with limited theoretical background. The book narrative is built around OpenAI Gym, a popular toolkit for the development of RL algorithms. The text gives plenty of technical details related to the libraries, environments, and so on, along with examples in Python.

The first chapter provides a general and granular formulation of RL, but is far from providing deep or comprehensive descriptions of this machine learning technique. Chapter 2 is an overview of common RL algorithms. Here, the author very concisely describes each of the main RL algorithms without diving into long discussions or arguments. However, despite the focus on RL, readers can still get brief introductions to more common optimization algorithms such as gradient descent. A plus for this chapter is its practical discussion of cloud resources. The author gives useful guidelines for docketing experiments in Amazon Web Services (AWS) and/or Google Cloud. As such, this part of the book can be used not only as a quick reference, but also as a short practical guide. However, the interested reader should look elsewhere for more theoretical details.

The book is not a textbook; it is oriented toward enthusiasts and data scientists. The most interesting and most valuable chapter (3) discusses several variants of a popular Q-learning algorithm. The book has a large appendix with source code, which could be very valuable for beginners who want practice. Overall, this is a good book for practitioners who may be interested in developing code for RL.

Reviewer:  Alexander Tzanov Review #: CR146977 (2011-0258)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Learning (I.2.6 )
 
 
Multiparadigm Languages (D.3.2 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Learning": Date
Learning in parallel networks: simulating learning in a probabilistic system
Hinton G. (ed) BYTE 10(4): 265-273, 1985. Type: Article
Nov 1 1985
Macro-operators: a weak method for learning
Korf R. Artificial Intelligence 26(1): 35-77, 1985. Type: Article
Feb 1 1986
Inferring (mal) rules from pupils’ protocols
Sleeman D.  Progress in artificial intelligence (, Orsay, France,391985. Type: Proceedings
Dec 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy