Author Winder, Phil.

Title Reinforcement Learning [electronic resource] / Phil Winder. [O'Reilly electronic resource]

Imprint [S.l.] : O'Reilly Media, Inc., 2020.
Description 1 online resource
Note Title from content provider.
Contents Intro -- Copyright -- Table of Contents -- Preface -- Objective -- Who Should Read This Book? -- Guiding Principles and Style -- Prerequisites -- Scope and Outline -- Supplementary Materials -- Conventions Used in This Book -- Acronyms -- Mathematical Notation -- Fair Use Policy -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Why Reinforcement Learning? -- Why Now? -- Machine Learning -- Reinforcement Learning -- When Should You Use RL? -- RL Applications -- Taxonomy of RL Approaches -- Model-Free or Model-Based -- How Agents Use and Update Their Strategy
Discrete or Continuous Actions -- Optimization Methods -- Policy Evaluation and Improvement -- Fundamental Concepts in Reinforcement Learning -- The First RL Algorithm -- Is RL the Same as ML? -- Reward and Feedback -- Reinforcement Learning as a Discipline -- Summary -- Further Reading -- Chapter 2. Markov Decision Processes, Dynamic Programming, and Monte Carlo Methods -- Multi-Arm Bandit Testing -- Reward Engineering -- Policy Evaluation: The Value Function -- Policy Improvement: Choosing the Best Action -- Simulating the Environment -- Running the Experiment
Speedy Q-Learning -- Accumulating Versus Replacing Eligibility Traces -- Summary -- Further Reading -- Chapter 4. Deep Q-Networks -- Deep Learning Architectures -- Fundamentals -- Common Neural Network Architectures -- Deep Learning Frameworks -- Deep Reinforcement Learning -- Deep Q-Learning -- Experience Replay -- Q-Network Clones -- Neural Network Architecture -- Implementing DQN -- Example: DQN on the CartPole Environment -- Case Study: Reducing Energy Usage in Buildings -- Rainbow DQN -- Distributional RL -- Prioritized Experience Replay -- Noisy Nets -- Dueling Networks
Summary Reinforcement learning (RL) will deliver one of the biggest breakthroughs in AI over the next decade, enabling algorithms to learn from their environment to achieve arbitrary goals. This exciting development avoids constraints found in traditional machine learning (ML) algorithms. This practical book shows data science and AI professionals how to learn by reinforcement and enable a machine to learn by itself. Author Phil Winder of Winder Research covers everything from basic building blocks to state-of-the-art practices. You'll explore the current state of RL, focus on industrial applications, learn numerous algorithms, and benefit from dedicated chapters on deploying RL solutions to production. This is no cookbook; it doesn't shy away from math and expects familiarity with ML.
- Learn what RL is and how the algorithms help solve problems
- Become grounded in RL fundamentals including Markov decision processes, dynamic programming, and temporal difference learning
- Dive deep into a range of value and policy gradient methods
- Apply advanced RL solutions such as meta learning, hierarchical learning, multi-agent, and imitation learning
- Understand cutting-edge deep RL algorithms including Rainbow, PPO, TD3, SAC, and more
- Get practical examples through the accompanying website
ISBN 9781098114831 (paperback)
1098114833 (paperback)
EBOOK