Close category search window
 

Simulation, learning, and optimization techniques in Watson's game strategies

Full text access may be available

To access full text, please use your member or institutional sign in.


This paper appears in:
IBM Journal of Research and Development
Date of Publication: May-June 2012
Author(s): Tesauro, G.
IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY, USA
Gondek, D. C. ;  Lenchner, J. ;  Fan, J. ;  Prager, J. M.
Volume: 56,   Issue: 3.4
On Page(s): 16:1 - 16:11
Product Type: Journals & Magazines

Available Formats Non-Member Price Member Price
US31 US31
Learn how you can qualify for the best price for the item!
  • Email
  • Print
  • Rights And Permissions

Abstract

The game of Jeopardy!™ features four types of strategic decision-making: 1) Daily Double wagering; 2) Final Jeopardy! wagering; 3) selecting the next square when in control of the board; and 4) deciding whether to attempt to answer, i.e., buzz in. Strategies that properly account for the game state and future event probabilities can yield a huge boost in overall winning chances, when compared with simple rule-of-thumb strategies. In this paper, we present an approach to developing and testing components to make said strategy decisions, founded upon development of reasonably faithful simulation models of the players and the Jeopardy! game environment. We describe machine learning and Monte Carlo methods used in simulations to optimize the respective strategy algorithms. Application of these methods yielded superhuman game strategies for IBM Watson™ that significantly enhanced its overall competitive record.

Note: The Institute of Electrical and Electronics Engineers, Incorporated is distributing this Article with permission of the International Business Machines Corporation (IBM) who is the exclusive owner. The recipient of this Article may not assign, sublicense, lease, rent or otherwise transfer, reproduce, prepare derivative works, publicly display or perform, or distribute the Article.  

Index Terms

Index Terms are available to subscribers and IEEE members.

 





Need Help?


IEEE Advancing Technology for Humanity About IEEE Xplore | Contact | Help | Privacy & Security | Terms of Use | Nondiscrimination Policy | Accessibility | Site Map

A non-profit organization, IEEE is the world's largest professional association for the advancement of technology.
© Copyright 2012 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.