I have one of Richard Sutton's books, and RL methods are useful but I also have some reservations about them. Often in this sort of approach a strict behaviorist position is adopted where the system is simply trying to find an appropriate function mapping inputs to outputs. The internals of the system are usually treated as a black box with a homogenous structure, and it's this zero architecture or trivial architecture approach which can make the learning problem exceptionally hard.
----- This list is sponsored by AGIRI: http://www.agiri.org/email To unsubscribe or change your options, please go to: http://v2.listbox.com/member/?member_id=231415&user_secret=e9e40a7e