Skip to content

Personal tools
Document Actions

Using Decision Trees as the Answer Networks in Temporal Difference-Networks

Laura-Andreea Antanas, Kurt Driessens, Jan Ramon and Tom Croonenborghs

What Talk
When 2008-07-01
from 10:25 to 10:50
Add event to calendar vCal
iCal
Temporal difference networks (or TD-Nets) offer a framework for predictive state representations. TD-Nets break up into two parts: the question network and the answer network. The question network defines which questions about future observations are of importance, while the answer network provides a way to update the answers to those questions as the environment changes. Currently, TD-Nets use logistic regression functions to represent the answer networks. We propose the use of probability trees in their stead. Trees offer a different but powerful way of generalisation and using them may be beneficial in a number of applications. Moreover, we believe this aids in a better understanding of the strengths and weaknesses of TD-Nets and represents an important first step towards the application of temporal difference networks in environments with more extensive, i.e. complex and numerous, observations than those currently employed. We compare the learning behavior of TD-Nets using logistic regression and probability trees using an array of experiments in two simple grid worlds and a ring world.

This site conforms to the following standards: WCAG Valid XHTML Valid CSS