Reinforcement Learning and
Artificial
Intelligence (RLAI) |
|
Project
Proposals |
[SDS94]
Nicol N. Schraudolph, Peter Dayan, and Terrence J. Sejnowski. Temporal
difference learning of position evaluation in the game of Go. In Advances
in Neural Information Processing 6. Morgan Kaufmann, 1994.
[ http://www.gatsby.ucl.ac.uk/~dayan/papers/sds94.pdf ]
[Enz96]
Markus Enzenberger. The integration of a priori knowledge into a Go
playing neural network, 1996. Available by Internet.
[ http://www.markus-enzenberger.de/neurogo.ps.gz ]
[Enz03]
Markus Enzenberger. Evaluation in Go by a neural network using soft
segmentation. In 10th Advances in Computer Games conference, pages 97-108,
2003.
[ http://www.markus-enzenberger.de/neurogo3.ps.gz ]
I've been thinking a lot about doing something that comes from a real problem, and there would probably be numerous examples here, like controlling industrial processes, driving (or racing) a car, deciding how much supply to order if you're a store owner - or, most importantly deciding what to do for your course project. But, since I know close to nothing about all these, the "building the environment" part just seemed too complicated.
Anyway, I am considering all proposals from other people interested in working with me - I could even try to make up a CV if required :) . I hope tomorrow after the class I will be able to decide upon something, or that.
Cosmin