STatistical AI Reading Group previous readings: September-December, 2003

September 10 (Stairmaster: Mike): The Information Bottleneck Method (1999). Naftali Tishby, Fernando C. Pereira, William Bialek. Proc. of the 37th Annual Allerton Conference on Communication, Control and Computing. Citeseer.
September 18 (Stairmaster: Natalia & Luke): First-order probabilistic inference. David Poole. International Joint Conference on Artificial Intelligence 2003 (IJCAI-03). html
September 25 (Stairmaster: Natalia): Logical Filtering. E. Amir and S. Russell. International Joint Conference on Artificial Intelligence 2003 (IJCAI-03). Postscript
October 2 (Stairmaster: Hanna): Generalizing Plans to New Environments in Relational MDPs. Carlos Guestrin, Daphne Koller, Chris Gearhart and Neal Kanodia. International Joint Conference on Artificial Intelligence 2003 (IJCAI-03). Postscript.

October 9 (Stairmaster: Leon): Overview of RL and active vision. Suggested papers:

"A Reinforcement Learning Model of Selective Visual Attention", Silviu Minut and Sridhar Mahadevan, Proceedings of the 5th International Conference on Autonomous Agents, pp. 457-464, Montreal, Canada, 2001 http://www.cs.umass.edu/~mahadeva/papers/aa2001.ps.gz

"Learning Visual Routines with Reinforcement Learning" Andrew McCallum, ftp://ftp.cs.rochester.edu/pub/papers/robotics/96.mccallum-aaai-e.ps.gz

"Learning to generate artificial fovea trajectories for target detection" Schmidhuber, Juergen and Huber, R., International Journal of Neural Systems, 2(1 & 2):135-141, 1991, ftp://ftp.idsia.ch/pub/juergen/attention.ps.gz

"Residual Q-Learning Applied to Visual Attention" Bandera, C., Vico, F. J., Bravo, J. M., Harmon, M. E., & Baird, L. C. Proceedings of the Thirteenth International Conference on Machine Learning, Bari, Italy, 3-6 July (1996) http://www.leemon.com/papers/icml96/ICML96.pdf

"Active object recognition by view integration and reinforcement learning", Lucas Paletta, Axel Pinz, Int. J. Robotics and Autonomous Systems, 31(1-2):71-86, 2000. http://www.emt.tu-graz.ac.at/~pinz/onlinepapers/RAS00.pdf


October 16 (Stairmaster: Georgios):

Factored Planning. E. Amir and B. Engelhard, International Joint Conference on Artificial Intelligence 2003 (IJCAI-03). Postscript

October 30 (Stairmaster: Yu-Han):

Distributed Planning in Hierarchical Factored MDPs, Carlos Guestrin and Geoffrey Gordon, UAI-2002. Postscript.

November 6 (Stairmaster: Kurt):

"APRICODD: Approximate policy construction using decision diagrams", Robert St-Aubin, Jesse Hoey and Craig Boutilier, Advances in Neural Information Processing Systems 13 (NIPS 2000). ps.gz.

"Optimal and Approximate Stochastic Planning using Decision Diagrams" (2000), Jesse Hoey, Robert St-Aubin, Alan Hu, Craig Boutilier, tech report. ps.gz

"SPUDD: Stochastic Planning using Decision Diagrams", Jesse Hoey, Robert St-Aubin, Alan Hu and Craig Boutilier, Proceedings of UAI 99. ps.gz.

November 13 (Invited Speaker: Daniela P. de Farias):

The focus of this talk is on the approximate dynamic programming algorithm known as approximate linear programming. I will present background on the algorithm and then discuss my work, including performance and approximation error bounds, an efficient constraint sampling scheme for dealing with the large number of constraints the algorithm typically involves, and application to queueing networks.

November 20 (Stairmaster: Georgios):

"A Nonlinear Predictive State Representation", Matthew R. Rudary and Satinder Singh, NIPS-03. PDF

December 4 (Stairmaster: Bruno):

Eyal Even-Dar, Shie Mannor, Yishay Mansour: Action Elimination and Stopping Conditions for Reinforcement Learning. ICML 2003: 162-169. PDF