STatistical AI Reading Group previous readings: June-August, 2002
- June 13 (Stairmaster: Georgios):
- Xavier Boyen and Daphne Koller, Tractable Inference for Complex
Stochastic Processes, Proceedings of the 14th Annual Conference
on Uncertainty in Artificial Intelligence (UAI-98). Longer
version (Postscript), on-line slides (html)
- June 21 (Stairmaster: Leslie & Luke):
- Adnan Darwiche and Matthew L. Ginsberg. A symbolic
generalization of probability theory. In Proceedings of the
Tenth National Conference on Artificial Intelligence (AAAI),
pages 622-627, 1992. Postscript
- June 27 (Special STAIR meeting: LOCATION: 8th floor playroom at
1:00 pm.)
- Talk by Jennifer Dy from Northeastern University: "Feature
Selection for Unsupervised Learning Applied to Content-Based
Image Retrieval". Abstract
- July 2 (Stairmaster: Terran):
- M. Kearns, Y. Mansour and A. Ng. A Sparse Sampling Algorithm
for Near-Optimal Planning in Large Markov Decision
Processes. Proceedings of the Sixteenth International Joint
Conference on Artificial Intelligence, Morgan Kaufmann, 1999,
pages 1324--1331. To appear in a special issue of the journal
Machine Learning. Postscript
- David A. McAllester and Satinder Singh. Approximate Planning
for Factored POMDPs using Belief State Simplification.
Proceedings of the Fifteenth Conference on Uncertainty in
Artificial Intelligence (UAI-99) Postscript
- July 11 (Stairmaster: Georgios):
- D. Freitag and A. McCallum, Information extraction with HMM
structures learned by stochastic optimization. Proceedings of
AAAI-2000. Postscript
- A. Stolcke & S. Omohundro (1992), Hidden Markov Model Induction
by Bayesian Model Merging. In Advances in Neural Information
Processing Systems 5, S. J. Hanson, J. D. Cowan & C. L. Giles,
editors, Morgan Kaufmann, pp. 11-18. Postscript
(optional reading)
- Matthew Brand, An entropic estimator for structure discovery,
in Neural Information Processing Systems 11. PDF
- July 18 (Stairmaster: Georgios):
- Blai Bonet, An epsilon-Optimal Grid-Based Algorithm for
Partially Observable Markov Decision Processes (ICML 2002)
Postscript
- July 25 (Stairmaster: Georgios):
- G. Z. Grudic and L. H. Ungar. Exploiting Multiple Secondary
Reinforcers in Policy Gradient Reinforcement Learning, Seventeenth
International Joint Conference on Artificial Intelligence (IJCAI
01), August 4th - 10th, 2001, Seattle, Washington.
Postscript
(background material)
- Sutton, R.S., McAllester, D., Singh, S., Mansour,
Y. (2000). Policy Gradient Methods for Reinforcement Learning with
Function Approximation. Advances in Neural Information Processing
Systems 12 (Proceedings of the 1999 conference),
pp. 1057-1063. MIT Press.
Postscript
- August 1 (Stairmaster: Georgios):
- Policy invariance under reward transformations: Theory
and application to reward shaping, Andrew Y. Ng, Daishi Harada
and Stuart Russell. In Proceedings of the Sixteenth
International Conference on Machine Learning, 1999.
Postscript
- August 8 (Stairmaster: Georgios):
- J. Bagnell and J. Schneider, Autonomous Helicopter Control using
Reinforcement Learning Policy Search Methods. Proceedings of the
International Conference on Robotics and Automation 2001, IEEE,
May, 2001.
Postscript
- August 14 (Stairmaster: Luke):
- D. Poole, "Probabilistic Partial Evaluation: Exploiting rule
structure in probabilistic inference", Proc. Fifteenth
International Joint Conference on Artificial Intelligence
(IJCAI-97), Nagoya, Japan, August 1997, pp. 1284-1291.
html
- D. Poole, "Context-specific approximation in probabilistic
inference", Proc. Fourteenth Conference on Uncertainty in
Artificial Intelligence (UAI-98), Madison, Wisconsin, pages
447-454, July 1998. html
- August 21 & 27 (Stairmaster: Michael):
- Stuart Geman, Donald Geman - "Stochastic Relaxation, Gibbs
Distributions, and the Bayesian Restoration of Images" IEEE
Transactions on Pattern Analysis and Machine Intelligence,
November 1984. (paper copy outside office NE43-781).