suivant: À propos de ce
monter: corpus_html
précédent: 10 Convergence de l'algorithme
  Table des matières
- Abott, 1952
-
Abott, E. (1952).
Flatland. A Romance in Many Dimensions.
Dover Publications, New York.
- Ameisen, 1999
-
Ameisen, J. (1999).
La scupture du vivant. Le suicide cellulaire ou la mort
créatrice.
Seuil.
- Bain, 1873
-
Bain, A. (1873).
Mind and Body. The Theories of Their Relation.
Henry King, London.
- Barto et al., 1983
-
Barto, A., Sutton, R., and Anderson, C. (1983).
Neurolike adaptive elements that can solve difficult learning control
problems.
IEEE Transactions on Systems, Man, and Cybernetics,
SMC13:834-846.
- Bersini et Gorrini, 1996
-
Bersini, H. and Gorrini, V. (1996).
Three connectionist implementations of dynamic programming for
optimal control: A preliminary comparative analysis.
In Workshop on Neural Networks for Identification and Control in
Robotics.
- Chrisman, 1991
-
Chrisman, L. (1991).
Reinforcement learning with perceptual aliasing: The perceptual
distinctions approach.
In Tenth National Conference on AI (AAAI).
- Damasio, 1994
-
Damasio, A. (1994).
Descartes'Error: Emotion, Reason and the Human Brain.
Picador.
- Damasio, 1999
-
Damasio, A. (1999).
Le sentiment même de soi - Corps, émotions, conscience.
Editions Odile Jacob Sciences.
- Davesne et Barret, 1999a
-
Davesne, F. and Barret, C. (1999a).
Constraint based memory units for reactive navigation learning.
In European Workshop on Learning Robots.
- Davesne et Barret, 1999b
-
Davesne, F. and Barret, C. (1999b).
Reactive navigation of a mobile robot using a hierarchical set of
learning agents.
In IROS'99.
- Dayan et Sejnowski, 1994
-
Dayan, P. and Sejnowski, T. (1994).
Td() converges with probability 1.
Machine Learning, 14.
- Edelman, 1992
-
Edelman, G. (1992).
Bright Air, Brilliant Fire: On the Matter of Mind.
Basic Books, New York.
- Gamow, 1963
-
Gamow, G. (1963).
Un, deux, trois ... l'infini.
Dunod.
- Hebb, 1949
-
Hebb, D. (1949).
The Organization of Behavior.
John Wiley & Sons, New York.
- Hilbert, 1928
-
Hilbert, D. et Ackermann, W. (1928).
Grundzuge der Theoretischen Logik.
Springer, Berlin.
- Hsu et al., 1990
-
Hsu, F., Anantharaman, T., Campbell, M., and Nowatzyk, A. (1990).
A grandmaster chess machine.
Scientific American, 263(4):11-50.
- James, 1890
-
James, W. (1890).
Principles of Psychology.
Henry Holt, New York.
- Kahneman et Tversky, 1979
-
Kahneman, D. and Tversky, A. (1979).
Prospect theory: An analysis of decision under risk.
Econometrica, 47:263-291.
- Lecerf, 1997
-
Lecerf, C. (1997).
Une leçon de piano ou la double boucle de l'apprentissage
cognitif, volume 3.
Travaux et Documents, Université Paris 8 Vincennes-Saint-Denis.
- Littman, 1994
-
Littman, M. (1994).
Memoryless policies: Theoretical limitations and practical results.
In Dave Cliff, Philip Husbands, J.-A. M. and Stewart W. Wilson, e.,
editors, Proceedings of the Third International Conference on Simulation
of Adaptive Behavior. MIT Press.
- Littman et al., 1995
-
Littman, M., Cassandra, A., and Kaelbling, L. (1995).
Learning policies for partially observable environments: Scaling up.
In Prieditis, A. and Stuart Russell, e., editors, Twelfth
International Conference on Machine Learning, pages 362-370. Morgan
Kaufmann.
- McCulloch et Pitts, 1943
-
McCulloch, W. and Pitts, W. (1943).
A logical calculus of the ideas immanent in nervous activity.
Bulletin of Mathematical Biophysics, 5:115-137.
- Michel, 1996
-
Michel, O. (1996).
Khepera simulator package version 2.0: Freeware mobile robot
simulator.
http://wwwi3s.unice.fr/~om/khep-sim.html.
- Mondada et al., 1994
-
Mondada, F., Franzi, E., and Ienne, P. (1994).
Mobile robot miniaturization: A tool for investigation in control
algorithms.
In Yoshikawa, T. and Miyazaki, F., editors, Proceedings of the
Third International Symposium on Experimental Robotics 1993, pages 501-513.
Springer Verlag,.
- Munos, 1997
-
Munos, R. (1997).
Apprentissage par Renforcement, Étude du cas Continu.
PhD thesis, EHESS, CEMAGREF.
- Munos, 1999
-
Munos, R. (1999).
Variable resolution discretization for high-accuracy solutions of
optimal control problem.
International Joint Conference on Artificial Intelligence.
- O'Reagan et Noë, 2001
-
O'Reagan, J. and Noë, A. (2001).
A sensorimotor account of vision and visual consciousness.
Behavioral and Brain Sciences, 24(5).
- Pendrith, 1999
-
Pendrith, M. (1999).
Reinforcement learning in situated agents: Some theoretical problems
and practical solutions.
In 8th European Workshop on Learning Robots.
- Pendrith et McGarity, 1998
-
Pendrith, M. and McGarity, M. (1998).
An analysis of direct reinforcement learning in non-markovian
domains.
The Fifteenth International Conference on Machine Learning.
- Pitrat, 1990
-
Pitrat, J. (1990).
Métaconnaissance - Futur de l'intelligence artificielle.
Hermès.
- Rosenblatt, 1958
-
Rosenblatt, F. (1958).
he perceptron: A probabilistic model for information storage and
organization in the brain.
Psychological Review, 65:386-408.
- Rumelhart et al., 1986
-
Rumelhart, D., Hinton, G., and Williams, R. (1986).
Learning internal representations by error propagation.
Nature, 323:533-536.
- Samuel, 1959
-
Samuel, A. (1959).
Some studies in machine learning using the game of checkers.
IBM Journal of Research and Development, 3:211-229.
- Sauvage, 1999
-
Sauvage, G. (1999).
Les marchés financiers. Entre hasard et raison: le facteur
humain.
Seuil.
- Shortliffe et Buchanan, 1975
-
Shortliffe, E. and Buchanan, B. (1975).
A model of inexact reasoning in medicine.
Mathematical Biosciences, 23:351-379.
- Thagard et Barnes, 1996
-
Thagard, P. and Barnes, A. (1996).
Emotional decisions.
Proceedings og the Eighteenth Annual Conference of The Cognitive
Science Society, pages 426-429.
- Thagard et Millgram, 1997
-
Thagard, P. and Millgram, E. (1997).
Inference to the best plan: A coherence theory of decision.
Goal-Driven Learning, pages 439-454.
- Turing, 1936
-
Turing, A. (1936).
On computable numbers, with an application to the
entscheidungsproblem.
Proceedings of the London Mathematical Society, 42(2):230-265.
- Turing, 1950
-
Turing, A. (1950).
Computing machinery and intelligence.
Mind, 59:433-460.
- Tversky et Kahneman, 1981
-
Tversky, A. and Kahneman, D. (1981).
The framing of decisions and the psychology of choice.
Science, 211:453-458.
- Watzlawick, 1991
-
Watzlawick, P. (1991).
Les cheveux du baron de münchhausen. Psychothérapie et réalité.
Seuil.
- Weizenbaum, 1976
-
Weizenbaum, J. (1976).
Computer Power and Human Reason.
W.H. Freeman.
Frédéric Davesne
2001-07-13