next up previous contents
suivant: À propos de ce monter: corpus_html précédent: 10 Convergence de l'algorithme   Table des matières

Bibliographie

Abott, 1952
Abott, E. (1952).
Flatland. A Romance in Many Dimensions.
Dover Publications, New York.

Ameisen, 1999
Ameisen, J. (1999).
La scupture du vivant. Le suicide cellulaire ou la mort créatrice.
Seuil.

Bain, 1873
Bain, A. (1873).
Mind and Body. The Theories of Their Relation.
Henry King, London.

Barto et al., 1983
Barto, A., Sutton, R., and Anderson, C. (1983).
Neurolike adaptive elements that can solve difficult learning control problems.
IEEE Transactions on Systems, Man, and Cybernetics, SMC13:834-846.

Bersini et Gorrini, 1996
Bersini, H. and Gorrini, V. (1996).
Three connectionist implementations of dynamic programming for optimal control: A preliminary comparative analysis.
In Workshop on Neural Networks for Identification and Control in Robotics.

Chrisman, 1991
Chrisman, L. (1991).
Reinforcement learning with perceptual aliasing: The perceptual distinctions approach.
In Tenth National Conference on AI (AAAI).

Damasio, 1994
Damasio, A. (1994).
Descartes'Error: Emotion, Reason and the Human Brain.
Picador.

Damasio, 1999
Damasio, A. (1999).
Le sentiment même de soi - Corps, émotions, conscience.
Editions Odile Jacob Sciences.

Davesne et Barret, 1999a
Davesne, F. and Barret, C. (1999a).
Constraint based memory units for reactive navigation learning.
In European Workshop on Learning Robots.

Davesne et Barret, 1999b
Davesne, F. and Barret, C. (1999b).
Reactive navigation of a mobile robot using a hierarchical set of learning agents.
In IROS'99.

Dayan et Sejnowski, 1994
Dayan, P. and Sejnowski, T. (1994).
Td($ \lambda$) converges with probability 1.
Machine Learning, 14.

Edelman, 1992
Edelman, G. (1992).
Bright Air, Brilliant Fire: On the Matter of Mind.
Basic Books, New York.

Gamow, 1963
Gamow, G. (1963).
Un, deux, trois ... l'infini.
Dunod.

Hebb, 1949
Hebb, D. (1949).
The Organization of Behavior.
John Wiley & Sons, New York.

Hilbert, 1928
Hilbert, D. et Ackermann, W. (1928).
Grundzuge der Theoretischen Logik.
Springer, Berlin.

Hsu et al., 1990
Hsu, F., Anantharaman, T., Campbell, M., and Nowatzyk, A. (1990).
A grandmaster chess machine.
Scientific American, 263(4):11-50.

James, 1890
James, W. (1890).
Principles of Psychology.
Henry Holt, New York.

Kahneman et Tversky, 1979
Kahneman, D. and Tversky, A. (1979).
Prospect theory: An analysis of decision under risk.
Econometrica, 47:263-291.

Lecerf, 1997
Lecerf, C. (1997).
Une leçon de piano ou la double boucle de l'apprentissage cognitif, volume 3.
Travaux et Documents, Université Paris 8 Vincennes-Saint-Denis.

Littman, 1994
Littman, M. (1994).
Memoryless policies: Theoretical limitations and practical results.
In Dave Cliff, Philip Husbands, J.-A. M. and Stewart W. Wilson, e., editors, Proceedings of the Third International Conference on Simulation of Adaptive Behavior. MIT Press.

Littman et al., 1995
Littman, M., Cassandra, A., and Kaelbling, L. (1995).
Learning policies for partially observable environments: Scaling up.
In Prieditis, A. and Stuart Russell, e., editors, Twelfth International Conference on Machine Learning, pages 362-370. Morgan Kaufmann.

McCulloch et Pitts, 1943
McCulloch, W. and Pitts, W. (1943).
A logical calculus of the ideas immanent in nervous activity.
Bulletin of Mathematical Biophysics, 5:115-137.

Michel, 1996
Michel, O. (1996).
Khepera simulator package version 2.0: Freeware mobile robot simulator.
http://wwwi3s.unice.fr/~om/khep-sim.html.

Mondada et al., 1994
Mondada, F., Franzi, E., and Ienne, P. (1994).
Mobile robot miniaturization: A tool for investigation in control algorithms.
In Yoshikawa, T. and Miyazaki, F., editors, Proceedings of the Third International Symposium on Experimental Robotics 1993, pages 501-513. Springer Verlag,.

Munos, 1997
Munos, R. (1997).
Apprentissage par Renforcement, Étude du cas Continu.
PhD thesis, EHESS, CEMAGREF.

Munos, 1999
Munos, R. (1999).
Variable resolution discretization for high-accuracy solutions of optimal control problem.
International Joint Conference on Artificial Intelligence.

O'Reagan et Noë, 2001
O'Reagan, J. and Noë, A. (2001).
A sensorimotor account of vision and visual consciousness.
Behavioral and Brain Sciences, 24(5).

Pendrith, 1999
Pendrith, M. (1999).
Reinforcement learning in situated agents: Some theoretical problems and practical solutions.
In 8th European Workshop on Learning Robots.

Pendrith et McGarity, 1998
Pendrith, M. and McGarity, M. (1998).
An analysis of direct reinforcement learning in non-markovian domains.
The Fifteenth International Conference on Machine Learning.

Pitrat, 1990
Pitrat, J. (1990).
Métaconnaissance - Futur de l'intelligence artificielle.
Hermès.

Rosenblatt, 1958
Rosenblatt, F. (1958).
he perceptron: A probabilistic model for information storage and organization in the brain.
Psychological Review, 65:386-408.

Rumelhart et al., 1986
Rumelhart, D., Hinton, G., and Williams, R. (1986).
Learning internal representations by error propagation.
Nature, 323:533-536.

Samuel, 1959
Samuel, A. (1959).
Some studies in machine learning using the game of checkers.
IBM Journal of Research and Development, 3:211-229.

Sauvage, 1999
Sauvage, G. (1999).
Les marchés financiers. Entre hasard et raison: le facteur humain.
Seuil.

Shortliffe et Buchanan, 1975
Shortliffe, E. and Buchanan, B. (1975).
A model of inexact reasoning in medicine.
Mathematical Biosciences, 23:351-379.

Thagard et Barnes, 1996
Thagard, P. and Barnes, A. (1996).
Emotional decisions.
Proceedings og the Eighteenth Annual Conference of The Cognitive Science Society, pages 426-429.

Thagard et Millgram, 1997
Thagard, P. and Millgram, E. (1997).
Inference to the best plan: A coherence theory of decision.
Goal-Driven Learning, pages 439-454.

Turing, 1936
Turing, A. (1936).
On computable numbers, with an application to the entscheidungsproblem.
Proceedings of the London Mathematical Society, 42(2):230-265.

Turing, 1950
Turing, A. (1950).
Computing machinery and intelligence.
Mind, 59:433-460.

Tversky et Kahneman, 1981
Tversky, A. and Kahneman, D. (1981).
The framing of decisions and the psychology of choice.
Science, 211:453-458.

Watzlawick, 1991
Watzlawick, P. (1991).
Les cheveux du baron de münchhausen. Psychothérapie et réalité.
Seuil.

Weizenbaum, 1976
Weizenbaum, J. (1976).
Computer Power and Human Reason.
W.H. Freeman.



Frédéric Davesne 2001-07-13