Bibliographie

suivant: À propos de ce monter: corpus_html précédent: 10 Convergence de l'algorithme Table des matières

Bibliographie

Abott, 1952: Abott, E. (1952).
Flatland. A Romance in Many Dimensions.
Dover Publications, New York.
Ameisen, 1999: Ameisen, J. (1999).
La scupture du vivant. Le suicide cellulaire ou la mort créatrice.
Seuil.
Bain, 1873: Bain, A. (1873).
Mind and Body. The Theories of Their Relation.
Henry King, London.
Barto et al., 1983: Barto, A., Sutton, R., and Anderson, C. (1983).
Neurolike adaptive elements that can solve difficult learning control problems.
IEEE Transactions on Systems, Man, and Cybernetics, SMC13:834-846.
Bersini et Gorrini, 1996: Bersini, H. and Gorrini, V. (1996).
Three connectionist implementations of dynamic programming for optimal control: A preliminary comparative analysis.
In Workshop on Neural Networks for Identification and Control in Robotics.
Chrisman, 1991: Chrisman, L. (1991).
Reinforcement learning with perceptual aliasing: The perceptual distinctions approach.
In Tenth National Conference on AI (AAAI).
Damasio, 1994: Damasio, A. (1994).
Descartes'Error: Emotion, Reason and the Human Brain.
Picador.
Damasio, 1999: Damasio, A. (1999).
Le sentiment même de soi - Corps, émotions, conscience.
Editions Odile Jacob Sciences.
Davesne et Barret, 1999a: Davesne, F. and Barret, C. (1999a).
Constraint based memory units for reactive navigation learning.
In European Workshop on Learning Robots.
Davesne et Barret, 1999b: Davesne, F. and Barret, C. (1999b).
Reactive navigation of a mobile robot using a hierarchical set of learning agents.
In IROS'99.
Dayan et Sejnowski, 1994: Dayan, P. and Sejnowski, T. (1994).
Td( $\lambda$ ) converges with probability 1.
Machine Learning, 14.
Edelman, 1992: Edelman, G. (1992).
Bright Air, Brilliant Fire: On the Matter of Mind.
Basic Books, New York.
Gamow, 1963: Gamow, G. (1963).
Un, deux, trois ... l'infini.
Dunod.
Hebb, 1949: Hebb, D. (1949).
The Organization of Behavior.
John Wiley & Sons, New York.
Hilbert, 1928: Hilbert, D. et Ackermann, W. (1928).
Grundzuge der Theoretischen Logik.
Springer, Berlin.
Hsu et al., 1990: Hsu, F., Anantharaman, T., Campbell, M., and Nowatzyk, A. (1990).
A grandmaster chess machine.
Scientific American, 263(4):11-50.
James, 1890: James, W. (1890).
Principles of Psychology.
Henry Holt, New York.
Kahneman et Tversky, 1979: Kahneman, D. and Tversky, A. (1979).
Prospect theory: An analysis of decision under risk.
Econometrica, 47:263-291.
Lecerf, 1997: Lecerf, C. (1997).
Une leçon de piano ou la double boucle de l'apprentissage cognitif, volume 3.
Travaux et Documents, Université Paris 8 Vincennes-Saint-Denis.
Littman, 1994: Littman, M. (1994).
Memoryless policies: Theoretical limitations and practical results.
In Dave Cliff, Philip Husbands, J.-A. M. and Stewart W. Wilson, e., editors, Proceedings of the Third International Conference on Simulation of Adaptive Behavior. MIT Press.
Littman et al., 1995: Littman, M., Cassandra, A., and Kaelbling, L. (1995).
Learning policies for partially observable environments: Scaling up.
In Prieditis, A. and Stuart Russell, e., editors, Twelfth International Conference on Machine Learning, pages 362-370. Morgan Kaufmann.
McCulloch et Pitts, 1943: McCulloch, W. and Pitts, W. (1943).
A logical calculus of the ideas immanent in nervous activity.
Bulletin of Mathematical Biophysics, 5:115-137.
Michel, 1996: Michel, O. (1996).
Khepera simulator package version 2.0: Freeware mobile robot simulator.
http://wwwi3s.unice.fr/~om/khep-sim.html.
Mondada et al., 1994: Mondada, F., Franzi, E., and Ienne, P. (1994).
Mobile robot miniaturization: A tool for investigation in control algorithms.
In Yoshikawa, T. and Miyazaki, F., editors, Proceedings of the Third International Symposium on Experimental Robotics 1993, pages 501-513. Springer Verlag,.
Munos, 1997: Munos, R. (1997).
Apprentissage par Renforcement, Étude du cas Continu.
PhD thesis, EHESS, CEMAGREF.
Munos, 1999: Munos, R. (1999).
Variable resolution discretization for high-accuracy solutions of optimal control problem.
International Joint Conference on Artificial Intelligence.
O'Reagan et Noë, 2001: O'Reagan, J. and Noë, A. (2001).
A sensorimotor account of vision and visual consciousness.
Behavioral and Brain Sciences, 24(5).
Pendrith, 1999: Pendrith, M. (1999).
Reinforcement learning in situated agents: Some theoretical problems and practical solutions.
In 8th European Workshop on Learning Robots.
Pendrith et McGarity, 1998: Pendrith, M. and McGarity, M. (1998).
An analysis of direct reinforcement learning in non-markovian domains.
The Fifteenth International Conference on Machine Learning.
Pitrat, 1990: Pitrat, J. (1990).
Métaconnaissance - Futur de l'intelligence artificielle.
Hermès.
Rosenblatt, 1958: Rosenblatt, F. (1958).
he perceptron: A probabilistic model for information storage and organization in the brain.
Psychological Review, 65:386-408.
Rumelhart et al., 1986: Rumelhart, D., Hinton, G., and Williams, R. (1986).
Learning internal representations by error propagation.
Nature, 323:533-536.
Samuel, 1959: Samuel, A. (1959).
Some studies in machine learning using the game of checkers.
IBM Journal of Research and Development, 3:211-229.
Sauvage, 1999: Sauvage, G. (1999).
Les marchés financiers. Entre hasard et raison: le facteur humain.
Seuil.
Shortliffe et Buchanan, 1975: Shortliffe, E. and Buchanan, B. (1975).
A model of inexact reasoning in medicine.
Mathematical Biosciences, 23:351-379.
Thagard et Barnes, 1996: Thagard, P. and Barnes, A. (1996).
Emotional decisions.
Proceedings og the Eighteenth Annual Conference of The Cognitive Science Society, pages 426-429.
Thagard et Millgram, 1997: Thagard, P. and Millgram, E. (1997).
Inference to the best plan: A coherence theory of decision.
Goal-Driven Learning, pages 439-454.
Turing, 1936: Turing, A. (1936).
On computable numbers, with an application to the entscheidungsproblem.
Proceedings of the London Mathematical Society, 42(2):230-265.
Turing, 1950: Turing, A. (1950).
Computing machinery and intelligence.
Mind, 59:433-460.
Tversky et Kahneman, 1981: Tversky, A. and Kahneman, D. (1981).
The framing of decisions and the psychology of choice.
Science, 211:453-458.
Watzlawick, 1991: Watzlawick, P. (1991).
Les cheveux du baron de münchhausen. Psychothérapie et réalité.
Seuil.
Weizenbaum, 1976: Weizenbaum, J. (1976).
Computer Power and Human Reason.
W.H. Freeman.

Frédéric Davesne 2001-07-13