Bültmann & Gerriets
Strength or Accuracy: Credit Assignment in Learning Classifier Systems
by Tim Kovacs
Publisher: Springer London
Series: Distinguished Dissertations
Hardcover
ISBN: 978-1-4471-1058-3
Edition: Softcover reprint of the original 1st ed. 2004
Published: 4 October 2012
Language: English
Dimensions: 235 mm (H) x 155 mm (W) x 18 mm (D)
Weight: 499 g
Length: 328 pages

Price: €160.49


Description

Classifier systems are an intriguing approach to a broad range of machine learning problems, based on automated generation and evaluation of condition/action rules. In reinforcement learning tasks they simultaneously address the two major problems of learning a policy and generalising over it (and related objects, such as value functions). Despite over 20 years of research, however, classifier systems have met with mixed success, for reasons which were often unclear. Finally, in 1995 Stewart Wilson claimed a long-awaited breakthrough with his XCS system, which differs from earlier classifier systems in a number of respects, the most significant of which is the way in which it calculates the value of rules for use by the rule generation system. Specifically, XCS (like most classifier systems) employs a genetic algorithm for rule generation, and the way in which it calculates rule fitness differs from earlier systems. Wilson described XCS as an accuracy-based classifier system and earlier systems as strength-based. The two differ in that in strength-based systems the fitness of a rule is proportional to the return (reward/payoff) it receives, whereas in XCS it is a function of the accuracy with which return is predicted. The difference is thus one of credit assignment, that is, of how a rule's contribution to the system's performance is estimated. XCS is a Q-learning system; in fact, it is a proper generalisation of tabular Q-learning, in which rules aggregate states and actions. In XCS, as in other Q-learners, Q-values are used to weight action selection.
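
The strength/accuracy distinction described above can be illustrated with a small Python sketch. It is not taken from the book and is not a full XCS implementation: the `Rule` class, the parameter values (`BETA`, `EPS_0`, `ALPHA`, `NU`) and the payoff sequences are assumptions chosen for illustration, the accuracy function is only loosely modelled on the form commonly used in XCS descriptions, and the genetic algorithm and relative-accuracy normalisation are omitted entirely.

```python
from dataclasses import dataclass

# All parameter values below are illustrative assumptions, not values from the book.
BETA = 0.2    # learning rate for the running estimates
EPS_0 = 10.0  # prediction error below which a rule counts as fully accurate
ALPHA = 0.1   # scaling factor applied to inaccurate rules
NU = 5.0      # exponent controlling how quickly accuracy falls off with error


@dataclass
class Rule:
    """Hypothetical rule holding a payoff prediction and a prediction error."""
    name: str
    prediction: float = 0.0
    error: float = 0.0

    def update(self, payoff: float) -> None:
        # Move the error estimate towards the current absolute prediction error,
        # then move the prediction towards the payoff just received.
        self.error += BETA * (abs(payoff - self.prediction) - self.error)
        self.prediction += BETA * (payoff - self.prediction)

    def strength_fitness(self) -> float:
        # Strength-based view: fitness follows the payoff the rule receives.
        return self.prediction

    def accuracy_fitness(self) -> float:
        # Accuracy-based view: fitness depends on how well payoff is predicted.
        if self.error < EPS_0:
            return 1.0
        return ALPHA * (self.error / EPS_0) ** -NU


# An overgeneral rule that is sometimes right (payoff 1000) and sometimes wrong (0),
# and a specific rule that always receives the same modest payoff (300).
overgeneral = Rule("overgeneral")
specific = Rule("specific")
for step in range(200):
    overgeneral.update(1000.0 if step % 2 == 0 else 0.0)
    specific.update(300.0)

for rule in (overgeneral, specific):
    print(rule.name, round(rule.strength_fitness(), 1), rule.accuracy_fitness())
```

Under these assumed payoffs the overgeneral rule ends with the higher strength but a near-zero accuracy, while the specific rule has lower strength and full accuracy, so the two fitness definitions rank the rules in opposite orders.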



Table of contents

Introduction.- Learning Classifier Systems.- How Strength and Accuracy Differ.- What Should a Classifier System Learn?- Prospects for Adaption.- Classifier Systems and Q-Learning.- Conclusion.- Appendices.- Evaluation of Macroclassifiers.- Example XCS Cycle.- Learning from Reinforcement.- Generalisation Problems.- Value Estimation Algorithms.- Generalised Policy Iteration Algorithms.- Evolutionary Algorithms.- The Origins of Sarsa.- Notation.- References.

