Sample efficient multiagent learning in the presence of Markovian agents /
نام عام مواد
[Book]
نام نخستين پديدآور
Doran Chakraborty
مشخصات ظاهری
نام خاص و کميت اثر
1 online resource (xvii, 147 pages)
فروست
عنوان فروست
Studies in computational intelligence ;
مشخصه جلد
volume 523
یادداشتهای مربوط به کتابنامه ، واژه نامه و نمایه های داخل اثر
متن يادداشت
Includes bibliographical references
یادداشتهای مربوط به مندرجات
متن يادداشت
Introduction -- Background -- Learn or Exploit in Adversary Induced Markov Decision Processes -- Convergence, Targeted Optimality and Safety in Multiagent Learning -- Maximizing -- Targeted Modeling of Markovian agents -- Structure Learning in Factored MDPs -- Related Work -- Conclusion and Future Work
بدون عنوان
0
یادداشتهای مربوط به خلاصه یا چکیده
متن يادداشت
The problem of Multiagent Learning (or MAL) is concerned with the study of how intelligent entities can learn and adapt in the presence of other such entities that are simultaneously adapting. The problem is often studied in the stylized settings provided by repeated matrix games (a.k.a. normal form games). The goal of this book is to develop MAL algorithms for such a setting that achieve a new set of objectives which have not been previously achieved. In particular this book deals with learning in the presence of a new class of agent behavior that has not been studied or modeled before in a MAL context: Markovian agent behavior. Several new challenges arise when interacting with this particular class of agents. The book takes a series of steps towards building completely autonomous learning algorithms that maximize utility while interacting with such agents. Each algorithm is meticulously specified with a thorough formal treatment that elucidates its key theoretical properties
ویراست دیگر از اثر در قالب دیگر رسانه
عنوان
Sample efficient multiagent learning in the presence of Markovian agents.
شماره استاندارد بين المللي کتاب و موسيقي
3319026054
موضوع (اسم عام یاعبارت اسمی عام)
موضوع مستند نشده
Machine learning
موضوع مستند نشده
Markov processes
موضوع مستند نشده
Multiagent systems
رده بندی کنگره
شماره رده
QA76
.
76
.
I58
نشانه اثر
C43
2014
نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )