Cooperation with Unknown Agents in Multi-agent Environment

Bašta, Přemysl

Spolupráce s neznámými agenty v multi-agentním prostředí

dc.contributor.advisor	Pilát, Martin
dc.creator	Bašta, Přemysl
dc.date.accessioned	2023-07-24T14:18:38Z
dc.date.available	2023-07-24T14:18:38Z
dc.date.issued	2023
dc.identifier.uri	http://hdl.handle.net/20.500.11956/181940
dc.description.abstract	Over the past few decades, we have witnessed great successes in the field of deep and reinforcement learning. Great achievements have been made in many competitive settings, both in single-agent and multi-agent environments, where AI has managed to outperform human experts and even entire teams of human experts. However, the situation is much more difficult when cooperation is required in purely cooperative environments. We first give a brief overview of reinforcement learning theory and current state of the art algorithms. We then extend the theory to multi-agent systems, where several related issues are discussed. And finally, we propose novel approaches of agent training where we use a simplified multi-agent cooperative cooking game environment based on the popular game Overcooked, we attempt to train agents that are robust and capable of ad hoc agent cooperation.	en_US
dc.description.abstract	V posledních několika desetiletích jsme byli svědky velkých úspěchů v oblasti hlubokého a zpětnovazebního učení. Velkých úspěchů bylo dosaženo v mnoha kompetitivních prostředích, a to jak v prostředí s jedním agentem, tak i v multiagentním prostředí, kde umělá inteligence dokázala překonat celé týmy lidských expertů. Situace se však zdá být mnohem obtížnější, pokud jsou multiagentní prostředí čistě kooperativní. V práci nejprve uvedeme stručný přehled teorie zpětnovazebního učení a současných populárních algoritmů. Poté teorii rozšíříme na multiagentní systémy kde se budeme zabývat problémy, které jsou s nimi spjaté. A nakonec navrhneme nové přístupy trénování agentů, kde se pomocí zjednodušeného prostředí kooperativní hry s více agenty založené na populární hře Overcooked pokusíme trénovat agenty, kteří jsou robustní a schopní spolupráce s neznámými agenty.	cs_CZ
dc.language	English	cs_CZ
dc.language.iso	en_US
dc.publisher	Univerzita Karlova, Matematicko-fyzikální fakulta	cs_CZ
dc.subject	Reinforcement Learning\|Multi-Agent Systems\|Ad-hoc Cooperation	en_US
dc.subject	Zpětnovazební učení\|Multiagentní systém\|Spolupráce s neznámými agenty	cs_CZ
dc.title	Cooperation with Unknown Agents in Multi-agent Environment	en_US
dc.type	diplomová práce	cs_CZ
dcterms.created	2023
dcterms.dateAccepted	2023-06-12
dc.description.department	Katedra teoretické informatiky a matematické logiky	cs_CZ
dc.description.department	Department of Theoretical Computer Science and Mathematical Logic	en_US
dc.description.faculty	Faculty of Mathematics and Physics	en_US
dc.description.faculty	Matematicko-fyzikální fakulta	cs_CZ
dc.identifier.repId	256769
dc.title.translated	Spolupráce s neznámými agenty v multi-agentním prostředí	cs_CZ
dc.contributor.referee	Straka, Milan
thesis.degree.name	Mgr.
thesis.degree.level	navazující magisterské	cs_CZ
thesis.degree.discipline	Informatika - Umělá inteligence	cs_CZ
thesis.degree.discipline	Computer Science - Artificial Intelligence	en_US
thesis.degree.program	Informatika - Umělá inteligence	cs_CZ
thesis.degree.program	Computer Science - Artificial Intelligence	en_US
uk.thesis.type	diplomová práce	cs_CZ
uk.taxonomy.organization-cs	Matematicko-fyzikální fakulta::Katedra teoretické informatiky a matematické logiky	cs_CZ
uk.taxonomy.organization-en	Faculty of Mathematics and Physics::Department of Theoretical Computer Science and Mathematical Logic	en_US
uk.faculty-name.cs	Matematicko-fyzikální fakulta	cs_CZ
uk.faculty-name.en	Faculty of Mathematics and Physics	en_US
uk.faculty-abbr.cs	MFF	cs_CZ
uk.degree-discipline.cs	Informatika - Umělá inteligence	cs_CZ
uk.degree-discipline.en	Computer Science - Artificial Intelligence	en_US
uk.degree-program.cs	Informatika - Umělá inteligence	cs_CZ
uk.degree-program.en	Computer Science - Artificial Intelligence	en_US
thesis.grade.cs	Výborně	cs_CZ
thesis.grade.en	Excellent	en_US
uk.abstract.cs	V posledních několika desetiletích jsme byli svědky velkých úspěchů v oblasti hlubokého a zpětnovazebního učení. Velkých úspěchů bylo dosaženo v mnoha kompetitivních prostředích, a to jak v prostředí s jedním agentem, tak i v multiagentním prostředí, kde umělá inteligence dokázala překonat celé týmy lidských expertů. Situace se však zdá být mnohem obtížnější, pokud jsou multiagentní prostředí čistě kooperativní. V práci nejprve uvedeme stručný přehled teorie zpětnovazebního učení a současných populárních algoritmů. Poté teorii rozšíříme na multiagentní systémy kde se budeme zabývat problémy, které jsou s nimi spjaté. A nakonec navrhneme nové přístupy trénování agentů, kde se pomocí zjednodušeného prostředí kooperativní hry s více agenty založené na populární hře Overcooked pokusíme trénovat agenty, kteří jsou robustní a schopní spolupráce s neznámými agenty.	cs_CZ
uk.abstract.en	Over the past few decades, we have witnessed great successes in the field of deep and reinforcement learning. Great achievements have been made in many competitive settings, both in single-agent and multi-agent environments, where AI has managed to outperform human experts and even entire teams of human experts. However, the situation is much more difficult when cooperation is required in purely cooperative environments. We first give a brief overview of reinforcement learning theory and current state of the art algorithms. We then extend the theory to multi-agent systems, where several related issues are discussed. And finally, we propose novel approaches of agent training where we use a simplified multi-agent cooperative cooking game environment based on the popular game Overcooked, we attempt to train agents that are robust and capable of ad hoc agent cooperation.	en_US
uk.file-availability	V
uk.grantor	Univerzita Karlova, Matematicko-fyzikální fakulta, Katedra teoretické informatiky a matematické logiky	cs_CZ
thesis.grade.code	1
uk.publication-place	Praha	cs_CZ
uk.thesis.defenceStatus	O