Bridging the Gap: Towards Unified Approach to Perfect and Imperfect Information Games

Moravčík, Matej

Překlenutí rozdílu: Jednotný přístup k hrám s úplnou a neúplnou informací.

dc.contributor.advisor	Hladík, Milan
dc.creator	Moravčík, Matej
dc.date.accessioned	2024-04-08T12:49:40Z
dc.date.available	2024-04-08T12:49:40Z
dc.date.issued	2024
dc.identifier.uri	http://hdl.handle.net/20.500.11956/188743
dc.description.abstract	From the onset of AI research, games have played an important part, serving as a benchmark for progress in artificial intelligence. Recent approaches using search in combination with learning from self-play have shown strong performance and the ability to generalize across a wide range of perfect information games. In contrast, the leading algorithms for imperfect information traditionally used a small, abstract version of a game and solved this abstraction in one go. This thesis introduces a chain of improvements for imperfect information algorithms that culminates in two significant milestones that helped bridge the gap between perfect and imperfect information games. The first milestone is DeepStack - the first agent that successfully used a combination of sound search and a learned value function in imperfect information games. This led to the first AI to achieve victory over human professional players in no-limit poker. The second milestone is Player of Games - a universal algorithm that can master both perfect and imperfect information games starting from scratch. 1	en_US
dc.description.abstract	Od začiatku výskumu umelej inteligencie zohrávali hry dôležitú úlohu a slúžili ako ukazo- vatel'pokroku v oblasti umelej inteligencie. Algoritmy, ktoré kombinovali prehl'adávanie so strojovým učením z vlastných skúseností, ukázali dobré výsledky a schopnost' gene- ralizovat' v širokej škále hier s úplnou informáciou. Na druhej strane, tradične najlepšie algoritmy pre hry s neúplnou informáciou používali zjednodušenú, abstraktnú verziu hry, ktorú potom riešili ako celok. Táto práca predstavuje rad vylepšení algoritmov s neúplnou informáciou a vyvrcholí dvoma významnými míl'nikmi, ktoré pomohli preklenút' rozdiel medzi hrami s úplnou a neúplnou informáciou. Prvým míl'nikom je DeepStack - prvý agent, ktorý úspešne kombinoval teoreticky korektné prehl'adávanie s evalučnou funkciou natrénovanou pomocou strojového učenia v hrách s neúplnou informáciou. DeepStack po- tom dosiahol historicky prvé vít'azstvo umelej inteligencie nad profesionálnymi hráčmi v pokri s neobmedzenými stávkami. Druhým míl'nikom je "Player of Games" - univerzálny algoritmus schopný ovládnut' oba typy hier, s úplnou aj neúplnou informáciou. 1	cs_CZ
dc.language	English	cs_CZ
dc.language.iso	en_US
dc.publisher	Univerzita Karlova, Matematicko-fyzikální fakulta	cs_CZ
dc.subject	teorie her\|prohledávání\|hry s neúplnou informací\|DeepStack\|Player of Games	cs_CZ
dc.subject	game theory\|search\|imperfect information games\|DeepStack\|Player of Games ii	en_US
dc.title	Bridging the Gap: Towards Unified Approach to Perfect and Imperfect Information Games	en_US
dc.type	dizertační práce	cs_CZ
dcterms.created	2024
dcterms.dateAccepted	2024-03-12
dc.description.department	Department of Applied Mathematics	en_US
dc.description.department	Katedra aplikované matematiky	cs_CZ
dc.description.faculty	Faculty of Mathematics and Physics	en_US
dc.description.faculty	Matematicko-fyzikální fakulta	cs_CZ
dc.identifier.repId	167568
dc.title.translated	Překlenutí rozdílu: Jednotný přístup k hrám s úplnou a neúplnou informací.	cs_CZ
dc.contributor.referee	Pilarski, Patrick
dc.contributor.referee	Kroupa, Tomáš
thesis.degree.name	Ph.D.
thesis.degree.level	doktorské	cs_CZ
thesis.degree.discipline	Computer Science - Theory of Computing, Discrete Models and Optimization	en_US
thesis.degree.discipline	Informatika - teorie, diskrétní modely a optimalizace	cs_CZ
thesis.degree.program	Computer Science - Theory of Computing, Discrete Models and Optimization	en_US
thesis.degree.program	Informatika - teorie, diskrétní modely a optimalizace	cs_CZ
uk.thesis.type	dizertační práce	cs_CZ
uk.taxonomy.organization-cs	Matematicko-fyzikální fakulta::Katedra aplikované matematiky	cs_CZ
uk.taxonomy.organization-en	Faculty of Mathematics and Physics::Department of Applied Mathematics	en_US
uk.faculty-name.cs	Matematicko-fyzikální fakulta	cs_CZ
uk.faculty-name.en	Faculty of Mathematics and Physics	en_US
uk.faculty-abbr.cs	MFF	cs_CZ
uk.degree-discipline.cs	Informatika - teorie, diskrétní modely a optimalizace	cs_CZ
uk.degree-discipline.en	Computer Science - Theory of Computing, Discrete Models and Optimization	en_US
uk.degree-program.cs	Informatika - teorie, diskrétní modely a optimalizace	cs_CZ
uk.degree-program.en	Computer Science - Theory of Computing, Discrete Models and Optimization	en_US
thesis.grade.cs	Prospěl/a	cs_CZ
thesis.grade.en	Pass	en_US
uk.abstract.cs	Od začiatku výskumu umelej inteligencie zohrávali hry dôležitú úlohu a slúžili ako ukazo- vatel'pokroku v oblasti umelej inteligencie. Algoritmy, ktoré kombinovali prehl'adávanie so strojovým učením z vlastných skúseností, ukázali dobré výsledky a schopnost' gene- ralizovat' v širokej škále hier s úplnou informáciou. Na druhej strane, tradične najlepšie algoritmy pre hry s neúplnou informáciou používali zjednodušenú, abstraktnú verziu hry, ktorú potom riešili ako celok. Táto práca predstavuje rad vylepšení algoritmov s neúplnou informáciou a vyvrcholí dvoma významnými míl'nikmi, ktoré pomohli preklenút' rozdiel medzi hrami s úplnou a neúplnou informáciou. Prvým míl'nikom je DeepStack - prvý agent, ktorý úspešne kombinoval teoreticky korektné prehl'adávanie s evalučnou funkciou natrénovanou pomocou strojového učenia v hrách s neúplnou informáciou. DeepStack po- tom dosiahol historicky prvé vít'azstvo umelej inteligencie nad profesionálnymi hráčmi v pokri s neobmedzenými stávkami. Druhým míl'nikom je "Player of Games" - univerzálny algoritmus schopný ovládnut' oba typy hier, s úplnou aj neúplnou informáciou. 1	cs_CZ
uk.abstract.en	From the onset of AI research, games have played an important part, serving as a benchmark for progress in artificial intelligence. Recent approaches using search in combination with learning from self-play have shown strong performance and the ability to generalize across a wide range of perfect information games. In contrast, the leading algorithms for imperfect information traditionally used a small, abstract version of a game and solved this abstraction in one go. This thesis introduces a chain of improvements for imperfect information algorithms that culminates in two significant milestones that helped bridge the gap between perfect and imperfect information games. The first milestone is DeepStack - the first agent that successfully used a combination of sound search and a learned value function in imperfect information games. This led to the first AI to achieve victory over human professional players in no-limit poker. The second milestone is Player of Games - a universal algorithm that can master both perfect and imperfect information games starting from scratch. 1	en_US
uk.file-availability	V
uk.grantor	Univerzita Karlova, Matematicko-fyzikální fakulta, Katedra aplikované matematiky	cs_CZ
thesis.grade.code	P
uk.publication-place	Praha	cs_CZ
uk.thesis.defenceStatus	O