Enhancing PPO with Intrinsic Rewards: A Study in the NetHack Environment
Zlepšení PPO pomocí vnitřních odměn: Studie v prostředí NetHack
bachelor thesis (DEFENDED)

View/ Open
Permanent link
http://hdl.handle.net/20.500.11956/193098Identifiers
Study Information System: 268805
Collections
- Kvalifikační práce [11342]
Author
Advisor
Referee
Pilát, Martin
Faculty / Institute
Faculty of Mathematics and Physics
Discipline
Computer Science with specialisation in Artificial Intelligence
Department
Department of Theoretical Computer Science and Mathematical Logic
Date of defense
6. 9. 2024
Publisher
Univerzita Karlova, Matematicko-fyzikální fakultaLanguage
English
Grade
Excellent