Laika diferences apmācīšanās nepārtrauktā darbību telpā

Bērziņa, Ginta

dc.contributor.advisor	Zuters, Jānis	en_US
dc.contributor.author	Bērziņa, Ginta	en_US
dc.contributor.other	Latvijas Universitāte. Datorikas fakultāte	en_US
dc.date.accessioned	2015-03-24T07:05:25Z
dc.date.available	2015-03-24T07:05:25Z
dc.date.issued	2014	en_US
dc.identifier.other	42628	en_US
dc.identifier.uri	https://dspace.lu.lv/dspace/handle/7/17091
dc.description.abstract	Autores izstrādātais bakalaura darbs “Laika diferences apmācīšanās nepārtrauktā darbību telpā” iekļauj gan pētījumu par pastiprinājuma vadītās mācīšanās algoritmiem, gan darba autores izstrādātas mācīšanās. Darba izstrādes laikā autore apguva pastiprinājuma vadītās mācīšanas pamatprincipus un laika diferences mācīšanās algoritmus (Q-learning un Sarsa). Autore darba izstrādes laikā izveidoja mācīšanās algoritmus dažādiem uzdevumiem, kuri tika simulēti virtuālā fiziskā pasaulē.Autores izveidotie mācīšanās algoritmi ir veidoti, lai mācīšanos varētu veikt, nezinot neko par vidi, bet novērojot iegūtos rezultātus reālajā laikā, proti, objektu pozīciju, pārvietošanās un rotācijas ātrumu, rotācijas leņķi. Autore izveidoja piecus mācīšanās algoritmus, kuri ir objekta uzsviešana noteiktā augstumā, objektu nokrišanas sinhronizācija, kārts rotācijas un kārts balansēšanas mācīšanās.	en_US
dc.description.abstract	Author of Temporal Difference Learning in Continuous Action Spaces in her Bachelor paper includes research about reinforcement learning and created reinforcement learning examples for various problems. The author studied reinforcement learning and temporal difference learning algorithms (Q-Learning and Sarsa).. A virtual world was created with physics engine to simulate real world, because learning tasks were meant to solve tasks, where learning was effected by gravity, air friction and weight of object. In order to apply learning, created algorithms uses only parameters, which are observed: position, movement and rotation speed and angle of object, therefore created algorithms doesn’t depend on knowing gravity, air friction and weight of object. The author created algorithms for five learning tasks. They are: throw object to specific height, synchronize object drop time, learn objects to fly and learn pole rotation.	en_US
dc.language.iso	N/A	en_US
dc.publisher	Latvijas Universitāte	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Datorzinātne	en_US
dc.title	Laika diferences apmācīšanās nepārtrauktā darbību telpā	en_US
dc.title.alternative	Temporal difference learning in countinuous action spaces	en_US
dc.type	info:eu-repo/semantics/bachelorThesis	en_US

Files in this item

Name:: 302-42628-Strode_Ginta_gs10022.pdf
Size:: 739.3Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Bakalaura un maģistra darbi (DF) / Bachelor's and Master's theses [3341]

Show simple item record