System Rewards
Home
Environment Space
Rewards
Current Issues
Future Work
About
To encourage the model to clear lines effectively, the following rewards were chosen:
+1 for each step
Lines Cleared ^2
E.g 1 line cleared this step = 1
E.g 4 lines cleared this step = 16
-2 for game end