Commit Graph

143 Commits

Author SHA1 Message Date
4fa10861bb update TF dependency to 1.8.0 2018-05-10 19:27:51 +02:00
6131d5b5f4 Added comments for Christoffer! 2018-05-10 19:25:28 +02:00
1aedc23de1 1-ply now works again. 2018-05-10 19:13:18 +02:00
2d84cd5a0b 1-ply now works again. 2018-05-10 19:06:53 +02:00
396d5b036d All values for boards and all rolls can now be calculated 2018-05-10 18:41:21 +02:00
4efb229d34 Added a lot of comments 2018-05-10 15:28:33 +02:00
f2a67ca92e All board reps should now work as input. 2018-05-10 10:49:25 +02:00
9cfdd7e2b2 Added a verbosity flag, --verbose, which allows for printing of
variables and such.
2018-05-10 10:39:22 +02:00
6429e0732c We should now be able to both train and eval as per usual.
I've added a file "global_step", which works as the new global_step
counter, so we can use it for exp_decay.
2018-05-09 23:15:35 +02:00
cb7e7b519c Getting closer to functionality. We're capable of evaluating moves
and a rework of global_step has begun, such that we now use
episode_count as a way of calculating exp_decay, which have been
implemented as a function.
2018-05-09 22:22:12 +02:00
9a2d87516e Ongoing rewrite of network to use an eager model. We're now capable of
evaluating a list of states with network.py. We can also save and
restore models.
2018-05-09 00:33:05 +02:00
7b308be4e2 Different implementations of different speed 2018-05-07 22:24:47 +02:00
ac6660e05b Added board-rep as cli argument, to state which input-board-rep to use.
Also fixed weird nesting of difference_in_values.
2018-05-06 20:52:35 +02:00
1f8485f54e No longer use n_ply, shit's too slow man.
Added extra logging, now logs the average difference in values
between trainings.
Also fixed bug with the length of quack-norm.
Also added cli argument; use-baseline, if set, the baseline-model
will be used.
2018-05-06 20:41:07 +02:00
1db469709a make_move now calls n_ply to search deeper and potentially give
better moves. It's hella fucking slow.
2018-05-02 01:06:23 +02:00
695a3d43db Fixed n_ply and actually added a comma in main.py. *clap Christoffer* 2018-05-01 20:39:29 +02:00
c530aa688d flipidip 2018-05-01 13:48:42 +02:00
3f6849048e added network_test and some comments 2018-04-29 12:14:14 +02:00
afa6504b05 ply again again 2018-04-26 16:49:49 +02:00
9428a00c11 add "--force-creation" flag to force model creation 2018-04-26 11:43:19 +02:00
48a5f6cbb6 Moved "do_ply" out of "calculate_2_ply", in an effort to be able to
eventually do further plies, however some rewriting of the current
"do_ply" will be needed, as described in a comment.
2018-04-26 09:42:03 +02:00
8899c5c2d9 Fixed potential bug in regards to scores in 2-ply calculation. 2018-04-25 00:51:04 +02:00
ea3f05846d Merge branch 'master' of https://gitfub.space/Pownie/backgammon 2018-04-24 22:31:18 +02:00
0509a51fd3 Added baseline model for testing 2018-04-24 22:30:58 +02:00
33a4b0db3c disallow using model "baseline" 2018-04-24 21:16:54 +02:00
349ad718f1 Moved gen_21_rolls into the 2-ply method, so it can be correctly used like the good helper method that it is 2018-04-23 00:45:31 +02:00
e5cc54d3e0 Added a normalised version of quack 2018-04-23 00:35:25 +02:00
160f5bd737 added some comments and removed some old code 2018-04-22 19:13:46 +02:00
77d82f6883 Added code for 2-ply look-ahead 2018-04-22 15:07:19 +02:00
1062b72bda fix typo 2018-04-19 16:04:49 +02:00
66589dfde3 fixed global step, now using exp decay 2018-04-19 16:01:19 +02:00
cba0f67ae2 fixed *the* bug 2018-04-19 15:22:00 +02:00
b6c52ba476 fix type error 2018-04-16 00:24:24 +02:00
8998dca1f2 remove @Pownie's debug print 2018-04-16 00:03:02 +02:00
611f6cdba0 Changed alpha to learning_rate 2018-04-15 23:53:35 +02:00
57fb1cb141 Merge branch 'master' of https://gitfub.space/Pownie/backgammon 2018-04-15 23:52:00 +02:00
cc1e010840 Uses proper board instead of Alex' drunken mistakes 2018-04-15 23:51:28 +02:00
f68d7a9ded add pygame to requirements.txt 2018-04-15 22:45:37 +02:00
f59fe27e5f You can now move off bar 2018-04-14 23:31:33 +02:00
7d29fc02f2 Added global step + exponential decay 2018-04-14 23:11:20 +02:00
1d9c94896d Red can go on bar as well now 2018-04-14 22:53:49 +02:00
716413e2b6 bar works somewhat if black goes on there. Still can't get off it 2018-04-14 22:51:41 +02:00
7993da0db7 Turns are now functioning 2018-04-14 18:47:38 +02:00
7764a70799 Changed calculate_legal_states to allow for possible face_value of 0 2018-04-14 14:51:50 +02:00
c08e7fe540 Few changes to board 2018-04-14 14:13:27 +02:00
dec12d989e Not fully implented board 2018-04-11 00:38:25 +02:00
4cdd1960a0 add pandas and matplotlib to Python package requirements 2018-03-28 15:37:48 +02:00
3bcb7c5df9 Merge branch 'rework-1' into 'master'
Rework 1

See merge request Pownie/backgammon!4
2018-03-28 13:32:58 +00:00
8764fadd6a train-evaluate-save 2018-03-28 15:32:22 +02:00
17f5b62e9b proper Tesauro board representation 2018-03-28 14:36:52 +02:00