Commit Graph

153 Commits

Author SHA1 Message Date
ba4ef86bb5 Board rep can now be inferred from file after being given once.
We can also evaluate multiple times by using the flag "--repeat-eval".
The flag defaults to 1, if not provided.
2018-05-12 12:14:47 +02:00
c3f5e909d6 flip is back 2018-05-11 21:47:48 +02:00
1aa9cf705f quack without leaks 2018-05-11 21:24:10 +02:00
383dd7aa4b code works again; quack gave ~3 times improvement for calc_moves 2018-05-11 20:13:43 +02:00
93188fe06b more quack for board 2018-05-11 20:07:27 +02:00
ffbc98e1a2 quack kind of works 2018-05-11 19:00:39 +02:00
03e61a59cf quack 2018-05-11 17:29:22 +02:00
93224864a4 More comments, backprop have been somewhat tested in the eager_main.py
and normal_main.py.
2018-05-11 13:35:01 +02:00
504308a9af Yet another input argument, "--ply", 0 for no look-ahead, 1 for a single
look-ahead.
2018-05-10 23:22:41 +02:00
3b57c10b5a Saves calling tf.reduce_mean on all values once. 2018-05-10 22:57:27 +02:00
4fa10861bb update TF dependency to 1.8.0 2018-05-10 19:27:51 +02:00
6131d5b5f4 Added comments for Christoffer! 2018-05-10 19:25:28 +02:00
1aedc23de1 1-ply now works again. 2018-05-10 19:13:18 +02:00
2d84cd5a0b 1-ply now works again. 2018-05-10 19:06:53 +02:00
396d5b036d All values for boards and all rolls can now be calculated 2018-05-10 18:41:21 +02:00
4efb229d34 Added a lot of comments 2018-05-10 15:28:33 +02:00
f2a67ca92e All board reps should now work as input. 2018-05-10 10:49:25 +02:00
9cfdd7e2b2 Added a verbosity flag, --verbose, which allows for printing of
variables and such.
2018-05-10 10:39:22 +02:00
6429e0732c We should now be able to both train and eval as per usual.
I've added a file "global_step", which works as the new global_step
counter, so we can use it for exp_decay.
2018-05-09 23:15:35 +02:00
cb7e7b519c Getting closer to functionality. We're capable of evaluating moves
and a rework of global_step has begun, such that we now use
episode_count as a way of calculating exp_decay, which have been
implemented as a function.
2018-05-09 22:22:12 +02:00
9a2d87516e Ongoing rewrite of network to use an eager model. We're now capable of
evaluating a list of states with network.py. We can also save and
restore models.
2018-05-09 00:33:05 +02:00
7b308be4e2 Different implementations of different speed 2018-05-07 22:24:47 +02:00
ac6660e05b Added board-rep as cli argument, to state which input-board-rep to use.
Also fixed weird nesting of difference_in_values.
2018-05-06 20:52:35 +02:00
1f8485f54e No longer use n_ply, shit's too slow man.
Added extra logging, now logs the average difference in values
between trainings.
Also fixed bug with the length of quack-norm.
Also added cli argument; use-baseline, if set, the baseline-model
will be used.
2018-05-06 20:41:07 +02:00
1db469709a make_move now calls n_ply to search deeper and potentially give
better moves. It's hella fucking slow.
2018-05-02 01:06:23 +02:00
695a3d43db Fixed n_ply and actually added a comma in main.py. *clap Christoffer* 2018-05-01 20:39:29 +02:00
c530aa688d flipidip 2018-05-01 13:48:42 +02:00
3f6849048e added network_test and some comments 2018-04-29 12:14:14 +02:00
afa6504b05 ply again again 2018-04-26 16:49:49 +02:00
9428a00c11 add "--force-creation" flag to force model creation 2018-04-26 11:43:19 +02:00
48a5f6cbb6 Moved "do_ply" out of "calculate_2_ply", in an effort to be able to
eventually do further plies, however some rewriting of the current
"do_ply" will be needed, as described in a comment.
2018-04-26 09:42:03 +02:00
8899c5c2d9 Fixed potential bug in regards to scores in 2-ply calculation. 2018-04-25 00:51:04 +02:00
ea3f05846d Merge branch 'master' of https://gitfub.space/Pownie/backgammon 2018-04-24 22:31:18 +02:00
0509a51fd3 Added baseline model for testing 2018-04-24 22:30:58 +02:00
33a4b0db3c disallow using model "baseline" 2018-04-24 21:16:54 +02:00
349ad718f1 Moved gen_21_rolls into the 2-ply method, so it can be correctly used like the good helper method that it is 2018-04-23 00:45:31 +02:00
e5cc54d3e0 Added a normalised version of quack 2018-04-23 00:35:25 +02:00
160f5bd737 added some comments and removed some old code 2018-04-22 19:13:46 +02:00
77d82f6883 Added code for 2-ply look-ahead 2018-04-22 15:07:19 +02:00
1062b72bda fix typo 2018-04-19 16:04:49 +02:00
66589dfde3 fixed global step, now using exp decay 2018-04-19 16:01:19 +02:00
cba0f67ae2 fixed *the* bug 2018-04-19 15:22:00 +02:00
b6c52ba476 fix type error 2018-04-16 00:24:24 +02:00
8998dca1f2 remove @Pownie's debug print 2018-04-16 00:03:02 +02:00
611f6cdba0 Changed alpha to learning_rate 2018-04-15 23:53:35 +02:00
57fb1cb141 Merge branch 'master' of https://gitfub.space/Pownie/backgammon 2018-04-15 23:52:00 +02:00
cc1e010840 Uses proper board instead of Alex' drunken mistakes 2018-04-15 23:51:28 +02:00
f68d7a9ded add pygame to requirements.txt 2018-04-15 22:45:37 +02:00
f59fe27e5f You can now move off bar 2018-04-14 23:31:33 +02:00
7d29fc02f2 Added global step + exponential decay 2018-04-14 23:11:20 +02:00