Commit Graph

48 Commits

Author SHA1 Message Date
a3b4a7480c Changed it to 1,0 and added a bunch of server stuff 2018-06-14 23:49:58 +02:00
ac6660e05b Added board-rep as cli argument, to state which input-board-rep to use.
Also fixed weird nesting of difference_in_values.
2018-05-06 20:52:35 +02:00
1f8485f54e No longer use n_ply, shit's too slow man.
Added extra logging, now logs the average difference in values
between trainings.
Also fixed bug with the length of quack-norm.
Also added cli argument; use-baseline, if set, the baseline-model
will be used.
2018-05-06 20:41:07 +02:00
1db469709a make_move now calls n_ply to search deeper and potentially give
better moves. It's hella fucking slow.
2018-05-02 01:06:23 +02:00
695a3d43db Fixed n_ply and actually added a comma in main.py. *clap Christoffer* 2018-05-01 20:39:29 +02:00
c530aa688d flipidip 2018-05-01 13:48:42 +02:00
3f6849048e added network_test and some comments 2018-04-29 12:14:14 +02:00
afa6504b05 ply again again 2018-04-26 16:49:49 +02:00
9428a00c11 add "--force-creation" flag to force model creation 2018-04-26 11:43:19 +02:00
48a5f6cbb6 Moved "do_ply" out of "calculate_2_ply", in an effort to be able to
eventually do further plies, however some rewriting of the current
"do_ply" will be needed, as described in a comment.
2018-04-26 09:42:03 +02:00
8899c5c2d9 Fixed potential bug in regards to scores in 2-ply calculation. 2018-04-25 00:51:04 +02:00
0509a51fd3 Added baseline model for testing 2018-04-24 22:30:58 +02:00
349ad718f1 Moved gen_21_rolls into the 2-ply method, so it can be correctly used like the good helper method that it is 2018-04-23 00:45:31 +02:00
e5cc54d3e0 Added a normalised version of quack 2018-04-23 00:35:25 +02:00
160f5bd737 added some comments and removed some old code 2018-04-22 19:13:46 +02:00
77d82f6883 Added code for 2-ply look-ahead 2018-04-22 15:07:19 +02:00
1062b72bda fix typo 2018-04-19 16:04:49 +02:00
66589dfde3 fixed global step, now using exp decay 2018-04-19 16:01:19 +02:00
cba0f67ae2 fixed *the* bug 2018-04-19 15:22:00 +02:00
611f6cdba0 Changed alpha to learning_rate 2018-04-15 23:53:35 +02:00
7d29fc02f2 Added global step + exponential decay 2018-04-14 23:11:20 +02:00
17f5b62e9b proper Tesauro board representation 2018-03-28 14:36:52 +02:00
fda2c6e08d parametric board representation in network 2018-03-28 12:00:47 +02:00
abce56dd40 fix typo 2018-03-27 23:13:59 +00:00
95b12a6c35 Added another board_rep 2018-03-28 00:33:39 +02:00
2654006222
fix wrongful mergings 2018-03-27 13:02:36 +02:00
c248ca0452 Merge branch 'fuck_git' into 'rework-1'
# Conflicts:
#   network.py
2018-03-27 10:15:51 +00:00
f43108c239 Training using slightly revamped version of our own board rep. Not sure if works yet. 2018-03-27 04:06:08 +02:00
006f791727 Functioning network using board representation shamelessly ripped from Tesauro 2018-03-27 02:26:15 +02:00
4c43bf19a3
Add evaluation variance benchmark
To do a benchmark for `pubeval`, run `python3 main.py --bench-eval-scores
--eval-methods pubeval`

Logs will be placed in directory `bench`

Use `plot_bench(data_path)` in `plot.py` for plotting
2018-03-26 16:45:26 +02:00
1f1e806306
fix errant whitespace 2018-03-26 15:55:48 +02:00
98c9af72e7 rework network 2018-03-22 15:30:47 +01:00
b7e6dd10af move evaluation code into network.py 2018-03-20 13:17:38 +01:00
99783ee4f8 clean up and move things to network.py 2018-03-20 13:03:21 +01:00
2fc7a2a09c fixed dumb bugs; still messy 2018-03-14 20:42:09 +01:00
55898d0e66 renaming parameters 2018-03-12 00:11:55 +01:00
9bc1a8ba9f save and restore number of trained episodes 2018-03-10 00:22:20 +01:00
150036a6cb
plot-plot 2018-03-08 17:13:25 +01:00
30183448ec woooow 2018-03-08 16:27:16 +01:00
bae1e73692 Now only using one bot again. Also changed learning rate to 0.1 2018-03-07 14:44:17 +01:00
11d25603cf Might be able to learn now (?) 2018-03-06 16:23:08 +01:00
e7fe827ceb Merge branch 'master' of gitfub.space:Pownie/backgammon 2018-03-06 12:22:45 +01:00
c9e4446a52 Custom activation (2*tanh(x)) function implemented with tensorflow primitives. 2018-03-06 12:19:04 +01:00
00033a7aca re-enable Tensorflow logging 2018-03-06 12:04:56 +01:00
5845edf084 works now 2018-03-06 11:53:42 +01:00
22870b90d3 things are better now 2018-03-06 11:06:38 +01:00
d3fe3c918c Potentially functioning network 2018-03-04 17:35:36 +01:00
c2118d0549 Added network 2018-02-07 15:31:05 +01:00