
matilda's People

Contributors

gonmf


Forkers

jjborie ricki1000

matilda's Issues

Performance optimization

Performance optimization: it has been a while since it was last run.

  • profiling with valgrind to find the most important improvement areas
  • playout caching - try only updating data when needed (see the sketch after this list)
  • when playing very fast, the non-critical parts (GTP protocol, kifu handling) start to matter and could be improved; this would speed up fast optimization runs
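A minimal sketch of the "only update when needed" idea: derived data carries a dirty flag that is set whenever the board changes and recomputed lazily at the point of use. All names here are hypothetical, not matilda's actual API.

    /* Lazy recomputation sketch for per-playout derived data. */
    #include <stdbool.h>

    typedef struct {
        double value; /* cached derived data, e.g. an owner-map summary */
        bool dirty;   /* set whenever the underlying board changes */
    } cached_stat;

    static cached_stat stat = { 0.0, true };

    /* The expensive scan of the board state would go here. */
    static double recompute_stat(void) { return 0.0; }

    /* Cheap; call on every board mutation. */
    static void invalidate_stat(void) { stat.dirty = true; }

    /* Call only where the value is actually needed. */
    static double get_stat(void) {
        if (stat.dirty) {
            stat.value = recompute_stat();
            stat.dirty = false;
        }
        return stat.value;
    }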

Investigate why Michi is stronger with an equal number of playouts #1

Issue continues in #95.

Given the implementation, matilda should be stronger even playout for playout.

Matilda appears to be ~150 ELO weaker on 9x9 at 1000 simulations/turn, with large patterns disabled.

Possible causes:

  • playout policy - move selection?
  • playout policy - stopping early?
  • AMAF/RAVE schedule? - michi uses the equiv schedule and AMAF data from both before and after the node, not just after (see the sketch after this list)
  • UCT priors?
  • UCT tactical restrictions, self-ataris? - strength improved, but by how much? #34
  • definition of an eye - strength improved, but only a little
  • 3x3 patterns and their weights?
  • expansion delay?
  • owner maps?
  • not cleaning states between matches/turns?
  • ignoring transpositions?
  • matilda running multi-threaded and suffering more from virtual loss?
  • michi counting simulations as number of playouts instead of MCTS traversals, and as such playing more simulations depending on the expansion delay?
  • RNG?
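Regarding the AMAF point above: michi fills its AMAF map during the tree descent as well as during the playout, so a node's AMAF statistics also credit moves played "before" it in the simulation. A rough C sketch of that bookkeeping, with illustrative names and sizes:

    #include <string.h>

    #define BOARD_PTS (19 * 19)

    typedef struct {
        int move;                   /* move leading to this child */
        int amaf_visits, amaf_wins; /* AMAF/RAVE statistics */
    } child;

    /* Color that first played each point in the current simulation
       (-1 if unplayed); filled during BOTH descent and playout. */
    static int amaf_color[BOARD_PTS];

    static void amaf_reset(void) { memset(amaf_color, -1, sizeof amaf_color); }

    static void amaf_record(int point, int color) {
        if (amaf_color[point] == -1)
            amaf_color[point] = color;
    }

    /* After the playout: credit every sibling whose move was played by
       the color to move at this node, anywhere in the simulation. */
    static void amaf_update(child *c, int n, int color_to_move, int won) {
        for (int i = 0; i < n; i++)
            if (amaf_color[c[i].move] == color_to_move) {
                c[i].amaf_visits++;
                c[i].amaf_wins += won;
            }
    }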

Winrate against Michi-C, 1000 playouts/turn, no multi-threading: 23.8% (-202 ELO)
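The ELO figures quoted in this issue are consistent with the usual logistic model, under which a winrate p corresponds to a rating difference of 400 * log10(p / (1 - p)):

    #include <math.h>

    /* Rating difference implied by winrate p under the logistic model;
       p = 0.238 gives 400 * log10(0.238 / 0.762), roughly -202 ELO. */
    static double elo_diff(double p) {
        return 400.0 * log10(p / (1.0 - p));
    }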


Taking out parts of michi to see where the strength decreases:

  • base winrate against michief, 1k/turn, no opening book, no multithreading: 25.4%, 67 games, T(5.1 vs 70)
  • against michi, with opening book, multithreaded: 14.3%, 7 games, T(1.4 vs 137.1)

Even barebones UCT with no priors and random playouts beats matilda!

  • base winrate against michief, 1k/turn, no opening book, no multithreading: 38.5%, 26 games, T(5.4 vs 27.4)

What could it be?

  • self-atari prevention?
  • AMAF/RAVE again?

With matilda's final_score corrected:

  • against michief: 86.9%, 61 games, T: 5.7 to 54.7
  • against michi-c: 18%, 50 games, T: 3.9 to 5
  • against michi-c with expand_visits=0: crashes
  • against michi-c with expand_visits=2, matilda with expand_visits=4: 33.1%, 178 games
  • against michi-c, both with expand_visits=4: 31.5%, 73 games, T: 3.6 to 5.3
  • ^ without 3x3 patterns in playouts for michi-c: 42.9%, 70 games
  • ^ without captures in playouts for michi-c: 55.7%, 70 games
  • ^ without empty priors: 29.6%, 71 games
  • against michi-c, both with expand_visits=4: 23.7%, 169 games, T: 3.2 to 4.9

26.8%, 314 games, T: 3.3 vs 5

Nothing works; even without fix_atari, michi stays close to 50% but still beats matilda.

  • AMAF again, using data from both before and after the traversal?

Might be unrelated, but investigate why the MSE equivalence parameter is so different in michi and matilda.

Ditch UCB1-tuned like Michi

  • Optimize RAVE equiv/b for use without UCB; check if ~2000 is really the optimal point (see the sketch after this list)
  • Clean up references to UCB in the code
  • Compare with equiv=3500, the value Michi uses
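For reference, the equiv schedule as used in michi.py (ignoring priors) blends the AMAF winrate with the regular winrate using a weight that decays as real visits accumulate; equiv is the parameter under discussion (~2000 here vs michi's 3500). A C transcription sketch:

    /* michi-style RAVE urgency. v/w: node visits and wins; av/aw: AMAF
       visits and wins. The AMAF estimate dominates while v is small and
       the regular winrate takes over as v grows past RAVE_EQUIV. */
    #define RAVE_EQUIV 3500.0

    static double rave_urgency(double v, double w, double av, double aw) {
        double winrate = v > 0.0 ? w / v : 0.0;
        if (av == 0.0)
            return winrate;
        double beta = av / (av + v + v * av / RAVE_EQUIV);
        return beta * (aw / av) + (1.0 - beta) * winrate;
    }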

winrate vs michi-c now 37% (135 games, T 2.8 vs 5.3, no multithreading)

winrate vs michi-c now 46.5% (310 games, T 2.6 vs 5.6, no multithreading)

winrate vs michi-c now 48.5% (135 games, T 2.5 vs 5.6, no multithreading)

  • Matilda vs Michi-C with 10k/turn: 27% (37 games, T 16.6 vs 45.9)

issue continues on #95

Improve time allotment

On hold due to lag and other practical issues.

Optimization related to the research paper (a sketch of the linear and geometric schemes follows the list):

  • optimize linear
  • benchmark linear
  • optimize geometric
  • benchmark geometric
  • optimize geometric byo-yomi
  • benchmark geometric byo-yomi
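A hedged sketch of the two schemes, to make the checklist concrete: linear divides the remaining main time over the expected number of remaining moves, while geometric always spends a fixed fraction of what is left and falls back to the byo-yomi period once main time is exhausted. Constants and names are illustrative, not matilda's actual values.

    /* Time-allotment sketches; all constants illustrative. */

    /* Linear: split remaining main time over expected remaining moves. */
    static double linear_allot(double main_time_left, int moves_left) {
        if (moves_left < 1)
            moves_left = 1;
        return main_time_left / moves_left;
    }

    /* Geometric: spend a fixed fraction of the remaining main time, so
       the per-move budget decays geometrically; in byo-yomi, spend the
       whole period. */
    static double geometric_allot(double main_time_left, double factor,
                                  double byo_yomi_period) {
        if (main_time_left > 0.0)
            return main_time_left * factor; /* e.g. factor around 0.04 */
        return byo_yomi_period;
    }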

Frisbee Go fixes

  • allow illegal plays that may produce a legal play
  • relax tactical restrictions

Because of the tactical restrictions it currently plays Frisbee Go weakly (see the sketch below).
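For context, in Frisbee Go a play lands on the intended point only with some accuracy probability, otherwise on one of its four neighbors; this is why aiming at an illegal point can still produce a legal play, as the first item asks. A sketch of the landing rule (the uniform-neighbor rule and all names are assumptions about the variant, not matilda's code):

    #include <stdlib.h>

    typedef struct { int x, y; } point;

    /* With probability `accuracy` the stone lands where aimed, else
       uniformly on one of the four neighbors. A landing outside the
       board or on an illegal point wastes the turn. */
    static point frisbee_landing(point aim, double accuracy) {
        if ((double)rand() / RAND_MAX < accuracy)
            return aim;
        point p = aim;
        switch (rand() % 4) {
        case 0: p.x--; break;
        case 1: p.x++; break;
        case 2: p.y--; break;
        default: p.y++; break;
        }
        return p;
    }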

Implement joseki dictionaries

Review whether the joseki extraction is correct, comparing against sgfutils' sgfvarsplit, and think further about how to implement and use a joseki dictionary.

MCTS and OpenMP tests

  • test different OMP divisions of the main parallel for work (time-based)
  • test configurable time-based vs playout-based stopping, with a variable number of playouts
  • test different OMP simulation batch sizes (time-based; see the sketch after this list)
  • test passing immediately if a pass is possible and the winrate > 0.95
  • test stopping early if the best play is significantly better than the 2nd best (by quality? number of simulations?)
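A minimal sketch of the time-based batching idea from the list above: each batch of simulations is one OpenMP parallel for, repeated until the time budget is spent; the batch size and the between-batch stop checks are the knobs under test. Everything here is illustrative, not matilda's scheduler.

    #include <omp.h>

    #define BATCH 128 /* one of the batch sizes to test */

    /* Stand-in for one tree descent plus playout; assumed thread-safe
       (e.g. via virtual loss). */
    static void run_one_simulation(void) { /* ... */ }

    static void think(double seconds) {
        double t0 = omp_get_wtime();
        while (omp_get_wtime() - t0 < seconds) {
            #pragma omp parallel for schedule(dynamic)
            for (int i = 0; i < BATCH; i++)
                run_one_simulation();
            /* early-stop checks (pass if a pass is possible and the
               winrate > 0.95, stop if the best play is clearly ahead)
               go here, between batches */
        }
    }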

Experiment with play grouping again

Play grouping did not work but looked promising. Think further about the criteria for grouping, and whether the increased performance makes it worth it.

Simulate Japanese rules

  • count captured stones
  • discourage MCTS from suicidal plays by slightly maximizing the score difference instead of the win rate
  • add a scoring function for Japanese (territory) scoring
  • switch to the Japanese byo-yomi time system by default if the rules are Japanese
  • add an option to choose Chinese or Japanese rules

Implement this to some extent even if ignored by MCTS (scoring sketch below).
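For reference, the core difference between the two scoring functions, as a hedged sketch of Black's margin before komi (inputs assumed computed elsewhere): Chinese (area) scoring counts stones on the board plus surrounded territory, while Japanese (territory) scoring counts surrounded territory plus captures, which is why captured stones need to be counted at all.

    /* Chinese (area): stones on the board plus surrounded territory. */
    static int area_score(int b_stones, int w_stones, int b_terr, int w_terr) {
        return (b_stones + b_terr) - (w_stones + w_terr);
    }

    /* Japanese (territory): surrounded territory plus prisoners. */
    static int territory_score(int b_terr, int w_terr,
                               int b_captures, int w_captures) {
        return (b_terr + b_captures) - (w_terr + w_captures);
    }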

Playout experiment

Experiment with a simpler safe_to_play function, and test captures by tactical reading instead of only immediate captures.

Road to 9th October

  • Decide whether SPBs or the OB take precedence
  • Version 20
  • 3x3 pattern experiments
  • CLOP time factor only; with self-play, 2s/game and OB turned off #72
  • Generate a more thorough OB #59 (4a)
  • Benchmark self-play with vs without OB/SPBs
  • CLOP playout heuristics; with 400 playouts/turn vs GNU Go level 0
  • Version 21 and merge #68
  • CLOP time factor and RAVE; with self-play at about 30s/game and OB turned on (5a,6a,S,D)
  • Test using an Amazon or DigitalOcean instance for tournament play; suggested: c4.2xlarge #69
  • Benchmark vs GNU Go for a rank estimate

Experiment with dynamic komi again

Previous experiments did not produce a suitable implementation of dynamic komi, but they were not run with CLOP, so try again one day. It is also relatively fast to implement (a sketch of one common scheme is below).
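One common scheme from the literature is value-based dynamic komi, which nudges komi to keep the root winrate inside a comfort band so MCTS keeps discriminating between moves in very won or lost positions. A hedged sketch, not what the previous experiments used; thresholds and step are illustrative:

    /* Nudge komi toward keeping the root winrate in [low, high]. */
    static double adjust_komi(double komi, double root_winrate) {
        const double high = 0.80, low = 0.45, step = 0.5;
        if (root_winrate > high)
            komi += step; /* winning big: demand more points */
        else if (root_winrate < low)
            komi -= step; /* losing: ask for less */
        return komi;
    }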

Write documentation

Covering at least:

  • Overall architecture, assumptions and conventions.
  • Using parts of matilda in other programs.
  • User manual with options and examples of use of the different programs.
  • Short description of each component.
