Giter VIP home page Giter VIP logo

rl-library's People

Contributors

btanner avatar

Watchers

 avatar

rl-library's Issues

Create the Phone Home Server

This is a big issue and it needs its own wiki page and people working on
it, but we need to write the phone home server or scripts.

Original issue reported on code.google.com by [email protected] on 9 Oct 2007 at 10:21

Local Glue isn't really Glue

The local glue code is cobbled together and needs to be written right and
tested.  IT doesn't implement everything, and I'm not sure its doing the
right thing in terms of time step counts and such.  Should look at the core
glue code and compare.

Original issue reported on code.google.com by [email protected] on 13 Sep 2007 at 1:11

Create a Java Tester for the Competition

This tester is also an experiment program.  It will be Java based and will
phone home first and load the experiment Jar over the Internet.

Maybe we can do some online scoreboarding as this thing runs and show other
people's results as it goes, with bar graphs or something.  What I mean is
that as its benchmarking you could literally have it now how well its doing
which would make it fun to watch.

The phoning home will need to be network code written independent of the glue.

Original issue reported on code.google.com by [email protected] on 26 Jul 2007 at 1:54

Ant Scripts to Build them All

Each of the projects now can be built with Ant scripts, we should write
some more to do the whole things at once... the equivalent of clean-all
make-all

Original issue reported on code.google.com by [email protected] on 26 Sep 2007 at 4:14

Value Function Resolution Slider needs to be dynamic

The value function resolution slider should only be there if you're using
it.  Not sure if that functionality should be moved into the visualization
somehow (maybe?) or what, but we need to fix it for the competition.

Original issue reported on code.google.com by [email protected] on 13 Sep 2007 at 12:44

Internet Loader

We should work a little harder on designing EnvironmentShell.EnvLoadingHelper

I Want 2 of them.  One that lists and loads environments from the local
file system.  The other will go onto the web.  This works and there is an
example of it in there after an if statement that is false.

Original issue reported on code.google.com by [email protected] on 26 Jul 2007 at 7:11

Visualization in General

On useful visualization component would be to show the value of all the
next actions.... this would allow the user to visualize things more easily.

However, perhaps this should be an agent simulator....

It could be a generic component, like the value function visualizor.  I
guess you could literally have people turn on and off different
visualizations (in the viz program), like turning on or off a monitor of
various conditions.  Basically, you could make it like a "build you own"
visualizer. Hmm.

Original issue reported on code.google.com by [email protected] on 26 Jul 2007 at 1:56

Figure out Jar signing

Find out about signing jars and obfuscating source or class files.

See what we can do about building an appropriate manifest too, with
copyright and credits.

Original issue reported on code.google.com by [email protected] on 12 Oct 2007 at 4:40

Building strings in java needs to be done with a string buffer

Java strings are immutable, and if you are building big ones they are horribly 
slow.  So, to serialize 
1600 observations that have 2 doubles each, building immutable strings took 
2182 ms.  This is 
compared to 240 ms to actually send the message over the network and receive a 
response.

After changing from immutable strings to using a stringbuffer and calling 
append, the time 
dropped from 2182 to 7 ms.

That's crazy.

Original issue reported on code.google.com by [email protected] on 23 Jul 2007 at 3:34

RL-Glue should be an externals link in the competition package

I thought I made this issue but I Can't find it.

I'd like the RL-Glue directory in the root of the training pack to actually
be an svn:externals to the RL-Glue repository.  I don't like the idea of
maintaining  2 copies of that code.

Adam has found some debug spew we need to clean up, and I think I have too
-- the JAVA rl glue seems to be printing task specs out.

Assigned to Mark Lee b/c he's the RL-Glue and SVN expert.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 9:50

Use the version

I created a mechanism for asking the environment for his RLVizVersion, and
I stored it in RLGlueLogic.

We need a good way to query this before starting down paths that will make
us send queries that won't be answered.


Original issue reported on code.google.com by [email protected] on 26 Jul 2007 at 7:41

Spelling of Queryable agent interface is wrong

In RLVizLib, there is an interface that is called QuerableAgent, it should
be Queryable agent.  We should fix this and propagate the changes to all of
our docs and sample agents, and the viz components and messages that use this.

Original issue reported on code.google.com by [email protected] on 5 Oct 2007 at 8:57

Find out if we need to unload

Right now the consoleTrainers don't really unload, they just load and load
and load again. Is this bad?

Look for memory leaks or other bad possibilities.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 11:11

Action Testing

For now, we are going to crash with a stack trace if invalid actions are
selected.  In future (competition) we should keep running more robustly.

Original issue reported on code.google.com by [email protected] on 3 Aug 2007 at 6:30

MountainCarAgent occasionally has double free errors

When I run MountainCarAgent against consoleTrainerJava, sometimes I get
double free errors.  I think it might be that cleanup() is or isn't being
called.

Should look into that, cause it looks bad.

RL_agent(7914) malloc: ***  Deallocation of a pointer not malloced:
0x3003e0; This could be a double free(), or free() called with the middle
of an allocated block; Try setting environment variable MallocHelp to see
tools to help debug
RL_agent(7914) malloc: ***  Deallocation of a pointer not malloced:
0x300400; This could be a double free(), or free() called with the middle
of an allocated block; Try setting environment variable MallocHelp to see
tools to help debug
RL_agent(7914) malloc: ***  Deallocation of a pointer not malloced:
0x3003e0; This could be a double free(), or free() called with the middle
of an allocated block; Try setting environment variable MallocHelp to see
tools to help debug
RL_agent(7914) malloc: ***  Deallocation of a pointer not malloced:
0x300400; This could be a double free(), or free() called with the middle
of an allocated block; Try setting environment variable MallocHelp to see
tools to help debug

Original issue reported on code.google.com by [email protected] on 12 Oct 2007 at 10:42

Copyright messages

Probably every single file we send out should have ownership and copyright
messages, as well as a license.

We should get on settling what this will be.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 7:31

Fault Tolerance

Need fault tolerance in the protocol.  Need a query system to see if the
agent or environment supports it, and everthing needs to work well if they
don't.

Original issue reported on code.google.com by [email protected] on 25 Jul 2007 at 11:50

SVN Externals to the C/C++ Files we need

The C/C++ files we need (ParameterHolder et. al?) probably shouldn't exist
both in the rl-competition repository and in the library.  We should decide
which is the "right" copy (probably the library) and then only point to it.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 10:27

Need a more robust visualizer system

There should be a folder of jars and .dylibs (maybe just jars) for
visualizing and all of the visualizer specific code should be OUT of the
source for bt-Viz and the VizLib.  They should be independent, either
inside the environment project or someplace else.

Original issue reported on code.google.com by [email protected] on 18 Sep 2007 at 4:58

Things need to be faster

Should investigate on how to make all of the connectivity stuff faster. 
Not sure if that should be via compressions, or being judicious, but I'd
like the visualization to be even faster and able to handle higher
resolution of value functions.

Original issue reported on code.google.com by [email protected] on 25 Jul 2007 at 11:48

ParameterHolder updates

1)  We should figure out exactly what characters that break the parameter
Holder (like currently, having :," ", or "_" in either the names of the
parameters or their values (if using strings) might break things.

So, I propose a few changes
a) When parsing messages in RL-Viz, we shouldn't split on spaces all the
time, we should use spaces to split, TO, FROM, CMD, etc... but when we get
to VALS (payload) we should just take the whole rest of the string.  This
means that spaces in the payload won't break the payload parse.  This
should be one change in GenericMessage.

b) In ParameterHolder, when someone sets a variable name or a Strign
parameter, we should EXPLICITLY check right then for "_" and ":" and throw
an exception or die if we can't use it.  Or else we can replace it with
another character like ; and print a warning.  If we do this on both sides,
then invalidly specified parameters will work without crashing

c)  Test all of this and make sure we can't break it.

Original issue reported on code.google.com by [email protected] on 30 Jul 2007 at 6:25

Documentation

We're pretty desperate for documentation.

Someone needs to put together a getting started guide and get it online.

This has to wait a few days, until the trainers and how to run them settles
a bit.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 7:32

Website updates

There are various updates that need to be made to the competition website,
not sure what they all are, but that's important.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 10:16

Tetris needs a way to communicate the rows and cols to the agent

In Tetris, its important to tell the agent the number of Rows and Columns.
 We've decided it makes sense to use the messaging system for this.  So:

1)  The benchmark needs a way to ask the environment for the rows and cols
2)  The benchmark needs to send these to the agent
3)  The agent needs to parse these

We should be able to do this on the environment side using RLViz messages,
and on the agent side using  RLViz messages on the Java side and some
simple C/Python code otherwise.

Original issue reported on code.google.com by [email protected] on 10 Oct 2007 at 7:41

Python Trainer

We should create a Python Trainer that does exactly what the Java and CPP
trainers do.

Original issue reported on code.google.com by [email protected] on 11 Oct 2007 at 11:23

Setup Testing Harness

Here's what would be cool.

A bash script, which when run, goes through a bunch of scenarios, like
running a bunch of combinations of agents and trainers, such that in 60
seconds, one of us can verify that the combinations of things all work.

It's not perfect, but it would help.

Original issue reported on code.google.com by [email protected] on 12 Oct 2007 at 9:26

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.