gmontamat / gentun Goto Github PK
View Code? Open in Web Editor NEWHyperparameter tuning for machine learning models using a distributed genetic algorithm
License: Apache License 2.0
Hyperparameter tuning for machine learning models using a distributed genetic algorithm
License: Apache License 2.0
I use GoogleColab
First i installed using this command :
%%bash
git clone https://github.com/gmontamat/gentun
cd gentun
python setup.py install
then i imported package using this command
from gentun import GeneticAlgorithm, Population, XgboostIndividual
i face this error
ImportError Traceback (most recent call last)
<ipython-input-2-a67f03c5af3f> in <module>()
1 from sklearn.datasets import fetch_california_housing
----> 2 from gentun import GeneticAlgorithm, Population, XgboostIndividual
ImportError: cannot import name 'GeneticAlgorithm' from 'gentun' (unknown location)
---------------------------------------------------------------------------
NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.
To view examples of installing some common dependencies, click the
"Open Examples" button below.
---------------------------------------------------------------------------
after excuting this command output is
import gentun
dir(gentun)
['__doc__',
'__file__',
'__loader__',
'__name__',
'__package__',
'__path__',
'__spec__']
The distributed version of the algorithm receives results from the workers without validating their work. This is safe when one has control of all the instances running the worker nodes but it is dangerous when one shares the queue of jobs on the internet.
When opening the genetic algorithm to the world, a malicious user may return a fake metric claiming to have trained the model using k-fold cross validation, corrupting the overall result of the parameter tuning.
Some options to prevent this from happening are to:
Hi, in this test file,
import mnist
but no module named mnist.
I guess probably that some directory or files named as mnist.py is missing?
Hi, thanks for the code.
I am trying to run the distributed training however, I have got below error:
Using TensorFlow backend.
Initializing a random population. Size: 20
Starting genetic algorithm...
Evaluating generation #1...
Traceback (most recent call last):
File "/media/vip/Program/mobeen/gentun/tests/mnist_server.py", line 23, in <module>
ga.run(50)
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 29, in run
self.evolve_population()
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 72, in evolve_population
fittest = self.population.get_fittest()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 105, in get_fittest
self.evaluate_in_parallel()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 113, in evaluate_in_parallel
RpcClient(None, None, **self.credentials).purge()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 31, in __init__
result = self.channel.queue_declare(exclusive=True)
TypeError: queue_declare() missing 1 required positional argument: 'queue'
Process finished with exit code 1
I solved this error as suggested here giving an empty queue.
After this I got another error as below:
Using TensorFlow backend.
Initializing a random population. Size: 20
Starting genetic algorithm...
Evaluating generation #1...
Traceback (most recent call last):
File "/media/vip/Program/mobeen/gentun/tests/mnist_server.py", line 23, in <module>
ga.run(50)
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 29, in run
self.evolve_population()
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 72, in evolve_population
fittest = self.population.get_fittest()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 105, in get_fittest
self.evaluate_in_parallel()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 113, in evaluate_in_parallel
RpcClient(None, None, **self.credentials).purge()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 33, in __init__
self.channel.basic_consume(self.on_response, no_ack=True, queue=self.callback_queue)
TypeError: basic_consume() got an unexpected keyword argument 'no_ack'
Process finished with exit code 1
By the feel of these errors I think I am using some other version of RabbitMQ.
I installed RabbitMQ using:
sudo apt-get install rabbitmq-server
Is it a version mismatch or am I missing something?
Publish this library in PyPI.
@ahmadmobeen I was able to replicate your problem, thanks for reporting it. It's due to pika updating its API. Please use pika==1.1.0 (which should be compatible with your rabbitmq version) and the
dev
branch code of client.py and server.py. I will merge withmaster
sometime later.edit: I'm also working on the dockerfile for the gentun server to simplify running the distributed version of the genetic algorithm.
I tried this method and it worked for generation # 1.
@gmontamat Thank you for your reply. I was able to run the code by doing some small changes in server.py and client.py as shown here
I tried this and it worked for generation # 1.
got the following error in both cases:
Using TensorFlow backend.
Initializing a random population. Size: 20
Starting genetic algorithm...
Evaluating generation #1...
[*] Got fitness for individual 0
[*] Got fitness for individual 2
[*] Got fitness for individual 1
[*] Got fitness for individual 4
[*] Got fitness for individual 3
[*] Got fitness for individual 6
[*] Got fitness for individual 8
[*] Got fitness for individual 10
[*] Got fitness for individual 5
[*] Got fitness for individual 9
[*] Got fitness for individual 12
[*] Got fitness for individual 11
[*] Got fitness for individual 7
[*] Got fitness for individual 13
[*] Got fitness for individual 16
[*] Got fitness for individual 15
[*] Got fitness for individual 17
[*] Got fitness for individual 14
[*] Got fitness for individual 19
[*] Got fitness for individual 18
Fittest individual is:
{'S_1': '011', 'S_2': '1000010100'}
Fitness value is: 0.9978
Evaluating generation #2...
Traceback (most recent call last):
File "/media/vip/Program/mobeen/gentun/tests/mnist_server.py", line 23, in <module>
ga.run(50)
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 29, in run
self.evolve_population()
File "/media/vip/Program/mobeen/gentun/gentun/algorithms.py", line 72, in evolve_population
fittest = self.population.get_fittest()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 105, in get_fittest
self.evaluate_in_parallel()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 113, in evaluate_in_parallel
RpcClient(None, None, **self.credentials).purge()
File "/media/vip/Program/mobeen/gentun/gentun/server.py", line 28, in __init__
self.connection = pika.BlockingConnection(self.parameters)
File "/home/vip/anaconda3/envs/keras/lib/python3.7/site-packages/pika-1.1.0-py3.7.egg/pika/adapters/blocking_connection.py", line 359, in __init__
self._impl = self._create_connection(parameters, _impl_class)
File "/home/vip/anaconda3/envs/keras/lib/python3.7/site-packages/pika-1.1.0-py3.7.egg/pika/adapters/blocking_connection.py", line 450, in _create_connection
raise self._reap_last_connection_workflow_error(error)
pika.exceptions.ProbableAuthenticationError: ConnectionClosedByBroker: (403) 'ACCESS_REFUSED - Login was refused using authentication mechanism PLAIN. For details see the broker logfile.'
Process finished with exit code 1
Originally posted by @ahmadmobeen in #17 (comment)
Why is my recognition rate 90% at the beginning, and where can I adjust it to start from 0?
We need to retrieve test datasets from somewhere else or update the scikit learn library so that MNIST download works again.
Redis is gaining more traction and it's simpler to use and setup than RabbitMQ. Replace the latter with Redis.
Use GitFlow branching strategy
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.