Hello, many thanks for providing your great library. I tried to deve

Is it possible to run Ridge() without memmap? (reservoirpy, closed)

reservoirpy commented on May 24, 2024

Is it possible to run Ridge() without memmap?

Comments (2)

nTrouvain commented on May 24, 2024

Hello Peter !

For now, memory-mapped arrays (memmaps) are required to use the Ridge node, because it was designed to allow parallel computation of the linear regression. This parallel computation relies on arrays shared between processes, and the only safe and easy way to share them is through memory-mapped objects. This behavior will probably change in the future, as memory-mapped arrays are not well supported on all platforms.
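To illustrate the idea of a file-backed array that several processes can open, here is a minimal sketch using only `numpy.memmap`. It is not ReservoirPy's actual code: the file name `XXT.dat`, the array shape, and the accumulated quantity are assumptions chosen for the example.

```python
import os
import tempfile

import numpy as np

# Illustrative only: a file-backed array that several processes could open.
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "XXT.dat")  # hypothetical file name

# Writer: create the memory-mapped array (zero-filled) and accumulate into it.
shared = np.memmap(path, dtype=np.float64, mode="w+", shape=(3, 3))
x = np.arange(6, dtype=np.float64).reshape(2, 3)
shared[:] += x.T @ x
shared.flush()  # write pending changes to the file on disk

# Reader: a second handle (possibly in another process) sees the same data.
view = np.memmap(path, dtype=np.float64, mode="r", shape=(3, 3))
same = np.allclose(view, x.T @ x)
```

Because every handle points at the same file, two unrelated objects that open the same path would also read and write the same buffer, which is convenient for worker processes but can be a problem when independent copies of a model are created.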

Could you provide a more explicit example of what you are trying to do?

Also, thank you very much for your interest in ReservoirPy. Since a scikit-learn adapter is part of the library's planned features, do not hesitate to ask for help, submit your code through a pull request, or suggest changes to the current code. We would be really happy to count you among the contributors!

renierts commented on May 24, 2024

Hi Nathan!

Thanks for your explanation. I already thought that this was the reason for using memory mapping.

In PyRCN, we build on scikit-learn base classes (BaseEstimator etc.) to define our building blocks. The advantage is that we can then use tools such as RandomizedSearchCV for hyperparameter tuning.

A very simple example of what I want to do is the following code snippet (taken from the RandomizedSearchCV documentation):

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RandomizedSearchCV
from scipy.stats import uniform


iris = load_iris()
logistic = LogisticRegression(solver='saga', tol=1e-2, max_iter=200,
                              random_state=0)
distributions = dict(C=uniform(loc=0, scale=4),
                     penalty=['l2', 'l1'])
clf = RandomizedSearchCV(logistic, distributions, random_state=0)  # TODO: replace logistic by an ESN
search = clf.fit(iris.data, iris.target)
search.best_params_

Now, I need an adapter so that I can replace logistic with an ESN from reservoirpy. This seems to work fine for everything except the memory mapping. The problem is that RandomizedSearchCV copies the object to be optimized. I assume that multiple instances then try to access the memory-mapped files at the same time.

Do you have an offline regression node that can fit the linear regression sequentially, without memory mapping? That would already solve my problem.
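As a rough sketch of what such a sequential fit could look like, the class below accumulates the matrices X^T X and X^T Y batch by batch in ordinary in-memory arrays and solves the ridge system at the end. `SequentialRidge`, `partial_fit`, and `solve` are hypothetical names for illustration only; this is not a reservoirpy node, and it has no bias term.

```python
import copy

import numpy as np


class SequentialRidge:
    """Hypothetical in-memory ridge readout; not part of reservoirpy."""

    def __init__(self, ridge=1e-6):
        self.ridge = ridge
        self.XTX = None  # running sum of X^T X
        self.XTY = None  # running sum of X^T Y

    def partial_fit(self, X, Y):
        X = np.asarray(X, dtype=float)
        Y = np.asarray(Y, dtype=float)
        if self.XTX is None:
            self.XTX = np.zeros((X.shape[1], X.shape[1]))
            self.XTY = np.zeros((X.shape[1], Y.shape[1]))
        # Plain arrays owned by this instance: no files, no shared buffers.
        self.XTX += X.T @ X
        self.XTY += X.T @ Y
        return self

    def solve(self):
        # Closed-form ridge solution: W = (X^T X + ridge * I)^-1 X^T Y
        n = self.XTX.shape[0]
        self.Wout = np.linalg.solve(self.XTX + self.ridge * np.eye(n), self.XTY)
        return self

    def predict(self, X):
        return np.asarray(X, dtype=float) @ self.Wout


rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
W_true = rng.normal(size=(5, 2))
Y = X @ W_true

model = SequentialRidge(ridge=1e-8)
for start in range(0, 200, 50):  # feed the data in four batches
    model.partial_fit(X[start:start + 50], Y[start:start + 50])
model.solve()

# A copy (a stand-in for the copies a search makes) has its own state.
twin = copy.deepcopy(model)
twin.partial_fit(X[:50], Y[:50])
```

Since all state lives in arrays owned by each instance, updating `twin` leaves `model` untouched, so copies made during a hyperparameter search would not compete for the same files. The trade-off is that this version runs in a single process and gives up the parallel accumulation that the memory-mapped design enables.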

Regarding your plan to add an adapter between reservoirpy and scikit-learn: I will definitely share the adapter with you as soon as it works.
