Comments (6)
In parallel, I was hoping to explore `gcloud compute ssh` to see if it would help us in any way, in case this doesn't work out.
from einstein.
I tried multiple times yesterday to create a cluster using the initialization action gs://uga-dsp/scripts/conda-dataproc-bootstrap.sh, but it started giving errors:
- Creating the conda folder
- Downloading Anaconda
- All the files get deleted (probably an error occurs during installation and everything is removed)
from einstein.
Today I was able to create the cluster with the following specs:
- region: us east
- default worker nodes and memory
- initialization action: gs://uga-dsp/scripts/conda-dataproc-bootstrap.sh
Please try to create a cluster and check whether you run into any problems.
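For anyone reproducing the specs above, here is a minimal sketch that assembles the matching `gcloud dataproc clusters create` call. The cluster name `test-cluster` and the region `us-east1` are assumptions for illustration (the comment only says "us east"), and the worker and memory settings are left out so the defaults apply. It only prints the command; it does not run it.

```python
import shlex


def build_create_command(cluster_name, region, init_action):
    """Return the argument list for creating a Dataproc cluster with one init action."""
    return [
        "gcloud", "dataproc", "clusters", "create", cluster_name,
        f"--region={region}",
        f"--initialization-actions={init_action}",
    ]


# Hypothetical name and region; the init action path is the one from this thread.
cmd = build_create_command(
    "test-cluster",
    "us-east1",
    "gs://uga-dsp/scripts/conda-dataproc-bootstrap.sh",
)
print(shlex.join(cmd))
```

Building the command as a list keeps it safe to pass to `subprocess.run` later without shell quoting issues.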
from einstein.
Two questions here:
(a) Can you point to a specific error in the logs that might have led to this? We could then reach out to Dr. Quinn accordingly.
(b) Is it more of a chance thing, where the script works sometimes and fails at other times? If so, do you think it is worth spending time on it, given the deadlines?
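On (a), one possible way to look for the error is to read the initialization action's log on the master node over `gcloud compute ssh`. The sketch below only builds that command. It assumes the usual Dataproc conventions as I understand them: the master node is named `<cluster>-m`, and the first init script logs to `/var/log/dataproc-initialization-script-0.log`. The cluster name and zone are hypothetical.

```python
import shlex


def build_init_log_command(cluster_name, zone, script_index=0, lines=50):
    """Return a gcloud command that tails an init action log on the master node."""
    master_node = f"{cluster_name}-m"  # assumed Dataproc master naming convention
    # Assumed log location for the init script at position `script_index`.
    log_path = f"/var/log/dataproc-initialization-script-{script_index}.log"
    return [
        "gcloud", "compute", "ssh", master_node,
        f"--zone={zone}",
        f"--command=tail -n {lines} {log_path}",
    ]


# Hypothetical cluster name and zone, for illustration only.
log_cmd = build_init_log_command("test-cluster", "us-east1-b")
print(shlex.join(log_cmd))
```

If the cluster is deleted after a failed init action, the node is gone too, so this only helps while the VM still exists.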
from einstein.
I don't have any specific logs relating to the error. Also, I don't think it is a chance thing, because the initialization action never failed today across the 5 clusters I created to test it.
The important thing I noticed is that if you try to use the VM instances as soon as the cluster is created, while its status is still 'provisioning', Anaconda is not installed yet. It takes almost 10 minutes before the cluster is ready with everything installed.
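Given that observation, one option is to poll the cluster state and only touch the VMs once it reports `RUNNING`. This is a rough sketch, not a tested workflow: `get_cluster_state` shells out to `gcloud dataproc clusters describe`, the 15-minute timeout and 30-second interval are arbitrary choices, and the state names (`CREATING`, `RUNNING`, `ERROR`) are the API values as I understand them, which may differ from the 'provisioning' label shown in the console.

```python
import subprocess
import time


def get_cluster_state(cluster_name, region):
    """Ask gcloud for the cluster's current state string (requires gcloud installed)."""
    result = subprocess.run(
        [
            "gcloud", "dataproc", "clusters", "describe", cluster_name,
            f"--region={region}",
            "--format=value(status.state)",
        ],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


def wait_until_running(get_state, timeout_s=900, interval_s=30, sleep=time.sleep):
    """Poll get_state() until it returns RUNNING; return False if the timeout passes."""
    waited = 0
    while True:
        state = get_state()
        if state == "RUNNING":
            return True
        if state == "ERROR":
            raise RuntimeError("cluster entered the ERROR state")
        if waited >= timeout_s:
            return False
        sleep(interval_s)
        waited += interval_s
```

Usage would look like `wait_until_running(lambda: get_cluster_state("test-cluster", "us-east1"))`, where the name and region are placeholders. Passing the state lookup in as a callable makes the loop easy to check without a real cluster.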
from einstein.
That's weird, considering it didn't take that long during P1.
Okay, we will go ahead and close this issue here, and come back to addressing consistency in initialization actions when we are well ahead with the code base.
from einstein.