Comments (5)
我检查了docker ps,
federatedai/federation:1.0.2-release "/bin/sh -c 'cd /dat…" About an hour ago Up 25 minutes 9394/tcp confs-10004_federation_1
这个容器一直没有启动,我重复启动几次都失败了。
重新运行sh docker-auto-deploy.sh解决了这个问题。如果有人明白之前发生了什么错误,可以继续评论,谢谢!
from kubefate.
我部署1.3的时候出现了同样的问题,但是confs-10000_federation_1容器正常启动,部署单一服务器运行toyexample没有问题,但是部署两方服务器运行失败,报错如下,您现在知道是什么原因了吗?
stdout:{
"retcode": 100,
"retmsg": "rpc request error: <_Rendezvous of RPC that terminated with:\n\tstatus = StatusCode.UNAVAILABLE\n\tdetails = "io exception"\n\tdebug_error_string = "{"created":"@1591020347.435712485","description":"Error received from peer","file":"src/core/lib/surface/call.cc","file_line":1017,"grpc_message":"io exception","grpc_status":14}"\n>"
}
Traceback (most recent call last):
File "run_toy_example.py", line 197, in
exec_toy_example(runtime_config)
File "run_toy_example.py", line 161, in exec_toy_example
jobid = exec_task(dsl_path, runtime_config)
File "run_toy_example.py", line 91, in exec_task
"failed to exec task, status:{}, stderr is {} stdout:{}".format(status, stderr, stdout))
ValueError: failed to exec task, status:100, stderr is None stdout:{'retcode': 100, 'retmsg': 'rpc request error: <_Rendezvous of RPC that terminated with:\n\tstatus = StatusCode.UNAVAILABLE\n\tdetails = "io exception"\n\tdebug_error_string = "{"created":"@1591020347.435712485","description":"Error received from peer","file":"src/core/lib/surface/call.cc","file_line":1017,"grpc_message":"io exception","grpc_status":14}"\n>'}
from kubefate.
遇到了同样的问题。请问问题解决了吗? @yuyan110nan
from kubefate.
我是用阿里云服务器搭建的两台服务器,云服务器之前如果是不同地域的话默认通讯端口不通,需要开通高速通道(收费)创建安全组后才能开放端口
from kubefate.
相同地域的话要开放哪些端口才能正常运行啊。
from kubefate.
Related Issues (20)
- 在guest方命令行flow mode load 报错 HOT 1
- docker-compose部署所需要的docker和docker-compose版本是什么? HOT 1
- 加载模型出错 HOT 1
- No module named 'fate_llm' HOT 3
- 运行docker_deploy.sh报错
- 升级后训练模型时出错 HOT 1
- kubefate中的fate-serving无法启动服务
- feldmanverifiablesum未获得结果 HOT 1
- KubeFATE的docker-compose部署脚本有一个bug
- kubefate serving proxy: unable to create new native thread HOT 1
- Is kubefate ready for fate 2.0 HOT 3
- 想部署并使用fate LLM报错
- k8s 部署时报helm install error, failed to download "kubefate/fate" at version "v2.0.0-beta" 怎么解决 HOT 3
- KubeFATE v2.0.0 docker-deploy文档有误 HOT 1
- 请问下docker-compose安装的方式,有没有哪个设置可以挂载外部路径 HOT 1
- 1.10.0 k8s deployment hdfs error 255
- 请问"no service id"是什么原因造成的
- 多方构建FATE时的两个部署问题 HOT 2
- 建议v2.0.0文档补充
- k8s部署2.0.0版本,最后部署fate服务报helm install Cluster error HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kubefate.