Comments (6)
我想问下,这个ps-plus跟ps-lite这个比较呢。里面的什么设计让它可以做到大规模的并发。我看到里面使用了seastar这个组件,是因为使用了这个组件末
from x-deeplearning.
广告场景下的模型训练有两个特点,一个是超高的稀疏特征维度,另一个是超大规模的样本数量,这两个特点决定了XDL需要在大规模并发(并发>200)的条件下保证系统性能具有良好的水平扩展能力,根据我们的实际测试,tensorflow在并发超过200的情况下的水平扩展能力已经很差,而XDL可以做到在1000并发内的近似线性扩展。
from x-deeplearning.
@songyue1104 这个并发说得是在线inference时的情况吗? 能给出一些细节和数据吗?
from x-deeplearning.
@songyue1104 麻烦再问下,这个有开发对应的webUI,类似tensorboard的可视化工具 以及history server这些吗
from x-deeplearning.
@esail 此处的并发指的是离线train时worker的并发度。ps-plus的failover本身会比较耗时,不能在线inference使用。开发这个模块的初衷是为了适应大规模数据下高并发的吞吐能力,这个是相比于tensorflow ps的核心优势,另外ps-plus模块还支持非id化数据存储查询,流式写出增量模型的功能。
from x-deeplearning.
@esail 配套的可视化调优工具目前在开发进程中。
from x-deeplearning.
Related Issues (20)
- Fail to find glog
- 集群化部署有例子可以验证是否成功么
- 训练时报错
- TDM中叶子结点的embdding是怎么来的
- insufficient processing units
- invalid batch
- TDM serving编译时报错
- AUC bug report
- TDM3.0 beam search采样不会很耗时吗 HOT 3
- 话说现在用TensorFlow分布式训练CTR模型的思路是? HOT 1
- 如何在标准文本数据格式中表示sparse feature的 segments HOT 1
- xdl有没有各个函数功能的详细文档 HOT 1
- build_final_net return a tuple but use as list
- pip install xdl
- help!!! Illegal instruction (core dumped)
- tdm 单机预测评估时卡住没反应 HOT 2
- SyncHook 没有在 worker 之间形成同步?
- ZfsKv是做什么用的?
- mac M2 芯片的可以安装xdl吗? HOT 1
- TDM 的train.py执行异常, item_emb 为空
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from x-deeplearning.