Comments (6)
首先确认是否接的是部署chengYing的机器,如果是的话,需要注释/data/easymanager/environment_init.sh中ntp相关代码;
否则进入部署chengying机器: 修改/data/easymanager/environment_init.sh,将第二行set -ex注释掉,然后重启agent: sh /opt/dtstack/easymanager/easyagent/easyagent.sh restart观察报错信息
from chengying.
我这边添加了两个主机节点,一个是chengying的服务器自身(192.168.14.6),另一个是其他服务器(192.168.14.223),这两台主机都是添加失败。14.6这个主机我可以按照你说的方法试试,不过现在我要怎么删除主机重新添加呢?下架的时候提示agent在运行,无法下架,没有找到可以停止的agent的操作入口,于是我就把主机节点上的agent两个进程杀掉,但是不起效果。过了一会儿chengying上还是显示agent在运行中,无法删除,然后过了好大一会儿再去主机节点上看,agent进程又自动重启了。所以我想问要怎么操作来下架主机,官方的操作手册上是否可以完善说明,另外对于大家的提问,是否可以整理到QA的部分,方便高效排查问题。
from chengying.
解决了:1. 先把已部署的组件卸载;2. 去主机节点上把/opt/dtstack/easymananger目录删除;3. 在主机节点上杀掉easyagent进程;4. 等个一小会儿chengying判断心跳丢失,认为主机下线,然后就可以下架主机,重新添加了。
另外,我想问一下为么主机初始化要删除unzip的命令,搞不懂,很奇怪呀。
from chengying.
很诡异呀,初始化脚本删除unzip命令,部署组件的时候又要用unzip来解压/opt/dtstack/tmp目录下的zip包,解压不了然后就失败了。往192.168.14.223部署组件,部署失败,查看/opt/dtstack/tmp目录下TopLab.zip还在 /opt/dtstack/top-lab/TopLab目录下是空的,下面的是部署日志:
*************************** 1.TopLab ***************************
部署服务名称: TopLab
部署服务版本: 1.0.1
部署开始时间: 2023-09-14 09:24:39
部署结束时间: 2023-09-14 09:24:41
部署结果: install fail
部署摘要: exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/./start.sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'
部署配置:
{
"ServiceDisplay": "top-lab",
"Version": "1.0.1",
"Instance": {
"Logs": [
"logs/output.log"
],
"HealthCheck": {
"Shell": "curl http://192.168.14.6:8097/healthcheck",
"Period": "1m",
"StartPeriod": "30s",
"Timeout": "10s",
"Retries": 3
},
"RunUser": "root",
"Cmd": "./start.sh start \u003e logs/top-lab.log 2\u003e\u00261",
"PostDeploy": "",
"PostUpGrade": "",
"PostUndeploy": "",
"UnInstall": "",
"PrometheusPort": "",
"MaxReplica": "3",
"UpdateRecreate": true
},
"Group": "top-ai",
"Config": {
"self_ip": {
"Default": {
"Host": [
"localhost.localdomain"
],
"IP": [
"192.168.14.223"
],
"NodeId": 1,
"Select": null,
"SingleIndex": 0,
"UnSelect": null
},
"Desc": "internal",
"Type": "internal",
"Value": {
"Host": [
"localhost.localdomain"
],
"IP": [
"192.168.14.223"
],
"NodeId": 1,
"Select": null,
"SingleIndex": 0,
"UnSelect": null
}
},
"service_port": {
"Default": "8097",
"Desc": "internal",
"Type": "internal",
"Value": "8097"
},
"top_lab_ip_port": {
"Default": "192.168.14.223:8097",
"Desc": "internal",
"Type": "internal",
"Value": "192.168.14.223:8097"
}
},
"BaseProduct": "",
"BaseProductVersion": "",
"BaseService": "",
"BaseParsed": false,
"BaseAtrribute": ""
}
部署事件:
+------------+---------------- 1 --------------------------+
| 组件信息 | 1
| 事件类型 | install
| 事件时间 | {2023-09-14 09:24:42 +0800 CST true}
| 事件结果 | exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/./start.sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'
| 事件详情 | api/v2/instance/1/event?eventId=156
+------------+---------------- 2 --------------------------+
| 组件信息 | 1
| 事件类型 | uninstall
| 事件时间 | {2023-09-13 09:49:48 +0800 CST true}
| 事件结果 | uninstall success
| 事件详情 | api/v2/instance/1/event?eventId=20
+------------+---------------- 3 --------------------------+
| 组件信息 | 1
| 事件类型 | install
| 事件时间 | {2023-09-12 19:20:57 +0800 CST true}
| 事件结果 | exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'
| 事件详情 | api/v2/instance/1/event?eventId=1
from chengying.
我现在手动在主机上把unzip和zip命令装好,重新部署组件,又提示没有权限,可是我给的用户已经具备了sudo+NOPASSWD的权限呀,这为什么提示无权限呢。
昨天搞了一天也是各种稀奇古怪的问题,准备放弃了。。。
from chengying.
重新添加主机,更换root用户执行(主机编辑界面如能支持修改用户就方便了,现在必须重复#41 (comment) 里面的4步,再重新添加主机,麻烦呀)。
更换root用户,主机初始化脚本会删除unzip和zip命令,然后手动重新安装这两个工具,执行部署组件操作,这次又提示脚本不存在
| 事件结果 | exec start err: run agent 1835428a-c6e0-47da-970b-171da52a406b error: fork/exec TopLab/start.sh: no such file or directory
可是查看文件是存在的呀,/opt/dtstack/top-lab/TopLab/start.sh ,真是崩溃啊
from chengying.
Related Issues (20)
- tc and cgroup are not implemented HOT 4
- SQL注入问题 HOT 1
- hadoop和hive安装失败 HOT 1
- chengying支持部署到国产化系统中吗? HOT 3
- 二维码失效了
- 帮助文档的快速体验地址无法访问
- [部署中心][集群详情][集群命令]无法展示产品包部署的真实日志。
- 添加主机时,如果主机与em是同一台会提示主机初始化失败 HOT 4
- 添加主机时IP地址处理bug
- 登录默认密码没有 HOT 2
- 添加主机操作系统版本7.4以下初始化失败 HOT 2
- dt-alert模块是什么作用,没看到源码和Dockerfile HOT 1
- config_paths挂载文件前台无法显示内容 HOT 3
- service name top-lab is invalid accessing config HOT 2
- 部署摘要: agent异常退出:agent run error(unexpected):stop supervisor: e8771b72-8e28-49d8-ab25-3b9ff4cbf37b HOT 7
- 健康检查的脚本,执行返回校验存在bug
- 启动不报错,却把已启动的服务给停掉了 HOT 3
- 这个项目是不维护了吗?
- chengying支持部署flink吗? HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chengying.