Giter VIP home page Giter VIP logo

Comments (6)

wangqi811 avatar wangqi811 commented on May 23, 2024

首先确认是否接的是部署chengYing的机器,如果是的话,需要注释/data/easymanager/environment_init.sh中ntp相关代码;
否则进入部署chengying机器: 修改/data/easymanager/environment_init.sh,将第二行set -ex注释掉,然后重启agent: sh /opt/dtstack/easymanager/easyagent/easyagent.sh restart观察报错信息

from chengying.

danny-zhu avatar danny-zhu commented on May 23, 2024

我这边添加了两个主机节点,一个是chengying的服务器自身(192.168.14.6),另一个是其他服务器(192.168.14.223),这两台主机都是添加失败。14.6这个主机我可以按照你说的方法试试,不过现在我要怎么删除主机重新添加呢?下架的时候提示agent在运行,无法下架,没有找到可以停止的agent的操作入口,于是我就把主机节点上的agent两个进程杀掉,但是不起效果。过了一会儿chengying上还是显示agent在运行中,无法删除,然后过了好大一会儿再去主机节点上看,agent进程又自动重启了。所以我想问要怎么操作来下架主机,官方的操作手册上是否可以完善说明,另外对于大家的提问,是否可以整理到QA的部分,方便高效排查问题。

from chengying.

danny-zhu avatar danny-zhu commented on May 23, 2024

解决了:1. 先把已部署的组件卸载;2. 去主机节点上把/opt/dtstack/easymananger目录删除;3. 在主机节点上杀掉easyagent进程;4. 等个一小会儿chengying判断心跳丢失,认为主机下线,然后就可以下架主机,重新添加了。

另外,我想问一下为么主机初始化要删除unzip的命令,搞不懂,很奇怪呀。
image

from chengying.

danny-zhu avatar danny-zhu commented on May 23, 2024

很诡异呀,初始化脚本删除unzip命令,部署组件的时候又要用unzip来解压/opt/dtstack/tmp目录下的zip包,解压不了然后就失败了。往192.168.14.223部署组件,部署失败,查看/opt/dtstack/tmp目录下TopLab.zip还在 /opt/dtstack/top-lab/TopLab目录下是空的,下面的是部署日志:

*************************** 1.TopLab ***************************
部署服务名称: TopLab
部署服务版本: 1.0.1
部署开始时间: 2023-09-14 09:24:39
部署结束时间: 2023-09-14 09:24:41
部署结果: install fail
部署摘要: exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/./start.sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'

部署配置: 
{
	"ServiceDisplay": "top-lab",
	"Version": "1.0.1",
	"Instance": {
		"Logs": [
			"logs/output.log"
		],
		"HealthCheck": {
			"Shell": "curl http://192.168.14.6:8097/healthcheck",
			"Period": "1m",
			"StartPeriod": "30s",
			"Timeout": "10s",
			"Retries": 3
		},
		"RunUser": "root",
		"Cmd": "./start.sh start \u003e logs/top-lab.log 2\u003e\u00261",
		"PostDeploy": "",
		"PostUpGrade": "",
		"PostUndeploy": "",
		"UnInstall": "",
		"PrometheusPort": "",
		"MaxReplica": "3",
		"UpdateRecreate": true
	},
	"Group": "top-ai",
	"Config": {
		"self_ip": {
			"Default": {
				"Host": [
					"localhost.localdomain"
				],
				"IP": [
					"192.168.14.223"
				],
				"NodeId": 1,
				"Select": null,
				"SingleIndex": 0,
				"UnSelect": null
			},
			"Desc": "internal",
			"Type": "internal",
			"Value": {
				"Host": [
					"localhost.localdomain"
				],
				"IP": [
					"192.168.14.223"
				],
				"NodeId": 1,
				"Select": null,
				"SingleIndex": 0,
				"UnSelect": null
			}
		},
		"service_port": {
			"Default": "8097",
			"Desc": "internal",
			"Type": "internal",
			"Value": "8097"
		},
		"top_lab_ip_port": {
			"Default": "192.168.14.223:8097",
			"Desc": "internal",
			"Type": "internal",
			"Value": "192.168.14.223:8097"
		}
	},
	"BaseProduct": "",
	"BaseProductVersion": "",
	"BaseService": "",
	"BaseParsed": false,
	"BaseAtrribute": ""
}
部署事件: 
+------------+---------------- 1 --------------------------+
| 组件信息     | 1
| 事件类型     | install
| 事件时间     | {2023-09-14 09:24:42 +0800 CST true}
| 事件结果     | exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/./start.sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'

| 事件详情     | api/v2/instance/1/event?eventId=156

+------------+---------------- 2 --------------------------+
| 组件信息     | 1
| 事件类型     | uninstall
| 事件时间     | {2023-09-13 09:49:48 +0800 CST true}
| 事件结果     | uninstall success
| 事件详情     | api/v2/instance/1/event?eventId=20

+------------+---------------- 3 --------------------------+
| 组件信息     | 1
| 事件类型     | install
| 事件时间     | {2023-09-12 19:20:57 +0800 CST true}
| 事件结果     | exec agent install err: + agent_zip=TopLab.zip
+ app_dir=/opt/dtstack/top-lab/TopLab
+ agent_bin=/opt/dtstack/top-lab/TopLab/sh
+ run_user=root
+ data_dir=
+ unzip_tmp_dir=/opt/dtstack/tmp
+ DOWNLOAD_URL=http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ trap '[ "$?" -eq 0 ] || read -p "Looks like something went wrong in step ´$STEP´"' EXIT
+ install
+ STEP='install agent'
+ echo 'Use the curl download and install Please Waiting...'
Use the curl download and install Please Waiting...
+ '[' '!' -d /opt/dtstack/tmp ']'
+ mkdir -p /opt/dtstack/tmp
+ cd /opt/dtstack/tmp
+ curl -L -O -s http://192.168.14.6:8864/easyagent/top-lab/1.0.0/TopLab.zip
+ install_agent
+ mkdir -p /opt/dtstack/top-lab/TopLab
+ unzip -o /opt/dtstack/tmp/TopLab.zip -d /opt/dtstack/top-lab/TopLab
+ '[' 127 -eq 0 ']'
+ read -p 'Looks like something went wrong in step ´install agent´'

| 事件详情     | api/v2/instance/1/event?eventId=1

from chengying.

danny-zhu avatar danny-zhu commented on May 23, 2024

我现在手动在主机上把unzip和zip命令装好,重新部署组件,又提示没有权限,可是我给的用户已经具备了sudo+NOPASSWD的权限呀,这为什么提示无权限呢。
昨天搞了一天也是各种稀奇古怪的问题,准备放弃了。。。

from chengying.

danny-zhu avatar danny-zhu commented on May 23, 2024

重新添加主机,更换root用户执行(主机编辑界面如能支持修改用户就方便了,现在必须重复#41 (comment) 里面的4步,再重新添加主机,麻烦呀)。
更换root用户,主机初始化脚本会删除unzip和zip命令,然后手动重新安装这两个工具,执行部署组件操作,这次又提示脚本不存在

| 事件结果 | exec start err: run agent 1835428a-c6e0-47da-970b-171da52a406b error: fork/exec TopLab/start.sh: no such file or directory
可是查看文件是存在的呀,/opt/dtstack/top-lab/TopLab/start.sh ,真是崩溃啊

from chengying.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.