Giter VIP home page Giter VIP logo

rad's Issues

登录功能不可用

rad -t http://example.com -wait-login
使用该命令,命令提示“进行手动登录”,但是登录界面未加载,浏览器为空白页,任意键后浏览器加载目标页面,直接进入扫描,手动输入功能不可用。

建议增加忽略证书校验的功能

在命令行或者配置文件中加入是否忽略证书校验的选项,这样被动扫描的时候就不用倒入扫描器的证书了,更方便一些。

只要在启动chrome的时候 添加 --ignore-certificate-errors 参数即可

新增功能建议

1、单个标签页的timeout时间设定
2、单个域名爬取时的最大时间设定

exec_path: "离谱"

离谱了,这么个问题折腾半小时。然后自己解决了
exec_path: 的值不能有双引号。
1

2

如过遇到路径有空格怎么办呢(我没测试,不知道支持不)

chrome failed to start

[ERRO] 2020-10-23 09:43:21 [default:radium.go:151] chrome failed to start:

chrome failed to start:
已添加过环境变量,chrome也在默认路径,也是最新版。
但xray的--browser-crawler和 rad都无法使用。
image

image

image

image

invalid memory address or nil pointer dereference

对webgoat进行登录爬取,并且在disallowed-urls里配置了logout的排除,但是没爬多少数据就报错了
环境:

Rad 0.4/ebbb3ce3

配置文件:

enable-image-display: false                  # 启用图片显示,适用于需要验证码登录的情况,启用wait-login自动开启
load-wait: 0                                 # 页面加载完毕后的等待时间,单位秒,网速不佳时可尝试调大该值
exec-path: ""                                # 启动chrome的路径
disable-headless: false                      # 禁用无头模式
request-config:                              # 请求头配置
  user-agent: ""                             # 请求user-agent配置
  headers:                                   # 请求header配置
  - key: ""                                  # header的key
    value: ""                                # header的value
  cookies:                                   # 请求cookie配置
  - name: ""                                 # cookie的name
    value: ""                                # cookie的value
restrictions-on-urls:                        # 对爬取的URL的一些限制项
  disallowed-suffix: []                      # 不允许的文件后缀
  disallowed-keywords-in-path-and-query: []  # 不允许的URL关键字
  disallowed-domain: []                      # 不允许的域名
  disallowed-urls: ["[a-z/]*logout[a-z/]*"]                        # 不允许的URL(正则)
  allowed-domains: []                        # 允许的域名,起始目标会被自动加入其中
  allowed-urls: []                           # 允许的URL(正则)
restrictions-on-requests:                    # 对请求行为的一些限制项
  max-concurrent: 10                         # 最大页面并发(不大于10)
  max-depth: 5                               # 最大页面深度限制
  max-click-depth: 5                         # 一个页面中最大点击深度限制
  max-count-of-page: 1000                    # 最多爬取的页面数量限制
  max-click-or-event-trigger: 1000           # 单个页面中最大点击或事件触发次数(不大于10000)
  click-or-event-interval: 1000              # 点击间隔,单位毫秒

image

http-equiv="refresh" 跳转跟随

rad是否可以提供这个功能,在遇到 http-equiv="refresh" 时,爬虫可以跟随到 /admin 路径下

<head>
       <meta http-equiv="refresh" content="1;URL='/admin'"/>
</head>

exec: "google-chrome": executable file not found in %PATH%错误怎么解决

原谅菜鸟,第一步就被绊倒了
rad_windows_amd64.exe打开-t xxxx.com,提示错误如下
image
chang's

[ERRO] 2020-10-14 18:16:49 [default:radium.go:151] exec: "google-chrome": executable file not found in %PATH%
exec: "google-chrome": executable file not found in %PATH%
尝试在系统设置-环境变量中添加google浏览器的路径,但仍然提示上述错误
image

rad运行会停止

  • 我把rad命令使用;``或&&`连接起来形成类似这样的语句的时候:
  • ./rad_linux_amd64 -t http://xxxx.com --http-proxy 127.0.0.1:7777 &&./rad_linux_amd64 -t http://xxasdaxx.com --http-proxy 127.0.0.1:7777 &&./rad_linux_amd64 -t https://xxasxxdaxx.com --http-proxy 127.0.0.1:7777 &&./rad_linux_amd64 -t https://xxxxasdaxx.com --http-proxy 127.0.0.1:7777 (当然比现在的还要长,大概上千个)
  • 会出现爬到某一个网站的某一个页面的时候直接停掉,就停在那里不动,也不进行下一个目标的爬取,也不结束程序,直到人为操控,这是咋回事呢?
  • 应该并不是另一边xray那边队列长度的限制,因为xray那边都扫描完了,这边的还是不动

功能建议

我看到rad是爬了一些接口,可以单独把js爬出来么🌹。
例如网页为

<script src="/js/jquery-1.11.0.min.js" charset="gbk"></script>
--
  | <script src="http://target1.com/new/pc/js/https_util.min.js?v=20180607"></script>
<script src="http://target2.com/new/pc/js/https_util.min.js?v=20180607"></script>
<script src="http://target3.com/new/pc/js/https_util.min.js?v=20180607"></script>

提取结果
``

0.4版本报错

root@iZt4n1sswjid2nrd8b0cetZ:~/testrad# ./rad_linux_amd64 -t https://example.com

██████╗ █████╗ ██████╗
██╔══██╗██╔══██╗██╔══██╗
██████╔╝███████║██║ ██║
██╔══██╗██╔══██║██║ ██║
██║ ██║██║ ██║██████╔╝
╚═╝ ╚═╝╚═╝ ╚═╝╚═════╝

[Rad 0.4/ebbb3ce3]
Build: [2021-04-30] [linux/amd64] [RELEASE/unknown]
Compiler Version: go version go1.15.6 linux/amd64

[ERRO] 2021-05-01 02:16:40 [rad:radium.go:209] [launcher] Failed to get the debug url: [0501/021639.748738:ERROR:zygote_host_impl_linux.cc(90)] Running as root without --no-sandbox is not supported. See https://crbug.com/638180.

[launcher] Failed to get the debug url: [0501/021639.748738:ERROR:zygote_host_impl_linux.cc(90)] Running as root without --no-sandbox is not supported. See https://crbug.com/638180.

功能建议

  1. 在爬取之前进行目录扫描,增加入口点(可加入选项)
  2. http隐藏参数爆破
  3. json格式输出(post包要带data)(最好可直接打印出来和存成文件)可参考crawlergo输出形式

添加--http-proxy 参数后爬取结果异常

如题,添加--http-proxy 参数后爬取结果异常

PS C:\Users\Admin\Downloads> .\rad_windows_amd64.exe -t http://testphp.vulnweb.com

██████╗  █████╗ ██████╗
██╔══██╗██╔══██╗██╔══██╗
██████╔╝███████║██║  ██║
██╔══██╗██╔══██║██║  ██║
██║  ██║██║  ██║██████╔╝
╚═╝  ╚═╝╚═╝  ╚═╝╚═════╝

[Rad 0.2/638f30fd]
Build: [2020-09-10] [linux/amd64] [RELEASE/__unknown__]
Compiler Version: go version go1.14.4 linux/amd64

[INFO] 2020-09-10 19:32:53 [collector:mitm.go:226] loading cert from ./ca.crt
[INFO] 2020-09-10 19:32:53 [collector:mitm.go:227] loading cert key from ./ca.key
[INFO] 2020-09-10 19:32:54 [collector:mitm.go:55] no global filter config found, use default
[INFO] 2020-09-10 19:32:54 [collector:mitm.go:108] no global mitm config found, use default
[INFO] 2020-09-10 19:32:54 [collector:mitm.go:264] starting mitm server at 127.0.0.1:64253
GET http://testphp.vulnweb.com/
POST http://testphp.vulnweb.com/search.php?test=query
GET http://testphp.vulnweb.com/index.php
GET http://testphp.vulnweb.com/categories.php
GET http://testphp.vulnweb.com/artists.php
GET http://testphp.vulnweb.com/disclaimer.php
GET http://testphp.vulnweb.com/AJAX/index.php
GET http://testphp.vulnweb.com/guestbook.php
GET http://testphp.vulnweb.com/userinfo.php
GET http://testphp.vulnweb.com/login.php
GET http://testphp.vulnweb.com/privacy.php
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/
GET http://testphp.vulnweb.com/AJAX/artists.php
GET http://testphp.vulnweb.com/hpp/
POST http://testphp.vulnweb.com/userinfo.php
GET http://testphp.vulnweb.com/AJAX/infoartist.php?id=1
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/Details/network-attached-storage-dlink/1/
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/Details/web-camera-a4tech/2/
GET http://testphp.vulnweb.com/AJAX/categories.php
GET http://testphp.vulnweb.com/AJAX/titles.php
GET http://testphp.vulnweb.com/listproducts.php?cat=1
GET http://testphp.vulnweb.com/signup.php
POST http://testphp.vulnweb.com/guestbook.php
GET http://testphp.vulnweb.com/hpp/?pp=12
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/BuyProduct-1/
POST http://testphp.vulnweb.com/AJAX/showxml.php
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/RateProduct-1.html
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/BuyProduct-3/
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/RateProduct-3.html
GET http://testphp.vulnweb.com/hpp/params.php?aaaa%2F=%26%2325552%3B%26%2320132%3B
GET http://testphp.vulnweb.com/artists.php?artist=1
GET http://testphp.vulnweb.com/comment.php?aid=1
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/BuyProduct-2/
GET http://testphp.vulnweb.com/Mod_Rewrite_Shop/RateProduct-2.html
GET http://testphp.vulnweb.com/hpp/params.php?p=valid&pp=12
POST http://testphp.vulnweb.com/secured/newuser.php
GET http://testphp.vulnweb.com/listproducts.php?artist=1
POST http://testphp.vulnweb.com/comment.php
GET http://testphp.vulnweb.com/showimage.php?file=./pictures/1.jpg
GET http://testphp.vulnweb.com/comment.php?pid=1
GET http://testphp.vulnweb.com/showimage.php?file=./pictures/1.jpg&size=160
GET http://testphp.vulnweb.com/product.php?pic=1
PS C:\Users\Admin\Downloads> .\rad_windows_amd64.exe -t http://testphp.vulnweb.com --http-proxy http://127.0.0.1:7777

██████╗  █████╗ ██████╗
██╔══██╗██╔══██╗██╔══██╗
██████╔╝███████║██║  ██║
██╔══██╗██╔══██║██║  ██║
██║  ██║██║  ██║██████╔╝
╚═╝  ╚═╝╚═╝  ╚═╝╚═════╝

[Rad 0.2/638f30fd]
Build: [2020-09-10] [linux/amd64] [RELEASE/__unknown__]
Compiler Version: go version go1.14.4 linux/amd64

GET http://testphp.vulnweb.com/
PS C:\Users\Admin\Downloads>

除了 http://testphp.vulnweb.com 这个站点之外,尝试过其他站点也存在这种情况,由于授权的原因,无法提交示例

设置不允许url关键字不起作用

设置如图,

设置register,forum,search为不爬取的关键字,但是rad还是爬取了包含这几个关键字的url,请问是我yaml语法错误吗?
dSPigU.png

运行rad报错

chrome-linux版本为79
[ERRO] 2021-10-28 15:05:48 [rad:radium.go:209] [launcher] Failed to get the debug url: [1028/150547.437603:ERROR:zygote_host_impl_linux.cc(89)] Running as root without --no-sandbox is not supported. See https://crbug.com/638180.

[launcher] Failed to get the debug url: [1028/150547.437603:ERROR:zygote_host_impl_linux.cc(89)] Running as root without --no-sandbox is not supported. See https://crbug.com/638180.

rad_config.yml

如果把rad添加到环境变量,rad_config.yml这个配置文件会生成在cmd运行的位置,默认是C:\User\user,找了半天才找到

新增功能建议

  • 看起来只能针对某个单一的域名进行爬取,建议增加一个批量读取的URL的

    • 如:本地新建txt文本,把需要爬取的目标放到里面,然后批量爬取
  • 另外只能针对高级版进行深度融合吗?社区办支持吗?文档里面没有说明~

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.