Comments (5)
哈哈 没有scrapy那么麻烦辣~~
其实很多人的需求很简单 要么是有反爬的要么是没有反爬的,我准备标准化三套代码 一套是直接爬的那种 不用登录什么限制都没有的 一套是加入模拟登录,加入随机UserAgent , 第三套是加入更换sessionid,cookie的那种
然后标准化输出,主要是 数据和舆情信息,类似你们的Mod的概念,我觉得ricequant的mod想法很赞.
这样大家就可以根据自己要爬的网站的需求,以及目标网站的反爬level进行自主选择框架,以标准化的形式存进数据库可以很容易集成到原来的框架里面来.
类似脚手架的感觉,这样比较方便公共代码复用
from quantaxis.
准备把爬虫框架组合,标准化一下,搞成爬虫脚手架
from quantaxis.
@yutiansut 哈哈 赞赞哒 你这是打算做一个scrapy啊
from quantaxis.
@yutiansut 恩恩 不过scrapy的 pipeline机制还是可以借鉴的 每个人的需求可能都比较简单,但每个人的业务场景未必是一样的,所以如果做通用性的scaffold 还是比较难的。
from quantaxis.
那个mod的模式确实挺好的
from quantaxis.
Related Issues (20)
- 初始化数据报错 HOT 5
- QA_User和QA_Risk是不是被弃用了,但是在qactabase.py中还有在使用的地方
- Quantaxis 2.0.0.dev33版本的回测样例 HOT 3
- docker环境下的安装失败 HOT 1
- 无法获取通达信板块数据,Error save_tdx.QA_SU_save_stock_block exception HOT 1
- 使用docker pull 时报网络错误 HOT 3
- 回测示例求教:基于RSRS的ETF轮动策略(closed) HOT 1
- 大数据AI推荐
- 安装依赖时,报错metadata-generation-failed
- 有港股的财务数据吗 HOT 1
- 报jqdatasdk not installed HOT 2
- 请问下大盘指数日线图和分时图是如何save 的,存在哪个表里
- 关于自由现金流FINONE(322)和(321)的精确定义?
- Failed to init data in quantaxis cli HOT 2
- qatrader运行异常 HOT 2
- qa沟通渠道问题 HOT 2
- QUANTAXIS 的一般/高级财务方法 报错 KeyError: '581'
- save all: xdxr 不会增量更新。
- QATdx 期货数据 get_history_transaction_data 分时或者分笔数据没有夜盘数据
- 有没有人知道如何通过docker搭建2.0版本 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from quantaxis.