Comments (1)
对于搜狗获取,情况如下:
- 进度:
- 2021-12-22: 基于
playwright
的数据获取脚本基本完成
- 2021-12-22: 基于
- 问题:
- 方案:2021-12-22: 基于
playwright
调用无头浏览器增加ua的形式进行微信最新文章抓取
数据格式:
{
"doc_author": "howie6879",
"doc_content": "",
"doc_ts": 1639702080,
"doc_date": "2021-12-17 08:48",
"doc_des": "本周推荐游戏程序员的读书笔记,致敬。",
"doc_id": "bd998b9c43ba2d91fd6be9f833ecb634",
"doc_image": "http://mmbiz.qpic.cn/mmbiz_jpg/YRBRJvZXcIVBtU4gtNsZrRQtDLDS725uEGsCGXHbq7GzfDK2KumHOSKkA6TiaWLia1co96EzPqHRoiac7w7wtqlkg/0?wx_fmt=jpeg",
"doc_keywords": [],
"doc_link": "https://mp.weixin.qq.com/s?src=11×tamp=1640227638&ver=3513&signature=KSf-sAynN5L4LZlsLccoZvT7BT2C6BOcinT77piilqyZnDkcBAy8xpN5o1E8XIKNlBei5CiWNuWJ7e8OzqzyvsY6Fr-aF60Sc6mXJLExQrCNDgGf1V-F8LmOuyCxPVZv&new=1",
"doc_name": "我的周刊(第018期)",
"doc_source": "2c_wechat",
"doc_source_account_intro": "编程、兴趣、生活",
"doc_source_account_nick": "howie_locker",
"doc_source_meta_list": [
"howie_locker",
"编程、兴趣、生活"
],
"doc_source_name": "老胡的储物柜",
"doc_type": "article"
}
from liuli.
Related Issues (20)
- 怎么在windows远程连接mongdb数据库? HOT 4
- 搜狗接口只能读到第一条文章 HOT 3
- 采集器有时候会抽风显示url无效 HOT 2
- 带有空格的公众号采集总是失败 HOT 7
- 希望增加功能,取消生成的RSS中的updated的变动 HOT 3
- 多人使用,推送消息的时候有人不喜欢我关注的微信公众号 HOT 1
- 采集的文章中如果含有视频,就会无法播放 HOT 1
- 如有公众号每天发布同样标题的文章,但内容不同时,备份到Github时会有冲突
- 为什么pro.env里要写27017? compose不是写了映射出去是27027吗? compose里的27027有何作用? HOT 2
- 有计划支持 SQLite 数据库吗? HOT 1
- Pipenv install looks not initializing Playwright correctly HOT 1
- 0.24版本参照教程无法启动schedule HOT 1
- ERROR Liuli 执行失败! HOT 3
- docker启动:database name cannot be the empty string HOT 1
- doc.liuli.io 无法打开 HOT 3
- 文章备份到本地是空白的,RSS全文解析时地址似乎自动识别的/liuli_wechat/老胡的储物柜之后的地址没有带上前缀 HOT 1
- Demo尝试失败
- 请教一下:RSS 输出链接为本地链接,而且content 为空 HOT 1
- 求助:日志正常,但备份器内容为空 HOT 1
- 采集失败,日志如下 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from liuli.