hell-to-heaven / crawl_vip_paid_manhua

A sudden whim: trying to use a crawler to get around the paywall and download paid manga chapters.

Python 100.00%

crawl_vip_paid_manhua's Introduction

crawl_vip_paid_manhua


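Update: I have no leads at all on the paid chapters. I can't work out how the URL parameters are constructed (the encrypted parameters show no discernible pattern, and I can't tell whether it's md5 or Tencent's own encryption scheme), not to mention selenium. After a whole day of staring at it my head is spinning, so I'm setting it aside for now.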

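Update:

After most of a day, I figured out two things:
1 Don't take on the big companies, especially the internet giants led by BAT. Their front-end engineers are no pushovers.
2 There's no need to be stubborn about it. Scraping a different manga site works just as well.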

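After switching sites, I found that this one doesn't use ajax to render its content either, yet for some reason the response I got back still didn't match the page source I saw in the browser.
So I had to fall back on selenium, and then, asking for trouble, I decided to use multiprocessing to crawl faster.
After that, all sorts of things went wrong:
* The IP proxy pool was filled with scraped free IPs, which turned out to be extremely unstable. An error came up roughly every half minute.
* I wasn't fluent with the retry module, and while debugging I found that the more I wrote, the more muddled the logic became.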
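
One way to keep retry logic from getting tangled is to isolate it in a single small helper. The sketch below is not the code from this repo and does not use the retry module mentioned above; it is a hand-written loop that swaps to the next proxy after each failure and waits a little longer each time. The names `fetch_with_retry`, `fetch`, and `proxies` are made up for illustration, and `fetch` stands in for whatever function actually loads a page through a given proxy.

```python
import time
from itertools import cycle


def fetch_with_retry(fetch, proxies, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Call fetch(proxy) until it succeeds, rotating proxies between attempts.

    fetch   -- hypothetical callable that takes one proxy and returns a result
    proxies -- non-empty list of proxy identifiers (format is an assumption)
    sleep   -- injectable so the delay can be stubbed out in tests
    """
    if not proxies:
        raise ValueError("proxies must not be empty")

    rotation = cycle(proxies)  # loops over the list indefinitely
    last_error = None

    for attempt in range(max_attempts):
        proxy = next(rotation)
        try:
            return fetch(proxy)
        except Exception as exc:  # broad on purpose: any failure triggers a retry
            last_error = exc
            if attempt < max_attempts - 1:
                # exponential backoff: base_delay, 2x, 4x, ...
                sleep(base_delay * 2 ** attempt)

    raise RuntimeError(f"all {max_attempts} attempts failed") from last_error
```

With the loop in one place, the calling code only needs something like `page = fetch_with_retry(load_page, proxy_list)`, and a failure after the last attempt still carries the original exception as its cause.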

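Anyway, in the end I got it all downloaded: 妖怪名单!!! Over four hundred chapters in total (a manga I used to love, then gave up on once it started charging; now I've made my comeback~ the almighty internet).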
