crawl_vip_paid_manhua

On a whim, I wanted to see whether a crawler could crack the download of paid manga chapters.


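Update: no leads at all on the paid chapters. I can't work out how the url parameters are constructed (the encrypted parameters show no visible pattern, and I can't tell whether it's md5 or Tencent's own encryption scheme...), and selenium didn't get me anywhere either... After a full day of staring at it my head is spinning... putting this on hold for now.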


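Update:

After most of a day, I figured out two things:
1 Don't take on the big companies, especially the internet giants led by BAT (Baidu, Alibaba, Tencent); their front-end engineers are no pushovers...
2 No need to be stubborn about it; scraping a different manga site works just as well...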


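After switching sites, I found that this one doesn't use ajax to render its content either, yet for some reason the response I get back still differs from the page source I see in the browser...
So the only option was selenium, and then, asking for trouble, I tried multiprocessing to make the crawl faster...
Then all sorts of things went wrong:
* The IP proxy pool was built from scraped free IPs, which turned out to be wildly unstable... roughly one error every half minute of crawling
* I wasn't comfortable with the retry module, and while debugging I found that the more I wrote, the more confusing the logic got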
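
To make the retry trouble above concrete, here is a minimal sketch of retrying a fetch while rotating through an unreliable proxy list. It is not the code from this repo: it uses a hand-written loop from the standard library instead of the retry module, and the names `fetch_with_retry`, `flaky_fetch`, and the proxy addresses are all hypothetical placeholders.

```python
import itertools
import time


def fetch_with_retry(fetch, proxies, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Call fetch(proxy) until it succeeds, switching proxy on every attempt.

    fetch is any callable that takes a proxy string, returns the page
    content, and raises an exception on failure (assumed interface).
    """
    if not proxies:
        raise ValueError("proxies must not be empty")
    proxy_cycle = itertools.cycle(proxies)
    last_error = None
    for attempt in range(max_attempts):
        proxy = next(proxy_cycle)
        try:
            return fetch(proxy)
        except Exception as exc:  # broad on purpose: free proxies fail in many ways
            last_error = exc
            if attempt < max_attempts - 1:
                # Exponential backoff between attempts: 0.5s, 1s, 2s, ...
                sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"all {max_attempts} attempts failed") from last_error


def flaky_fetch(proxy):
    # Hypothetical stand-in for a selenium or HTTP request sent through `proxy`.
    if proxy != "10.0.0.3:8080":
        raise ConnectionError(f"proxy {proxy} is dead")
    return f"page fetched via {proxy}"


# Demo with made-up proxy addresses; sleep is replaced by a no-op so it runs instantly.
result = fetch_with_retry(
    flaky_fetch,
    ["10.0.0.1:8080", "10.0.0.2:8080", "10.0.0.3:8080"],
    sleep=lambda seconds: None,
)
print(result)
```

Keeping the retry loop in one small function, separate from the page-fetching code, is one way to stop the logic from getting tangled: the fetch function only has to raise on failure, and the loop decides when to switch proxy or give up.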


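Anyway, in the end I got it all downloaded: 妖怪名单!!! Over four hundred chapters in total (a manga I used to love and then gave up on once it went paid; but now I'm back~ thanks to the almighty internet)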
pic1 pic2

Contributors

hell-to-heaven
