Giter VIP home page Giter VIP logo

crawler's Introduction

read stars forks issues

Crawler

有感而发写的爬虫脚本。欢迎 start 和 提 issues 呀

目录

  • dongchedi 抓取懂车帝下所有汽车品牌、汽车类型、汽车图片。并支持下载。

  • jianshu 抓取简述30日热搜文章数据,存入mysql,用作数据可视化。在特定文章下刷一些评论。

  • lagou 抓取拉勾教育专栏文章&视频(必须是已付费的)

  • mm131 抓取xinggan分类下的所有图片。支持并行下载。(挂了)

  • movie 从一些资源站搜索并抓取影视的下载连接,部分支持在线观看。目前收录15个资源站。

结果展示

iShot2021-08-07 10.47.48.png

l2Tu6S.jpg

crawler's People

Contributors

dependabot[bot] avatar tf2jaguar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

crawler's Issues

mm131抓取未考虑ID在4位数的情况

源代码,直接取的4位,如果出现3位,2位,就会出现字符
latest_id = hrefs[index][-9:-5]

用/分割,取倒数第一个即可
latest_id = hrefs[index][-9:-5].split("/")[-1]

懒得推送了..直接写出来了

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.