
Comments (2)

HEUDavid commented on May 18, 2024

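Hahaha, someone is actually using this, wow. The time-formatting function get_datetime(self, s) threw an exception, and you're right that it doesn't affect usage. When I wrote the crawler I didn't account for data from before 2019, so timestamps in that format aren't handled.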
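For anyone hitting the same exception, here is a minimal sketch of a more tolerant parser. It is not the repository's get_datetime: the function name parse_datetime and the two formats in CANDIDATE_FORMATS are assumptions made for illustration, since the exact pre-2019 timestamp layout isn't shown in this thread. The idea is simply to try several layouts in turn and return None instead of raising.

```python
from datetime import datetime

# Hypothetical timestamp layouts. The real formats the crawler meets are not
# given in this thread, so adjust this tuple to match the data you see.
CANDIDATE_FORMATS = (
    "%Y年%m月%d日 %H:%M",
    "%Y-%m-%d %H:%M",
)


def parse_datetime(s, formats=CANDIDATE_FORMATS):
    """Return a datetime for the first format that matches, else None."""
    text = s.strip()
    for fmt in formats:
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            # This layout did not match; try the next one.
            continue
    # No layout matched: signal failure without raising.
    return None
```

Returning None lets the caller decide whether to skip the post or log the unparsed string, so one odd timestamp doesn't interrupt the crawl.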

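Crawling detailed data about the people who post is possible too. The crawler currently collects each poster's home page URL, and the numeric part of that URL is the user's id. You can take that id and crawl that user's data separately: region, age, gender, and other user attributes. I also have an idea to crawl all of a user's posts in order to tag them.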
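As a rough sketch of the id extraction described above (not code from the repository): the helper name extract_user_id is hypothetical, and it assumes the id appears as an all-digit path segment of the home page URL. The URLs in the usage lines are made up for illustration.

```python
import re
from urllib.parse import urlparse


def extract_user_id(profile_url):
    """Return the first all-digit path segment of the URL, or None."""
    path = urlparse(profile_url).path
    for segment in path.split("/"):
        # Assumption: the user id is a path segment made only of ASCII digits.
        if re.fullmatch(r"[0-9]+", segment):
            return segment
    return None


# Illustrative URLs only; real home page URLs may be shaped differently.
print(extract_user_id("https://weibo.com/u/1234567890?refer=x"))
print(extract_user_id("https://weibo.com/someone"))
```

The id is kept as a string so it can be dropped straight into another request URL without losing leading zeros.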

The crawler implemented so far is keyword-based, and it can be restricted to posts from a particular region.

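Well, this is part of my graduation project, and the rest of the project is text analysis. The remaining crawler pieces will have to wait until I have time to write them.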

from weibospider.

lovesnacks commented on May 18, 2024

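Haha, yes!! It's super useful!! This is from a total beginner who can't code but was told by a teacher to crawl Weibo~~ Thanks again.
I'll go look for code that crawls users~~~ Is there code for the text analysis, hehe? Could you upload it if it's convenient?
Good luck with your graduation!!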

