
Comments (12)

687766616e commented on May 23, 2024

Suggestion for FOFA export:
Export format: {http://|https://}(optional){ip|hostname}{:port}(optional)
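
A minimal sketch of how the export format suggested above could be parsed and written back out. The names `TARGET_RE`, `parse_target`, and `format_target` are hypothetical and are not part of the project or of FOFA; the pattern assumes IPv4 addresses or plain hostnames only (no IPv6 literals).

```python
import re
from typing import Optional

# Hypothetical pattern for the suggested format:
# optional scheme, then an IP or hostname, then an optional :port.
TARGET_RE = re.compile(
    r"^(?:(?P<scheme>https?)://)?"
    r"(?P<host>[A-Za-z0-9.-]+)"
    r"(?::(?P<port>\d{1,5}))?$"
)


def parse_target(line: str) -> Optional[dict]:
    """Split one exported line into scheme, host, and port; None if it does not fit."""
    match = TARGET_RE.match(line.strip())
    if match is None:
        return None
    port = match.group("port")
    if port is not None and not 0 < int(port) <= 65535:
        return None  # reject port numbers outside the valid range
    return {
        "scheme": match.group("scheme"),
        "host": match.group("host"),
        "port": int(port) if port is not None else None,
    }


def format_target(host: str, scheme: Optional[str] = None, port: Optional[int] = None) -> str:
    """Build one export line, omitting the optional parts that are not set."""
    prefix = f"{scheme}://" if scheme else ""
    suffix = f":{port}" if port is not None else ""
    return f"{prefix}{host}{suffix}"
```

For example, `parse_target("https://example.com:8443")` yields the three parts separately, and `format_target("example.com", port=80)` writes a line without a scheme.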

from ecommercecrawlers.

Hatcat123 commented on May 23, 2024

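Hello, we currently need to search official account articles by keyword, as Sogou WeChat does, and get each article's title, link, and official account name. Does your team know of a technical approach that could achieve this? And why are Sogou WeChat's search results always incomplete?

Sogou WeChat search only covers part of the data, so you cannot get everything from it. We started with the same idea as you but abandoned it in practice. Our approach is to periodically monitor and collect the articles of every official account in a given field, and then run the search over what we have collected. http://wechat.doonsec.com
Of course, this still cannot return 100% of the matches for a keyword. It only guarantees coverage within the set of official accounts I monitor. The only way around that would be access to Tencent's WeChat database, which does not look possible.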
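
A minimal sketch of the "collect first, search afterwards" idea described above, assuming the monitored articles have already been collected into memory. The `Article` record, the `search_articles` function, and the sample rows are all hypothetical placeholders; the collection step itself is not shown, and this is not the project's actual implementation.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Article:
    account: str  # official account name
    title: str
    url: str


def search_articles(articles: Iterable[Article], keyword: str) -> List[Article]:
    """Return the collected articles whose title contains the keyword (case-insensitive)."""
    needle = keyword.casefold()
    return [a for a in articles if needle in a.title.casefold()]


# Made-up sample data standing in for articles gathered from monitored accounts.
SAMPLE = [
    Article("Account A", "Web Crawler Basics", "https://example.com/a1"),
    Article("Account B", "Cooking Notes", "https://example.com/b1"),
    Article("Account A", "Anti-crawler Techniques", "https://example.com/a2"),
]

hits = search_articles(SAMPLE, "crawler")
```

As in the comment above, the results can only ever include articles from accounts that were monitored; anything outside that set is invisible to the search.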


ggann9527 commented on May 23, 2024

DianpingCrawler/dazhong.py
Line 51: 'res = requests.get(self._cur_request_css_url, headers=self._default_headers, cookies=self._cookies)'
should be 'res = requests.get(url, headers=self._default_headers, cookies=self._cookies)'


Hatcat123 commented on May 23, 2024

DianpingCrawler/dazhong.py
Line 51: 'res = requests.get(self._cur_request_css_url, headers=self._default_headers, cookies=self._cookies)'
should be 'res = requests.get(url, headers=self._default_headers, cookies=self._cookies)'

OK, your feedback is really useful. You are also welcome to submit a PR.


687766616e commented on May 23, 2024

DianpingCrawler/dazhong.py
Line 51: 'res = requests.get(self._cur_request_css_url, headers=self._default_headers, cookies=self._cookies)'
should be 'res = requests.get(url, headers=self._default_headers, cookies=self._cookies)'

OK, your feedback is really useful. You are also welcome to submit a PR.

Google Translate? It reads a bit odd... 😅


Hatcat123 commented on May 23, 2024

Add a movie review analysis crawler.


gaowei1012 commented on May 23, 2024

Bro, scrutiny of web crawlers has been tight lately. Be careful, be careful.


TianmaBifang commented on May 23, 2024

Please document the runtime environment, the dependencies used, and the usage instructions clearly.


Hatcat123 commented on May 23, 2024

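This has been added to the Projects plan. There are quite a few projects, and we are in the process of organizing and improving them.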


wangx404 commented on May 23, 2024

The UI could just as well be written with PyQt, which is easier to work with than Tk.


QuentinDan commented on May 23, 2024

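Hello, we currently need to search official account articles by keyword, as Sogou WeChat does, and get each article's title, link, and official account name. Does your team know of a technical approach that could achieve this? And why are Sogou WeChat's search results always incomplete?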


siyu1992 commented on May 23, 2024

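Hello, regarding the Anjuke agent information crawler: the website now seems to require scanning a QR code that redirects to a mini program before an agent's phone number can be obtained. Is this something that can be handled? Many thanks.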
