18520339 / facebook-data-extraction Goto Github PK

Experience for effectively fetching Facebook data by Querying Graph API with Account-based Token and Operating undetectable scraping Bots to extract Client/Server-side Rendered content

Home Page: https://www.youtube.com/watch?v=Q4oAsz__e_M

License: MIT License

Python 65.98% JavaScript 34.02%

facebook facebook-graph-api proxy browser-fingerprinting scraping crawling automation selenium tor-network

facebook-data-extraction's Issues

This script doesn't work on FB pages with new layout

Hello!
Thanks for your script! But I found it only works on FB pages with the old layout, and it doesn't work on the new version.
Maybe because the parameter of CSS is different?
Can you help me with it?
Thanks!

friends list data

thanks for the great repo , kindly it's possible to extract friends data

best regards

Proxy List Exception

Hi, thanks for sharing this great project. While I am trying to test it using Tor browser configuration, and setting:
USE_PROXY = False in crawler.py file. I keep getting the following error:

raise ProxyListException("list is empty") http_request_randomizer.requests.errors.ProxyListException.ProxyListException: list is empty
in the requestProxy.py file. Anything that probably went wrong?
Also, is there any tutorial that covers the configuration when using Tor as a browser to collect data?

Thanks

In posts with 1 comment , this comment is not crawled.
In posts with 2 comments, only 1 comment is crawled.
In posts with 48 comments, only 2 comments are crawled.

I have tried increasing VIEW_MORE_CMTS number but it seemingly doesn't change anything.
Is anyone else experiencing this?

18520339 / facebook-data-extraction Goto Github PK

facebook-data-extraction's Issues

This script doesn't work on FB pages with new layout

friends list data

Proxy List Exception

browser redirect to the login page

Không lấy được dữ liệu bài viết nhóm

Comments are partially crawled

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent