Scrap facebook live gaming page and extracts live users and push them to faktory worker to parse their detail(name, uid, username, number of follower, number of likes, contact details - email, social links) and their posts(post-id, text, datetime, hashtags, links, images) and finally save them in MongoDB.
- Install faktory server.
- Install MongoDB and create database
aggero_fband two collectionuser_detailsandposts. - pip install -r requirements.txt
- Change
URL_FACTORYpassword inutils.pyfile.
-
Navigate to
mainpackage. -
First run
python3 consumer.py -
Then
python3 producer.py -nup 10 -nps 3. You must need to provide-nupand-nps-nup- Number of live users to parse- Choices :
intvalue greater than 0 orstrvalueall
- Choices :
-nps- Number of scroll while parsing user posts. Every scroll gives about 18 posts.- Choices :
intvalue greater than 0 orstrvalueall
- Choices :
-nup 10 -nps 3works best for testing,allmay be used in production.
-
Scraping should be started now.
- Configure Tor for proxy.
- Build a error database and send daily error report email to admin.
- By default I am running one worker process but you can modify it in
consumer.pyfile as number of cores in your pc. - Create index for MongoDB.
- Write tests.