GitHunt

About

This project is a Scrapy crawler for nairaland.
The spider crawls nairaland users, topics and comments and stores them into a mongo db.
The current structure is :

Topic
=====
PageID
View Count

User
====
Sex

Comment
=======
Text
Timestamp
Attachments
Quoted Text         

Usage

  1. Install requirements

pip install -r requirements.txt
  1. Start the crawler
scrapy runspider nairaland_crawler

Note : You should edit the mongo details in nailand/settings.py if yours are different.

Languages

Python100.0%

Contributors

Created November 26, 2019
Updated July 16, 2021