Skip to content

abhi267266/book-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scrapy Web Crawler

This is a web crawling project built using the Scrapy framework in Python.

Overview

This project contains a spider crawls through the /books.toscrape.com and collect the books data from all the page of the website, also as it scraps I have created middleware and pipeline so it can directly get stored in mysql database or run

scrapy crawl booksider -O to-the-file.json or .csv

Getting Started

Prerequisites

  • Python 3.6 or higher
  • Scrapy (pip install scrapy)

Installing

Clone this repository:

git clone https://github.com/yourname/scrapy-crawler.git
cd scrapy-crawler

About

this is a book scraper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages