This is a web crawling project built using the Scrapy framework in Python.
This project contains a spider that crawls books.toscrape.com and collects book data from every page of the site. A custom middleware and item pipeline are also included, so the scraped items can be written directly to a MySQL database as the spider runs, or exported to a file with `scrapy crawl booksider -O <file>.json` (or `.csv`).
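The pipeline code is not shown here, but as a rough sketch of what a MySQL item pipeline can look like (the connection settings, table name, and item fields below are illustrative assumptions, not this project's actual schema), something along these lines would be registered under `ITEM_PIPELINES` in `settings.py`:

```python
# Minimal sketch of a MySQL item pipeline. Requires `pip install pymysql`.
# All names below (database, table, columns) are placeholders for illustration.
import pymysql


class MySQLPipeline:
    def open_spider(self, spider):
        # Connection parameters are placeholders; in practice read them from settings.py.
        self.conn = pymysql.connect(
            host="localhost", user="root", password="password", database="books"
        )
        self.cursor = self.conn.cursor()
        # Create the target table if it does not exist yet (assumed columns).
        self.cursor.execute(
            "CREATE TABLE IF NOT EXISTS books ("
            "  title VARCHAR(255),"
            "  price VARCHAR(32),"
            "  availability VARCHAR(64)"
            ")"
        )

    def process_item(self, item, spider):
        # Insert one row per scraped item.
        self.cursor.execute(
            "INSERT INTO books (title, price, availability) VALUES (%s, %s, %s)",
            (item.get("title"), item.get("price"), item.get("availability")),
        )
        self.conn.commit()
        return item

    def close_spider(self, spider):
        self.conn.close()
```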
- Python 3.6 or higher
- Scrapy (`pip install scrapy`)
Clone this repository:

    git clone https://github.com/yourname/scrapy-crawler.git
    cd scrapy-crawler
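After cloning, install Scrapy and run the spider from the project directory; for example (the output filename is only an illustration):

```console
pip install scrapy
scrapy crawl booksider -O books.json   # use a .csv extension for CSV output
```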