An extremely fast script for scraping file hosting site catbox.moe for any file extension.

LolcatsV2/catbox-scraper

Catbox Scraper

An extremely fast Python script for scraping and downloading random files from Catbox, a file-hosting site.

The script reads the file extensions listed in config.yaml, generates random URLs, and checks whether each one is valid. Valid files are downloaded and stored in folders sorted by extension.
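As a rough illustration of the check-and-store step described above, here is a minimal sketch using only the standard library. It is not taken from main.py: the function names, the HEAD-request check, and the `downloads` root folder are all assumptions made for this example.

```python
import urllib.request
from pathlib import Path


def url_exists(url: str, timeout: float = 5.0) -> bool:
    """Return True if a HEAD request to `url` answers with HTTP 200.

    Assumption: a HEAD request is enough to tell whether a file exists.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers HTTP errors (e.g. 404), connection failures, and timeouts.
        return False


def destination_for(url: str, root: Path = Path("downloads")) -> Path:
    """Map a file URL to <root>/<extension>/<filename> (hypothetical layout)."""
    filename = url.rsplit("/", 1)[-1]
    extension = filename.rsplit(".", 1)[-1].lower()
    return root / extension / filename


def download(url: str, root: Path = Path("downloads")) -> Path:
    """Fetch `url` and write it into its extension-sorted folder."""
    target = destination_for(url, root)
    target.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response:
        target.write_bytes(response.read())
    return target
```

With this layout, a hit on a `.png` URL would end up under `downloads/png/`, a `.webm` under `downloads/webm/`, and so on.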

https://files.catbox.moe/[a-z0-9]{6}.(extension) is the format for URL generation.
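The URL format above can be sketched in a few lines of Python. This is only an illustration of the pattern, not the script's actual code; `random_url`, `ALPHABET`, and `URL_PATTERN` are names invented for the example.

```python
import random
import re
import string

# The 36 characters allowed in the six-character file name: a-z and 0-9.
ALPHABET = string.ascii_lowercase + string.digits
BASE_URL = "https://files.catbox.moe/"

# Regex form of the documented format, useful for sanity checks.
URL_PATTERN = re.compile(r"https://files\.catbox\.moe/[a-z0-9]{6}\.\w+")


def random_url(extension, rng=random):
    """Build one candidate URL with a random six-character name."""
    name = "".join(rng.choices(ALPHABET, k=6))
    return f"{BASE_URL}{name}.{extension}"


if __name__ == "__main__":
    for ext in ("png", "mp4"):
        print(random_url(ext))
```

With 36 possible characters in each of six positions, there are 36^6 = 2,176,782,336 candidate names per extension, so most generated URLs should be expected to miss.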

Demonstration

demonstration.mp4

Installation and Usage

You will need:

  • Python
  • Git

Then run:

git clone https://github.com/dootss/catbox-scraper.git
cd catbox-scraper
pip install -r requirements.txt
python main.py

The script will handle everything else from there!

Press CTRL+C to stop the script.

Configuration

To change which extensions the script checks, edit the file_extensions field in config.yaml:

image

By default, the script checks for the following extensions: png, gif, jpg, jpeg, webm, mp4.
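The exact layout of config.yaml is not reproduced here, so the sketch below does not read the file. It only shows one way a list taken from the file_extensions field might be cleaned up before use, falling back to the defaults listed above. `normalize_extensions` and `DEFAULT_EXTENSIONS` are hypothetical names, not part of the script.

```python
# Defaults as listed in this README.
DEFAULT_EXTENSIONS = ["png", "gif", "jpg", "jpeg", "webm", "mp4"]


def normalize_extensions(raw):
    """Clean a list of extensions: trim, drop leading dots, lowercase, dedupe.

    Falls back to the defaults when nothing usable is supplied.
    """
    cleaned = []
    for item in raw or []:
        ext = str(item).strip().lstrip(".").lower()
        if ext and ext not in cleaned:
            cleaned.append(ext)
    return cleaned or list(DEFAULT_EXTENSIONS)


if __name__ == "__main__":
    print(normalize_extensions([".PNG", "png", " mp4 "]))
```

Normalizing this way means entries such as `.PNG` and `png` are treated as the same extension, which keeps the extension-sorted folders from being duplicated.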

NOTICE

I am not responsible for any consequences that come from using this script! Catbox is a file hosting site, and files found on it can be unpredictable. You'll definitely find a LOT of NSFW images as a result of running this; Catbox is used a fair amount by anonymous communities like 4chan.
