PROXY SCRAPPER

Proxy Scrapper scrapes proxies from ~40 sources and checks them (HTTP only). The code is rough, and help fixing it is welcome.

Download

  • HTTP(s)

    # Original
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/http.txt
    # Duplicates Removed
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/http-removed.txt
  • SOCKS4

    # Original
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/socks4.txt
    # Duplicates Removed
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/socks4-removed.txt
  • SOCKS5

    # Original
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/socks5.txt
    # Duplicates Removed
    wget https://raw.githubusercontent.com/ReCaree/proxy-scrapper/master/proxy/socks5-removed.txt
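
If you download an original list and want to de-duplicate it yourself, something like the sketch below should work. It assumes each line of the file is a single `host:port` entry, which this README does not state. `parse_proxies` is a hypothetical helper, not a function from `scrapper.py`, and the addresses are placeholders.

```python
def parse_proxies(text):
    """Return unique 'host:port' entries in first-seen order.

    Assumes one proxy per line; lines that do not look like
    host:port are skipped.
    """
    seen = set()
    result = []
    for line in text.splitlines():
        entry = line.strip()
        if not entry or entry in seen:
            continue
        host, sep, port = entry.rpartition(":")
        if not sep or not host or not port.isdigit():
            continue  # not a host:port line
        seen.add(entry)
        result.append(entry)
    return result


# Placeholder data (documentation-only IP ranges), not real proxies.
sample = "192.0.2.10:8080\n198.51.100.7:3128\n192.0.2.10:8080\n\nnot-a-proxy\n"
unique = parse_proxies(sample)
print(unique)
```

For a downloaded file, pass its contents instead of `sample`, for example `parse_proxies(open("http.txt").read())`.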

Setup

  • Clone this repository and install the requirements with:

    pip install -r requirements.txt
  • Run the scrapper.

    python scrapper.py

Todo

  • Add SOCKS4/5 checker
  • Better multithreading
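
As a starting point for the multithreading item, here is a minimal sketch built on the standard library's `concurrent.futures.ThreadPoolExecutor`. It is not how `scrapper.py` works today. `check_all` and `tcp_reachable` are hypothetical names, and `tcp_reachable` only tests that the port accepts a TCP connection, which is weaker than a real HTTP or SOCKS proxy check.

```python
import socket
from concurrent.futures import ThreadPoolExecutor


def tcp_reachable(proxy, timeout=3.0):
    """Return True if a TCP connection to 'host:port' succeeds.

    This is only a reachability test, not a proxy protocol check.
    """
    host, _, port = proxy.rpartition(":")
    try:
        with socket.create_connection((host, int(port)), timeout=timeout):
            return True
    except (OSError, ValueError):
        return False


def check_all(proxies, check=tcp_reachable, max_workers=50):
    """Run `check` on every proxy in a thread pool.

    Returns the proxies for which `check` returned a truthy value,
    in their original order.
    """
    proxies = list(proxies)
    if not proxies:
        return []
    workers = min(max_workers, len(proxies))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(check, proxies)  # preserves input order
        return [p for p, ok in zip(proxies, results) if ok]
```

Passing the checker as a parameter means a SOCKS4/5 checker could be plugged in later without touching the pool logic.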

Contributing

Feel free to contribute. Fixes and new proxy sources are welcome.