Auto Scraper

Usage

Edit scraper-config.json to suit your target site, following this format:

{
    "baseUrl": <target website>,
    "groups": {
        <group-name>:{
            "urls":[
                <sub-url of page>
            ],
            "selector": <jQuery selector for the outermost element of each item>,
            "outputFile": <output file to create (must be a comma-delimited CSV)>,
            "scrape": {
                <field name in camel case>: {
                    "selector": <jQuery selector, relative to the parent selector>,
                    "name": <column header to put in the top row of the CSV>,
                    "options": {
                        "attribute": <attribute to read from the HTML element, e.g. href>,
                        "prependBaseUrl": <boolean; if true, prepend the base URL to the value>,
                        "excludeSiblings": <boolean; if true, take only the element's own text nodes and ignore its child elements>
                    }
                }
                ... other field names you want ...
            }
        },
        ... other groups you want ...
    }
}
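
To make the format above more concrete, here is a minimal sketch of a filled-in config, written as a JavaScript object so it can be checked in Node. Everything in it is hypothetical: the site (example.com), the group name, the selectors, the field names, and the three helper functions (headerRow, pageUrls, applyBaseUrl) are invented for illustration and are not taken from scraper.js. The helpers only show one plausible reading of how "name", "urls", and "prependBaseUrl" are meant to be used.

```javascript
// Hypothetical configuration: the site, URLs, selectors, and field names
// below are invented for illustration and do not come from the repo.
const exampleConfig = {
  baseUrl: "https://example.com",
  groups: {
    cards: {
      urls: ["/cards?page=1", "/cards?page=2"],
      selector: ".product",
      outputFile: "cards.csv",
      scrape: {
        productName: {
          selector: ".title",
          name: "Product Name",
          options: { excludeSiblings: true }
        },
        productLink: {
          selector: "a.details",
          name: "Link",
          options: { attribute: "href", prependBaseUrl: true }
        }
      }
    }
  }
};

// Build the CSV header row for one group from each field's "name".
function headerRow(group) {
  return Object.values(group.scrape).map((field) => field.name).join(",");
}

// List the full page URLs a group would visit
// (assumption: each sub-URL is appended to baseUrl).
function pageUrls(config, groupName) {
  return config.groups[groupName].urls.map((subUrl) => config.baseUrl + subUrl);
}

// Apply the prependBaseUrl option to a raw scraped value
// (assumption: the base URL is simply concatenated in front).
function applyBaseUrl(config, field, rawValue) {
  const options = field.options || {};
  return options.prependBaseUrl ? config.baseUrl + rawValue : rawValue;
}
```

With this sketch, headerRow(exampleConfig.groups.cards) gives the top row that the "name" fields describe, and applyBaseUrl turns a relative href such as /item/42 into a full link only for fields that set "prependBaseUrl".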

The repo includes an example that scrapes J&J Cards and Collectibles in Waterloo.

License

MIT. Go nuts.

tm2josep/autoScraper
