The library is composed by different collaborators, each with its specific responsibility. Each class tries to expose a minimal public APIs in the form of __call__ or __iter__ methods (when generators are used).
The classes are aimed to work with one image at time, in case you need to transform and augment multiple images, avoid creating multiple instances of the classes, just change the argument of the __call__ function (but for Persister, which need a new instance and/or instance attribute modification).

Labeller

The target label is extracted directly by inspecting the image name and trying to extract meaningful information (customisable).

lbl = Labeller(digits=10)

lbl('resources/bag.png')
'bag'

lbl('resources/109-602-3906-001-c-suit-veletta-albino.jpg')
'1096023906'

Normalizer

The images are normalized by:

resizing them to the specified max size (default to 256 pixels)
optionally applying a squared, transparent/backgound canvas and centering the image on it, thus avoiding any deformation

norm = Normalizer(size=128, canvas=True)
img = norm('resources/bag.png')
img.shape
(128, 128, 4)

Augmenter

The number of images is augmented by two orders of magnitude (depending on the cutoff float attribute) by applying different transformations to the original one.
Transformations are applied by using generators, thus saving memory consumption.

aug = Augmenter(cutoff=.5)
aug('resources/bag.png')
<generator object Augmenter.__call__ at 0x125354480>

Persister

Images are persisted upon normalization and augmentation, by specifying an action function that accepts the name of the file (original basename suffixed by an index) and a BytesIO object containing the image data stream.
The persister supports both a filename path and, optionally, a stream-like object (in case the file is not yet persisted to disk).
The persister supports iteration by yielding the image label and the function return value (typically the saved path), allowing to generate CSV files specific to cloud platforms (i.e. Google Vision APIs).

def persist(name, stream):
    filename = f'temp/{name}'
    with open(filename, 'wb') as f:
        f.write(stream.getvalue())
    return filename

pers = Persister('resources/skirt.jpg', action=perist)
for label, filename in pers:
    print(label, filename)

Zipper

In case you need an archive with each normalised augmentations within the recognised label subfolder, you can rely on the Zipper interface: it creates a ZIP file on current path, by scanning the specified folder for PNG or JPG images.

zipper = Zipper('.resources/', normalizer=image.Normalizer(16), augmenter=image.Augmenter(.05))
zipper()

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
imgaug		imgaug
resources		resources
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents

Scope

Setup

Versions

Virtualenv

Installation

Tests

APIs

Labeller

Normalizer

Augmenter

Persister

Zipper

About

Releases

Packages

Languages

License

costajob/image_augmenter

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Scope

Setup

Versions

Virtualenv

Installation

Tests

APIs

Labeller

Normalizer

Augmenter

Persister

Zipper

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages