Skip to content

This repository includes pneumonia detection on Chest X-ray Images by using Deep Learning(Keras).

License

Notifications You must be signed in to change notification settings

edaaydinea/Pneumonia-Detection-on-Chest-Xray-Images-with-Deep-Leaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pneumonia Detection on Chest X-ray Images with Deep Learning (Keras)

This repository includes the slides and coding parts for this project.

This project is carried out by Eda AYDIN, Nesibe Betül DÖNER, Okan TOPAL, Berkay DİRİL under the supervision of Engin Deniz ALPMAN in the Data Science for the Public Good program

The dataset of this project is obtained from the Kaggle - Chest X-ray Images(Pneumonia

Note: The data sets to be used in the project comply with the health-ethical rules and are suitable for use as a license.

Notevook : Link

A. BUSINESS UNDERSTANDING

Pneumonia Challenges to WaitinG Solve

  • Lungs are the most affected organs of COVID-19
  • Early Detection for high results in the treatment of the disease

Our Goal

  • Pinpoint detection of pneumonia disease from chest X-ray images

Context

Example of Chest X-rays in patients

The normal chest X-ray(left panel) depicts clear lungs without any areas of abnormal opacification in the image. Bacterical pneumonia (middle panel) typically exhibits a focal lobar consolidation, in this case in the right upper lobe (white arrows), whereas viral pneumonia(right panel) manifets with a more diffuse "interstitial" pattern in both lungs.

B. DATA UNDERSTANDING

Collectiong of the Raw Data

The dataset is organized into 3 folders (train, test, val) and contains subfolders of each image category (Pneumonia / Normal). There are 5,863 X-Ray images (JPEG) and 2 categories(Pneumonia/Normal)

Chest X-ray images (anterior-posterior) were selected from retrospective cohorts of pediatric patients of one to five years old from Guangzhou Women and Children’s Medical Center, Guangzhou. All chest X-ray imaging was performed as part of patients’ routine clinical care.

For the analysis of chest x-ray images, all chest radiographs were initially screened for quality control by removing all low quality or unreadable scans. The diagnoses for the images were then graded by two expert physicians before being cleared for training the AI system. In order to account for any grading errors, the evaluation set was also checked by a third expert.

Changing Environment

  • Used Tensorflow to change environment from CPU to GPU for working fastly in modelling part.

Data Loading

  • Directories were taken from different folders for each image files to be used in the visualization, augmentation and modeling phases.

Labelling

  • 0 and 1 labels were added to work in the modelling phase to the pictures determined as pneumonia and normal.

Data Visualization

Normal / Pneumonia Image Visualization fro Train Dataset

normal-pneumonia-images

Plotting the Distribution with Bar and Pie Chart)

Bar Chart Pie Chart
distribution-with-bar distribution-with-pie

Data Preparation

Changing the each image size to prefered size

  • Changed the each image size to 150x150

Grayscale Normalization

  • A grayscale normalization was performed to reduce the effect of lighting differences.
  • The data were normalized from 0–1 by dividing the train, validation, and test data by 255.

Reshaping the data for deep learning model

  • Each image size was reshaped from -1 to 1 to use the images as NumPy arrays in the deep learning model.

C. DATA AUGMENTATION

data-augmentation

  • Techniques applied as data augmentation for train dataset:
    • Randomly rotate some training images by 30 degrees
    • Randomly Zoom by 20% some training image
    • Randomly shift images horizontally by 10% of the width
    • Randomly shift images vertically by 10% of the height
    • Randomly flip image horizontally

D. DATA MODELLING

  • The CNN model was built by using Keras and the accuracy result on test data is : 95.27%
  • ReduceLRonPlateau was used in the model to reduce the learning rate when the model stopped improving the metric.

E. EVALUATION

Accuracy and Loss Graphs accuracy-loss-graphs

Some of the Correctly Predicted Classes

correct-image

Some of the Inccorectly Predicted Classes

false-images

G. RESOURCES

Notebook Resources

Website Resources

Releases

No releases published

Packages

No packages published