trying to build a method to identify diseases by the way they look in xrays

going about this project, first thing i did was setup my env using source venv/bin/activate
once my env is setup, i installed the requirements i decided on earlier pip install -r requirements.txt
since all that is done, i setup the method to commit my progress as it comes
then i mapped the directory for the project so i will start creating the subfolders for the project
having done that, i want to use an existing medical imaging dataset i found on kaggle. so let me use the pneumonia detection dataset first. intend to use this chest xray images for pneumonia: https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia
currently waiting for the download, so i can place the data in the data/train, data/test and data/val folders of the project. so i can have all sanple images of the identifiers for the diseases ready

i wont lie, this dataset is so large it's 2gb despite compressed to fit.

i already fixed the whole data gathering, fucking 2gb worth of images to use. anyways, i want to write a script for preprocessing the images to help apply, resize and even normalize data augmentation
so i did the function to preprocess the image identified to resize, and read and then return the image. i also did the function to handle data augmentation like the zoom range, horizontal flip, all those image adjustment features. so the data_augmentation() function is to return an ImageDataGenerator instance so as to perform realtime data augmentation while training the model
i want to work on building and training the model. but i am having worries if i should build the CNN model from scratch or i should just transfer learning with a pre-trained model like ResNet or VGG16.
i decided to just transfer learning with a pre-trained model, but i am using vgg16. what i did was simply used the vgg16 model as a base and added custom layers on top for binary classification in src/model_training.py
let me now train the model since i have the model already. will just use the data i imported.
i am done building the model and training the model, i need to visualize the model's performance metrics at this point at least, to decide the accuracy, precision and get to plot it over all training epochs hehe.
wo! let me just write a script to tain and evaluate the model, let me say: train_and_evaluate.py
oh shit! i think i just messed up the whole thing. let me create another model evaluation script. i made wrong twists with the import statements especially the fact that i should not have import y_pred from scipy.special so let me fix that.
since everything wan stress me, i downloaded weights online and put it in weights/vgg16_weights_tf_dim_ordering_tf_kernels_notop.h5. but then, now i need to change the whole base model in the build model function to work with path instead of direct call.
making matters worse, this thing no wan use .h5; it wants .keras. which kind wahala be this bai. it is like i will go and change stuff in my train_model function from checkpoint = ModelCheckpoint('best_model.h5', monitor='val_loss', save_best_only=True) to checkpoint = ModelCheckpoint('best_model.keras', monitor='val_loss', save_best_only=True)
finally o, result don show:

it has been three months, and i have decided to quickly work it out by getting it usable to the public. i just added a src/predict.py script to allow anyone who wants to run tests with scan images tho. it will work out. today is 23/12/2024.

Now, i added a tests/scans folder to put pseudo tests like i am a doctor or lab scientist. so will use that to match test first.
Having done that, i have updated the predict.py file with the link for single scan image, and want to run it python3 src/predict.py. will show the workings.

maybe because i just bought another pc to run my AI expeditions i am having issues with my virtual environment, especially since it is windows and i have setup WSL, but seems broken. currently fixing it fully. once i do, i will push fully again. shouldn't take me past a few minutes.

quickly removed the evn rm -rf venv, and restarted the env python3 -m venv venv and activated it source venv/bin/activate. going to reinstall all dependencies again and follow up. the reason for this journal is because i made this open source, so whoever likes this would see what i did and have been up to.

while i am fixing that, let me create a section on how to use this.

For Everyone's Use

It's worth noting that this project uses a Convolutional Neural Network (CNN) built on the VGG16 architecture for binary image classification, specifically for detecting pneumonia from chest X-rays.

Interesting Things Currently

Pretrained VGG16 as the base model
Custom layers for binary classification
Support for training, evaluation, and prediction

How to Setup

Clone the Repository:

git clone https://github.com/1cbyc/image_classification.git
cd image_classification

Create a Virtual Environment:

python3 -m venv venv
source venv/bin/activate

Install Dependencies:
```
pip install -r requirements.txt
```
Download Pretrained Weights:
- Download the VGG16 weights from this link.
- Place the file in the models/ directory of this project.

Prepare the Dataset:

You should organize your dataset like this:

data/
├── train/
│   ├── class1/
│   └── class2/
├── validation/
│   ├── class1/
│   └── class2/

Training the Model

You should edit the train_and_evaluate.py file to configure the dataset paths:
```
train_data_dir = "data/train"
val_data_dir = "data/validation"
```
Then, run the training script:
```
python train_and_evaluate.py
```
The best model will be saved as models/best_model.keras.

After training, need help using the Model for Prediction?

Using The Model for Prediction

Please make sure the trained model is available:
```
ls models/best_model.keras
```
Then, update the src/predict.py file with the path to your test image:
```
image_path = "tests/scans/IM-0029-0001.jpeg"
```
Run the prediction script:
```
python src/predict.py
```
The output will display the prediction and confidence score:
```
Prediction: Pneumonia (Confidence: 0.95)
```

From where I'm standing, I think I need to make a webpage to allow people just upload scans directly and it will match with what is on the training model. I have no interest in allowing normies have to open the code every damn time.

Looking to create a small landing page in templates to allow file upload and then, will be set to run python3 app.py and hold my fucking peace!

encountered a small issue with my vritual environment again, trying to fix that. it's not skill issues anyways, so we move.

Name		Name	Last commit message	Last commit date
Latest commit History 572 Commits
data		data
notebooks		notebooks
readme-images		readme-images
src		src
tests		tests
weights		weights
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
predict.py		predict.py
train_and_evaluate.py		train_and_evaluate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

trying to build a method to identify diseases by the way they look in xrays

For Everyone's Use

Interesting Things Currently

How to Setup

Training the Model

Using The Model for Prediction

About

Releases

Packages

Languages

License

1cbyc/image_classification

Folders and files

Latest commit

History

Repository files navigation

trying to build a method to identify diseases by the way they look in xrays

For Everyone's Use

Interesting Things Currently

How to Setup

Training the Model

Using The Model for Prediction

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages