Blood Cell Analysis using Machine Learning
[Data Science Final Project]

This project classifies eight types of blood cells from image data. It covers data exploration, preprocessing, a comparison of feature extraction and dimensionality reduction methods (HOG, ViT, PCA, t-SNE), model development (DNNs, CNNs, ResNet, ViT), evaluation, and interpretability analysis.

Live Preview & Project Access

You can access a live preview of the interactive HTML document here:

Via `htmlpreview.github`:

Part 1
Part 2

Via `NBViewer`:

Part 1
Part 2

Note that it may take a few moments for the files to load.

Research Methodology

The project adopts a sequential methodology:

1. Initial data exploration and validation
2. Feature engineering (HOG vs. deep features with ViT)
3. Dimensionality reduction and feature space visualization
4. Baseline modeling with classical ML approaches
5. Deep learning with custom CNN architectures
6. Transfer learning with pre-trained models (ResNet18, ViT)
7. Model interpretability through SHAP analysis

The detailed research process, experimental notebooks, and in-depth analysis are available in the project's notebooks.
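
As an illustration of the dimensionality reduction step, the following is a minimal sketch of PCA computed through an SVD. It is not taken from the project's notebooks: the function name `pca_project` and the random stand-in features are assumptions made for the example, and the project may well rely on a library implementation instead.

```python
import numpy as np


def pca_project(features: np.ndarray, n_components: int = 2) -> np.ndarray:
    """Project row-wise feature vectors onto their top principal components."""
    # Center each feature column so the SVD captures variance, not the mean.
    centered = features - features.mean(axis=0, keepdims=True)
    # Rows of vt are the principal directions, ordered by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


# Synthetic stand-in for extracted features: 200 samples, 64 dimensions.
rng = np.random.default_rng(0)
fake_features = rng.normal(size=(200, 64))
embedding = pca_project(fake_features, n_components=2)
print(embedding.shape)
```

A two-column embedding like this can be passed directly to a scatter plot, colored by class label, to inspect how well a feature space separates the cell types.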

Results

Transfer learning with robust architectures like ResNet18 and ViT proved superior to training simpler models or custom CNNs from scratch, achieving near-perfect classification accuracy by effectively learning discriminative visual patterns. The SHAP analysis confirmed that the models focused on biologically relevant features such as nuclear morphology and cell size.

Core Project Structure

The classification system is implemented under `/src` and combines several deep learning architectures. Its layout is as follows.

`/src/main.py`

Entry point that coordinates all components of the blood cell classification system.

`/src/data/`

preprocessing.py: Functions for image data processing, validation, and preparation.
dataset.py: PyTorch dataset classes and data loading utilities for efficient data handling.
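
To make the validation and preparation step concrete, here is a small sketch of what such a function might look like. The name `validate_and_scale`, the assumption that images arrive as H x W x 3 `uint8` arrays, and the synthetic input are all illustrative and are not taken from `preprocessing.py`.

```python
import numpy as np


def validate_and_scale(image: np.ndarray) -> np.ndarray:
    """Check an RGB uint8 image and return it as float32 in [0, 1]."""
    # Assumed layout: height x width x 3 color channels.
    if image.ndim != 3 or image.shape[2] != 3:
        raise ValueError(f"expected an H x W x 3 array, got shape {image.shape}")
    if image.dtype != np.uint8:
        raise ValueError(f"expected uint8 pixels, got {image.dtype}")
    return image.astype(np.float32) / 255.0


# Synthetic image standing in for a real blood cell photo.
rng = np.random.default_rng(0)
raw = rng.integers(0, 256, size=(28, 28, 3), dtype=np.uint8)
scaled = validate_and_scale(raw)
print(scaled.shape, scaled.dtype)
```

Failing loudly on an unexpected shape or dtype keeps malformed files from silently reaching the training loop.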

`/src/models/`

neural_networks.py: Neural network architecture definitions including SimpleNN, LightCNN, CNNModel, ResNet18, and ViT implementations.
training.py: Training and evaluation functions with support for early stopping and metrics tracking.
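
Early stopping can be sketched as a small helper that counts epochs without improvement. The class below is a hypothetical illustration, not the code in `training.py`; the names `EarlyStopping`, `patience`, and `min_delta` and the loss values are assumptions chosen for the example.

```python
class EarlyStopping:
    """Signal when a monitored validation loss has stopped improving."""

    def __init__(self, patience: int = 3, min_delta: float = 0.0):
        self.patience = patience      # epochs to wait after the last improvement
        self.min_delta = min_delta    # smallest decrease that counts as improvement
        self.best_loss = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss: float) -> bool:
        """Record one epoch's loss; return True when training should stop."""
        if val_loss < self.best_loss - self.min_delta:
            self.best_loss = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


# Made-up validation losses, used only to exercise the logic.
losses = [0.90, 0.70, 0.65, 0.66, 0.67, 0.68, 0.50]
stopper = EarlyStopping(patience=3)
stopped_at = None
for epoch, loss in enumerate(losses):
    if stopper.step(loss):
        stopped_at = epoch
        break
print(stopped_at, stopper.best_loss)
```

With a patience of 3, the loop halts after three consecutive epochs without improvement, so the later, lower loss in the list is never reached. That is the usual trade-off of the technique: a larger patience tolerates plateaus at the cost of extra training time.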

`/src/features/`

extraction.py: Feature extraction methods including HOG and deep learning approaches using pre-trained vision transformers.
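
For readers unfamiliar with HOG, the idea can be sketched in a few lines of NumPy. This is a deliberately simplified version, assuming grayscale input: it builds per-cell histograms of gradient orientation but replaces HOG's overlapping block normalization with a single global L2 normalization and does no bin interpolation. The function name `simple_hog` and its parameters are illustrative; the project's `extraction.py` may use a library implementation.

```python
import numpy as np


def simple_hog(image: np.ndarray, cell_size: int = 8, n_bins: int = 9) -> np.ndarray:
    """Per-cell histograms of gradient orientation, weighted by magnitude."""
    image = image.astype(np.float64)
    # np.gradient returns derivatives along axis 0 (rows), then axis 1 (columns).
    gy, gx = np.gradient(image)
    magnitude = np.hypot(gx, gy)
    # Unsigned orientation in [0, 180) degrees.
    orientation = np.degrees(np.arctan2(gy, gx)) % 180.0
    # Clamp guards against a value rounding up to exactly 180.
    bin_index = np.minimum((orientation / (180.0 / n_bins)).astype(int), n_bins - 1)

    rows, cols = image.shape[0] // cell_size, image.shape[1] // cell_size
    histograms = np.zeros((rows, cols, n_bins))
    for r in range(rows):
        for c in range(cols):
            window = (slice(r * cell_size, (r + 1) * cell_size),
                      slice(c * cell_size, (c + 1) * cell_size))
            histograms[r, c] = np.bincount(
                bin_index[window].ravel(),
                weights=magnitude[window].ravel(),
                minlength=n_bins,
            )
    descriptor = histograms.ravel()
    # Global L2 normalization stands in for HOG's block normalization.
    return descriptor / (np.linalg.norm(descriptor) + 1e-12)


# Synthetic 32x32 grayscale image with a horizontal intensity ramp.
test_image = np.tile(np.arange(32, dtype=np.float64), (32, 1))
descriptor = simple_hog(test_image)
print(descriptor.shape)
```

On the ramp image every gradient points along the horizontal axis, so all of the descriptor's energy falls into the first orientation bin of each cell.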

`/src/visualization/`

explorer.py: Data exploration and visualization functions for dataset analysis and result interpretation.

`/src/metrics/`

evaluation.py: Model evaluation metrics and performance visualization tools.
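
The kind of metrics such a module typically reports can be sketched as follows. The helper names `confusion_matrix` and `per_class_recall` and the toy labels are assumptions for illustration only; they are not the functions defined in `evaluation.py`.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, n_classes: int) -> np.ndarray:
    """Count matrix with true classes on rows and predicted classes on columns."""
    matrix = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        matrix[t, p] += 1
    return matrix


def per_class_recall(matrix: np.ndarray) -> np.ndarray:
    """Fraction of each true class that was predicted correctly."""
    support = matrix.sum(axis=1)
    # Classes with no samples get recall 0 rather than a division error.
    return np.divide(np.diag(matrix), support,
                     out=np.zeros(len(matrix)), where=support > 0)


# Toy labels for three classes; the project itself has eight.
y_true = [0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 1, 1, 1, 2, 2, 0, 2]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
accuracy = np.trace(cm) / cm.sum()
recall = per_class_recall(cm)
print(cm)
print(accuracy, recall)
```

Reporting recall per class alongside overall accuracy helps reveal whether a high headline score hides weak performance on a particular cell type.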

`/src/utils/`

setup.py: Environment setup and installation utilities.
config.py: Configuration management for consistent experiment parameters.
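
One common way to keep experiment parameters consistent is an immutable dataclass. The sketch below is hypothetical: apart from the eight classes mentioned in this README, every field name and default value is a placeholder and does not reflect the actual contents of `config.py`.

```python
from dataclasses import asdict, dataclass, replace


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical experiment settings; the defaults are placeholders."""
    num_classes: int = 8          # the README mentions eight blood cell types
    image_size: int = 224         # assumed input resolution
    batch_size: int = 32          # assumed
    learning_rate: float = 1e-3   # assumed
    max_epochs: int = 20          # assumed
    patience: int = 3             # assumed early-stopping patience
    seed: int = 42                # assumed


base = ExperimentConfig()
# replace() builds a modified copy, leaving the frozen original untouched.
variant = replace(base, learning_rate=1e-4)
print(asdict(variant))
```

Freezing the dataclass means a run cannot accidentally mutate shared settings, and `asdict` gives a plain dictionary that can be logged next to each experiment's results.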

Author

Yehonatan Keypur

Grade: 100
