We all end up with a huge pile of images to delete at the end of each semester. I've trained a CNN model to identify study-notes images and move them out of the WhatsApp Images directory.
The image dataset can be found on Google Drive.
Requirements:
Instructions:
- Download and extract the project folder. Open a terminal in the extracted folder and install the dependencies with `pip install -r requirements.txt`.
- Connect your smartphone to your system (Linux/Ubuntu OS), mount `Internal Storage`, and copy the absolute path to the `WhatsApp` folder. To find the absolute path, open a terminal in the `WhatsApp` folder and run `pwd` (on Linux), or copy the location from Properties in the right-click menu (on Windows).
- Run the `extract.py` script with `python extract.py` and paste the copied path when prompted. The script creates a new folder named `notes` in your `WhatsApp Images` folder and moves the study-notes images into it.
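For the curious, the move step can be sketched roughly as below. This is only an illustration and not the actual `extract.py`: `move_notes` and `IMAGE_SUFFIXES` are made-up names, and `is_note` is a hypothetical stand-in for the trained CNN's prediction on a single image.

```python
import shutil
from pathlib import Path
from typing import Callable, List

# Assumption: the file types worth checking; the real script may differ.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def move_notes(images_dir: str, is_note: Callable[[Path], bool]) -> List[Path]:
    """Move every image that `is_note` flags into `<images_dir>/notes`.

    `is_note` is a placeholder for the model: it takes an image path and
    returns True when the image looks like study notes.
    """
    src = Path(images_dir).expanduser()
    if not src.is_dir():
        raise NotADirectoryError(f"{src} is not a directory")

    notes_dir = src / "notes"
    notes_dir.mkdir(exist_ok=True)

    moved = []
    # sorted() materialises the listing before any file is moved.
    for path in sorted(src.iterdir()):
        if not path.is_file() or path.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip sub-folders (including notes/) and non-images
        if is_note(path):
            target = notes_dir / path.name
            shutil.move(str(path), str(target))
            moved.append(target)
    return moved
```

With a real model you would pass a function that loads the image, runs the prediction, and compares the score against a threshold of your choice.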
I've trained the model on about 1000 images, using Keras' data augmentation pipeline. The model is currently 92% accurate on my dataset. Feel free to add your own data and retrain the model to make it more accurate. To add your own data, create a `data` folder inside the `behind_the_scenes` folder, then create two subfolders, `1` and `0`, inside `data`. Put study notes in `1` and all other important images in `0`. See the `behind_the_scenes` folder for more info.
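If you'd rather script the folder setup, here is a small sketch that creates the layout described above and counts the images in each class. The helper names (`prepare_data_dirs`, `count_images`) and the list of image extensions are my own assumptions, not part of the project.

```python
from pathlib import Path
from typing import Dict

# Assumption: extensions treated as images when counting.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def prepare_data_dirs(project_root: str) -> Dict[str, Path]:
    """Create behind_the_scenes/data/1 (notes) and data/0 (other images)."""
    data = Path(project_root) / "behind_the_scenes" / "data"
    dirs = {"notes": data / "1", "other": data / "0"}
    for folder in dirs.values():
        folder.mkdir(parents=True, exist_ok=True)
    return dirs


def count_images(dirs: Dict[str, Path]) -> Dict[str, int]:
    """Count image files per class, to spot a badly unbalanced dataset."""
    return {
        label: sum(
            1
            for p in folder.iterdir()
            if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
        )
        for label, folder in dirs.items()
    }
```

Run `prepare_data_dirs(".")` from the project folder, copy your images into the two subfolders, and check `count_images(...)` before retraining.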
