Tracking issue for the results of different network architectures #194

Open · 16 tasks
marco-c opened this issue Jun 9, 2018 · 36 comments
@marco-c (Owner) commented Jun 9, 2018

We will use this issue to track the results of the different network architectures and training methodologies.

Add a comment in this issue when you are working on one of them (I'll mark it as "in progress"), when you're finished with one of them (so I can mark it as done), or when you think of a new one to add.

  • Pretrained with ImageNet:
  • Pretrained with pretrain.py:
  • vgg16
  • vgg19
  • vgg-like
  • simnet
  • simnet-like
  • resnet
  • From scratch:
@Shashi456 (Contributor)

@marco-c I think it'd be a good idea to keep a list of the current architectures here in a to-do list format, and update the list whenever a new architecture is added. This could help us keep track.

@marco-c (Owner) commented Jun 9, 2018

@marco-c I think it'd be a good idea to keep a list of the current architectures here in a to-do list format, and update the list whenever a new architecture is added. This could help us keep track.

Yes, this was exactly my idea :)

When you finish one, tell me here and I'll update the list.
If you want to add one, do the same.

marco-c changed the title from "Results" to "Tracking issue for the results of different network architectures" on Jun 9, 2018
@Trion129 (Contributor) commented Jun 9, 2018

:D Oh, this is a nice idea! One place to show all the benchmarks!

marco-c added the nn label on Jun 9, 2018
@sdv4 (Collaborator) commented Jun 9, 2018

It looks like network.py contains implementations for 'inception', 'vgglike', 'vgg16', 'vgg19', 'simnet', and 'simnetlike' architectures. Are there other architectures that still need to be implemented?

@Shashi456 (Contributor)

@marco-c I think we need to start thinking about a benchmark. When we start training these networks, we will need to benchmark them against something (like human accuracy on CIFAR challenges). What do you think?

And we haven't added ResNet to our networks yet.

@sagarvijaygupta (Collaborator)

I was working on the pretrained VGG16 model and got a validation accuracy of 80%. I was not able to save it to a file, though (because of a bug that will be fixed by #201).

Epoch 50/50
83/82 [==============================] - 152s 2s/step - loss: 0.0243 - accuracy: 0.9672 - val_loss: 0.1793 - val_accuracy: 0.7841
Epoch 00050: val_accuracy did not improve from 0.80966
[0.1576942801475525, 0.8004807692307693]

@sagarvijaygupta (Collaborator)

Should we create a directory where models will be saved? And should we change

callbacks_list = [ModelCheckpoint('best_train_model.hdf5', monitor='val_accuracy', verbose=1, save_best_only=True, mode='max'), timer]

to use a name like user_best_VGG16_model or something like that, so that we can link the train_info file and the model?
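For illustration, a minimal sketch of what that could look like; the train_info_id variable and the models directory are assumptions, not existing code:

import os
from keras.callbacks import ModelCheckpoint

# Hypothetical identifier shared with the train_info file,
# e.g. "f8ece846acde_16_26_2018_06_12".
train_info_id = 'f8ece846acde_16_26_2018_06_12'

models_dir = 'models'
os.makedirs(models_dir, exist_ok=True)

# Save the best model under the same name as the train_info file,
# so the two can be matched later.
checkpoint = ModelCheckpoint(os.path.join(models_dir, train_info_id + '.hdf5'),
                             monitor='val_accuracy', verbose=1,
                             save_best_only=True, mode='max')

callbacks_list = [checkpoint]  # plus the existing 'timer' callback from train.py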

@marco-c (Owner) commented Jun 12, 2018

It looks like network.py contains implementations for 'inception', 'vgglike', 'vgg16', 'vgg19', 'simnet', and 'simnetlike' architectures. Are there other architectures that still need to be implemented?

As @Shashi456 said, ResNet. There are also other architectures that we might add, but I would focus on getting at least something basic working and then we can try to improve on it.

@marco-c I think we need to start thinking about a benchmark. When we start training these networks, we will need to benchmark them against something (like human accuracy on CIFAR challenges). What do you think?

The benchmark could be #195.

I was working on the pretrained VGG16 model and got a validation accuracy of 80%.

80% is impressive for a first try! But it might be due to class imbalance; we should take that into account.

Should we create a directory where models will be saved? And should we change

I'm thinking of creating another repository where we store the models and setting it as a submodule of this repository (like data and tools).

to use a name like user_best_VGG16_model or something like that, so that we can link the train_info file and the model?

Yes, linking the train_info file and the model should be done; I'm not sure about the name, though.

@sagarvijaygupta (Collaborator)

80% is impressive for a first try! But it might be due to class imbalance; we should take that into account.

So should we use a confusion matrix to account for class imbalance? Or should we make the training dataset itself balanced (something similar to pretrain.py)?

Yes, linking the train_info file and the model should be done; I'm not sure about the name, though.

We could simply give the model the same name as the train_info file, if that sounds good?

Also, I wanted to know: is there any particular reason we implemented VGG16 and the others as functions instead of using the predefined ones available in Keras?

@sagarvijaygupta (Collaborator)

Update: VGG16 pretrained with ImageNet.
The high accuracy was indeed because of class imbalance. On checking the predicted values from the latest model (which gave 90% accuracy after 15 epochs), I found that we have a class ratio of 36:380.

I have attached the text file generated for the training.
f8ece846acde_16_26_2018_06_12.txt

@Shashi456 (Contributor)

@sagarvijaygupta @marco-c I think we definitely need to handle class imbalance before taking these accuracy values at face value. With a class imbalance this high, we would reach a certain accuracy even if all the predictions were 'y'.

@marco-c (Owner) commented Jun 13, 2018

So should we use a confusion matrix to account for class imbalance? Or should we make the training dataset itself balanced (something similar to pretrain.py)?

I think we should use a confusion matrix. Making the training dataset balanced is feasible for pretrain.py because we have effectively infinite training examples, but for train.py we only have a limited dataset.
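For reference, a minimal sketch of computing the confusion matrix with scikit-learn; it assumes a trained model and an ImageDataGenerator-style validation_generator created with shuffle=False (so that .classes lines up with the prediction order), which may not match the actual training script:

import numpy as np
from sklearn.metrics import confusion_matrix

# 'model' and 'validation_generator' are assumed to already exist in the
# training script; the generator must not shuffle, or the labels below
# will not correspond to the predictions.
y_prob = model.predict_generator(validation_generator,
                                 steps=len(validation_generator))
y_pred = np.argmax(y_prob, axis=1)
y_true = validation_generator.classes

print(confusion_matrix(y_true, y_pred))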

@marco-c (Owner) commented Jun 13, 2018

We could simply give the model the same name as the train_info file, if that sounds good?

Sounds good to me!

@marco-c (Owner) commented Jun 13, 2018

Also, I wanted to know: is there any particular reason we implemented VGG16 and the others as functions instead of using the predefined ones available in Keras?

If we can reuse them, we definitely should. The first network I wrote was the "vgg-like" one, so it clearly wasn't available in Keras. Then, when we added more, I forgot there were already some available in Keras.

@marco-c (Owner) commented Jun 13, 2018

@sagarvijaygupta @marco-c I think we definitely need to handle class imbalance before taking these accuracy values at face value. With a class imbalance this high, we would reach a certain accuracy even if all the predictions were 'y'.

Indeed, this is probably what's happening with the 90% accuracy.

@sagarvijaygupta (Collaborator)

@marco-c Should I create a separate PR for each model available at https://keras.io/applications/? And for using pretrained models, should we pass an argparse option like --weights=imagenet?

@marco-c (Owner) commented Jun 15, 2018

@marco-c Should I create a separate PR for each model available at https://keras.io/applications/?

Yes, but this is not a high priority. It doesn't matter for now if we keep our own implementation or if we reuse the already existing ones.

And for using pretrained models, should we pass an argparse option like --weights=imagenet?

Sounds good to me!
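For illustration, a minimal sketch of the argparse side under that suggestion; the surrounding parser setup is assumed, not taken from train.py:

import argparse

parser = argparse.ArgumentParser()
parser.add_argument('--weights', choices=['imagenet', 'none'], default='none',
                    help="Use 'imagenet' pretrained weights or train from scratch.")
args = parser.parse_args()

# Keras expects None (not the string 'none') when no pretrained weights are used.
weights = None if args.weights == 'none' else args.weights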

@sagarvijaygupta (Collaborator)

@marco-c Using pre-trained weights might be simpler if we directly use the Keras models.

@sagarvijaygupta (Collaborator)

@marco-c I totally forgot to remove the prediction layer (the softmax one) while using the pretrained VGG16. We should not take those values for now!
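For reference, a minimal sketch of using the Keras VGG16 application with include_top=False so the ImageNet softmax layer is dropped; the head sizes and input shape here are illustrative assumptions, not the project's actual configuration:

from keras.applications import VGG16
from keras.layers import Dense, Flatten
from keras.models import Model

# include_top=False drops the ImageNet-specific prediction (softmax) layer.
base = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

x = Flatten()(base.output)
x = Dense(256, activation='relu')(x)             # head size is illustrative
predictions = Dense(2, activation='softmax')(x)  # e.g. y vs d + n

model = Model(inputs=base.input, outputs=predictions)
model.compile(optimizer='sgd', loss='categorical_crossentropy',
              metrics=['accuracy'])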

@sdv4 (Collaborator) commented Jun 15, 2018

Just a heads up, I am going to start testing the VGG19 (from scratch) architecture. I will open a PR for this too.

@sagarvijaygupta (Collaborator)

Network - ResNet50
Pretrained - Imagenet
Optimiser - sgd
Epochs - 20
Accuracy - 85.81%
65d451c26877_18_38_2018_06_16.txt

Confusion Matrix:

[[132 41]
[ 16 195]]

@marco-c (Owner) commented Jun 17, 2018

@sagarvijaygupta is this for Y vs D + N or Y + D vs N?

@sagarvijaygupta (Collaborator)

@marco-c This is for Y vs D + N. It's in the file. 😄

@sagarvijaygupta (Collaborator)

Network - vgg16
Pretrained - Imagenet
Optimiser - sgd
Epochs - 20
Accuracy - 87.25%
65d451c26877_19_39_2018_06_16.txt

Confusion Matrix:

[[141 32]
[ 23 188]]

@sagarvijaygupta (Collaborator)

Network - vgg19
Pretrained - Imagenet
Optimiser - sgd
Epochs - 20
Accuracy - 86.77%
4aa405f41ed8_14_47_2018_06_18.txt

Confusion Matrix:

[[148 26]
[ 30 180]]

@sdv4 (Collaborator) commented Jun 19, 2018

I have been having a difficult time using Colab over the past few days. Most times that I run my notebook, the process is killed. I have been trying to figure out why, and stumbled upon this post:

https://stackoverflow.com/questions/48750199/google-colaboratory-misleading-information-about-its-gpu-only-5-ram-available

I am also on the west coast of Canada, where the author of that post is located. I managed to get one good run late last night, where I ran the training for over 80 epochs. However, the output wasn't saved anywhere that I could find. Note that I am running the notebook that exists on my forked repo on GitHub.

@sagarvijaygupta where is your output being saved?

I am trying to run the training again right now with my Google Drive mounted in Colab, but I haven't been able to have a successful run over the past few hours, due to the issue linked above and the fact that there are no GPU backends available.

@marco-c is there another cloud based GPU service that you would recommend?

That being said, when running train.py via the notebook on Colab, the best val_accuracy achieved was around 85.7%, after over 50 epochs. However, when I run train.py locally on my machine (with no GPU), I get a val_accuracy of 95.2% after 4 epochs. I am trying to figure out why this is, but wanted to post the info in case the reason is obvious to someone.

@sagarvijaygupta (Collaborator) commented Jun 19, 2018

@sdv4 First of all, for the problem of the GPU memory showing only about 500 MB, I did find a simple solution that works for me. Whenever you execute train.py you will see the amount of memory available, like totalMemory: 11.17GiB freeMemory: 11.10GiB. If freeMemory is around 500 MB, just restart the runtime (Runtime -> Restart Runtime...). When you restart your runtime you don't need to re-clone; only your local variables will be lost. Usually this is solved in one try, and you won't get OOM or resource-exhausted errors afterwards. That said, my machine also dies sometimes while training is going on, and I too feel it won't be feasible to run training for larger numbers of epochs on Colab.

Also, GPU backends are not always available, so sometimes it is luck!

My output is being saved in the Colab runtime only. If I want to save a model, I upload it to my Drive.

Regarding your accuracy issue: change the number of epochs to 4 only, run the training, and upload the text file generated. It might be the case that the number of samples you are taking is not the total number of samples available (some might be missing).

@sdv4 (Collaborator) commented Jun 19, 2018

@sagarvijaygupta Regarding the accuracy issue, here is the text file after only one epoch, where val_accuracy is at 90.6%:

Shanes-MacBook-Pro.local_13_01_2018_06_19.txt

The numbers of training, test, and validation samples are the same as in the last txt you shared.

Also, thanks for the Colab tips. Good to know it isn't just here where there is a problem.

@sagarvijaygupta (Collaborator) commented Jun 20, 2018

@sdv4 your classification type is different. That might be the reason for it.

Labels:
y - 2120
n - 593
d - 1136

This is the breakdown of labels.csv, so I am pretty sure that's the reason.
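A quick way to check that breakdown, assuming labels.csv has a column named label (the column name is an assumption; adjust it to the actual header):

import pandas as pd

labels = pd.read_csv('labels.csv')
# Prints the count of each class, e.g. y, d, n.
print(labels['label'].value_counts())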

@sdv4 (Collaborator) commented Jun 20, 2018

@sagarvijaygupta yes, you were right. The numbers are more in line with what was expected once I corrected the classification type:

Network - vgg19
Pretrained - none
Optimiser - sgd
Epochs - 50
Accuracy - 84.61%

258a95a88d5c_01_09_2018_06_21.txt

Confusion matrix:

[[136 39]
[ 26 215]]

@sagarvijaygupta (Collaborator)

@sdv4 there is no such thing as "correcting" the classification type. Your results were for a different classification, and they were correct for that. I guess we want results for both!

@Shashi456 (Contributor)

@marco-c do you think there's a neater way to record these observations? The issue will get pretty verbose, and it will get harder to track the benchmarks.

@marco-c (Owner) commented Jun 20, 2018

I think I'll just remove the comments at some point, and put the summary of the results in the first comment.

@sdv4 (Collaborator) commented Jun 21, 2018

Heads up, I am going to start testing the VGG16 and VGG-like architectures (from scratch variant).

@sdv4 (Collaborator) commented Jun 22, 2018

Network - VGG16
Pretrained - None
Optimiser - SGD
Epochs - 50
Accuracy - 80.29 %

6c685b649c2b_07_55_2018_06_22.txt

Confusion matrix:
[[142 61]
[ 22 191]]

@marco-c (Owner) commented Jul 25, 2018

I've added usernames next to the networks people are testing, so we know who's testing what.

marco-c added this to the 3a. Accuracy improvements milestone on Dec 7, 2018