Try using a network pretrained using Resnet 50 #112

Closed
vikasmahato opened this issue Mar 10, 2018 · 20 comments

Comments

@vikasmahato
Contributor

vikasmahato commented Mar 10, 2018

References #7

This paper provides various benchmarks for pretrained models used for transfer learning.
https://openreview.net/pdf?id=Bygq-H9eg

It also suggests that ResNet should perform better than VGG.

@marco-c
Owner

marco-c commented Mar 11, 2018

There are already two PRs about this: #61 and #105.

@marco-c
Owner

marco-c commented Mar 11, 2018

N.B.: they are adding the architecture, but nothing regarding a pretrained network.

@vrishank97
Contributor

Great. Should I work on this in network.py?

@marco-c
Owner

marco-c commented Mar 11, 2018

Yes, the first step would be to figure out how to use an already pretrained network, since the size of our images is different from the defaults for VGG, ResNet, etc.
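
A minimal sketch of that first step, assuming Keras with the TensorFlow backend; the non-default input shape below is just a placeholder:

```python
# Sketch: loading a pretrained ResNet50 without its classification head,
# so it can accept an input size other than the 224x224 default.
# Assumes Keras with the TensorFlow backend; the input shape is a placeholder.
from keras.applications.resnet50 import ResNet50

base_model = ResNet50(
    weights='imagenet',        # weights pretrained on ImageNet
    include_top=False,         # drop the 1000-class fully connected head
    input_shape=(256, 384, 3)  # placeholder; must meet Keras' minimum size for ResNet50
)
base_model.summary()
```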

@vrishank97
Contributor

I'm considering squashing images to 224x224. As we are mainly concerned with UI/UX elements and not text, I don't think squashing will affect performance.
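
For reference, a tiny sketch of the squashing step, assuming Pillow; the file names are placeholders:

```python
# Sketch: squashing a screenshot to the 224x224 ImageNet input size,
# ignoring the aspect ratio. Assumes Pillow; file names are placeholders.
from PIL import Image

img = Image.open('screenshot.png').convert('RGB')
img_224 = img.resize((224, 224), Image.BILINEAR)
img_224.save('screenshot_224.png')
```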

@marco-c
Owner

marco-c commented Mar 11, 2018

Yeah, that's one of the options we should try.

@vikasmahato
Contributor Author

@vrishank97 I was working on this issue. However, if you want to take it, please let me know.

@vrishank97
Contributor

@vikasmahato Since we are just experimenting with different ways of getting the transfer learning to work, we'll get results faster if we work in parallel. I will try downscaling images to the ImageNet dimensions and re-training ResNet-50 on them. What approach are you currently working on?

@vikasmahato
Contributor Author

@vrishank97 I was also thinking the same. However, since you are already doing that, I'll try Inception V3 and see how it performs.

@vrishank97
Contributor

@vikasmahato Great. Can you try using Inception-ResNet-v2 instead? It has higher accuracy on ImageNet.
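
For reference, a hedged sketch of loading that model, assuming a Keras version that ships keras.applications.inception_resnet_v2:

```python
# Sketch: loading Inception-ResNet-v2, whose default input size is 299x299.
# Assumes a Keras version that ships this application model.
from keras.applications.inception_resnet_v2 import InceptionResNetV2

base_model = InceptionResNetV2(weights='imagenet', include_top=False,
                               input_shape=(299, 299, 3))
```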

https://research.googleblog.com/2016/08/improving-inception-and-image.html

@vikasmahato
Contributor Author

@vrishank97 Sure!

@Shashi456
Contributor

@vrishank97 @vikasmahato I wanted to ask you both: what are the steps for using a pretrained network? Are you fine-tuning just the last layer?

@vrishank97
Contributor

We freeze the convolutional layers and retrain the fully connected layers with a custom softmax output layer.
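
A minimal sketch of that setup, assuming Keras; NUM_CLASSES and the input shape are placeholders, and the ResNet-50 base is just one possible choice:

```python
# Sketch: freeze the convolutional base and train only a new fully connected
# head with a custom softmax output. NUM_CLASSES and the input shape are
# placeholders.
from keras.applications.resnet50 import ResNet50
from keras.layers import GlobalAveragePooling2D, Dense
from keras.models import Model

NUM_CLASSES = 2  # placeholder

base = ResNet50(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
for layer in base.layers:
    layer.trainable = False  # freeze the convolutional layers

x = GlobalAveragePooling2D()(base.output)
x = Dense(256, activation='relu')(x)
outputs = Dense(NUM_CLASSES, activation='softmax')(x)  # custom softmax head

model = Model(inputs=base.input, outputs=outputs)
model.compile(optimizer='adam', loss='categorical_crossentropy',
              metrics=['accuracy'])
```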

@vrishank97
Contributor

vrishank97 commented Mar 12, 2018

Here are some resources
https://blog.keras.io/building-powerful-image-classification-models-using-very-little-data.html
https://towardsdatascience.com/transfer-learning-using-keras-d804b2e04ef8

and a pre-existing tool from TensorFlow:
https://www.tensorflow.org/tutorials/image_retraining

Here they generate bottleneck features to train a model; doing so speeds up the process because we don't have to keep re-running the computationally expensive convolution operations.
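
A rough sketch of the bottleneck-feature idea in Keras; X_train (preprocessed images) and y_train (one-hot labels) are placeholder arrays:

```python
# Sketch: compute bottleneck features once with the frozen convolutional base,
# then train a small classifier on the cached features.
# X_train and y_train are placeholders.
from keras.applications.resnet50 import ResNet50, preprocess_input
from keras.models import Sequential
from keras.layers import Flatten, Dense

base = ResNet50(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

# The expensive convolutions run only once per image.
bottleneck_features = base.predict(preprocess_input(X_train.astype('float32')))

clf = Sequential([
    Flatten(input_shape=bottleneck_features.shape[1:]),
    Dense(256, activation='relu'),
    Dense(y_train.shape[1], activation='softmax'),
])
clf.compile(optimizer='adam', loss='categorical_crossentropy',
            metrics=['accuracy'])
clf.fit(bottleneck_features, y_train, epochs=10, batch_size=32)
```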

@marco-c
Owner

marco-c commented Mar 12, 2018

We should try both approaches: freezing all layers except the top ones, and also keeping everything trainable.
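
A hedged sketch of the second variant, fine-tuning everything with a small learning rate; it assumes a `model` built on a pretrained base as in the earlier sketches:

```python
# Sketch: keep every layer trainable and fine-tune the whole network with a
# small learning rate so the pretrained weights are not destroyed.
# Assumes `model` was built on a pretrained base as sketched above.
from keras.optimizers import SGD

for layer in model.layers:
    layer.trainable = True  # unfreeze everything

model.compile(optimizer=SGD(lr=1e-4, momentum=0.9),  # small LR for fine-tuning
              loss='categorical_crossentropy',
              metrics=['accuracy'])
```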

@vrishank97
Contributor

vrishank97 commented Mar 12, 2018

@marco-c The dimensions shouldn't be an issue for transfer learning. All images are resized by prepare_images() in utils.py. We only need to load the ImageNet weights.

@marco-c
Owner

marco-c commented Mar 12, 2018

> @marco-c The dimensions shouldn't be an issue for transfer learning. All images are resized by prepare_images() in utils.py. We only need to load the ImageNet weights.

Yes, but by resizing we might be losing some information.

@vrishank97
Contributor

Agreed, but I think it's mainly the text areas where we lose information; major UI elements would still be recognisable, especially if we use Inception. It has an input of 299x299 instead of 224x224, so there's less information loss.

@marco-c
Owner

marco-c commented Mar 12, 2018

Yes, hopefully.

@marco-c
Owner

marco-c commented Jun 9, 2018

Closing in favor of #194.

@marco-c marco-c closed this as completed Jun 9, 2018