Implement a DOM-based technique as a baseline #195

marco-c · 2018-06-09T18:28:14Z

We should compare our technique based on CNN with a technique that doesn't use machine learning.

sagarvijaygupta · 2018-06-09T18:38:35Z

We should collect screenshots with DOM information also. Presently we don't have them.

marco-c · 2018-06-09T18:51:19Z

We have implemented collecting DOM information too, but we haven't collected any.

Shashi456 · 2018-06-12T11:09:45Z

@marco-c do we simply need to run collect.py to start collecting data with dom info?

marco-c · 2018-06-13T04:06:27Z

Yes, I think so. We implemented it recently. You should run it for a couple of websites and check that it is actually generating correct data.

Shashi456 · 2018-06-22T17:20:10Z

@marco-c although the dom info seems to be getting collected properly it is very slow in doing so. and could you tell me how i could add them to the repo since it's a git lfs file ?

marco-c · 2018-06-22T23:27:01Z

@marco-c although the dom info seems to be getting collected properly it is very slow in doing so. and could you tell me how i could add them to the repo since it's a git lfs file ?

Yes, because of all the time we have to wait to be sure we have loaded everything the crawler is quite slow.
You can add them normally, git lfs is completely transparent.

marco-c changed the title ~~Implement a DOM-based technique without using CNN as a baseline~~ Implement a DOM-based technique as a baseline Jun 9, 2018

marco-c mentioned this issue Jun 12, 2018

Tracking issue for the results of different network architectures #194

Open

16 tasks

marco-c added this to the 1. Classifier with baseline accuracy/precision milestone Dec 7, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement a DOM-based technique as a baseline #195

Implement a DOM-based technique as a baseline #195

marco-c commented Jun 9, 2018

sagarvijaygupta commented Jun 9, 2018

marco-c commented Jun 9, 2018

Shashi456 commented Jun 12, 2018

marco-c commented Jun 13, 2018

Shashi456 commented Jun 22, 2018

marco-c commented Jun 22, 2018

Implement a DOM-based technique as a baseline #195

Implement a DOM-based technique as a baseline #195

Comments

marco-c commented Jun 9, 2018

sagarvijaygupta commented Jun 9, 2018

marco-c commented Jun 9, 2018

Shashi456 commented Jun 12, 2018

marco-c commented Jun 13, 2018

Shashi456 commented Jun 22, 2018

marco-c commented Jun 22, 2018