rebuild resnet using blocks #156

brettkoonce · 2019-05-09T02:19:14Z

No description provided.

saeta

This looks awesome! I'll let @jekbradbury review as well, but thank you very much.

Couple quick thoughts:

How does the compile time of swift-models change with the new version of ResNet? :-D
To simplify things a bit, could you make imageSize optional (either in this PR or a follow-on PR)? ResNet is fully convolutional and works with multiple image sizes without changing the model. Having "imageSize" be a required parameter (and a single scalar instead of a (height, width) tuple) might imply to people that the ResNet implementation only works for a single (square) image size. (Thus discouraging folks from using techniques like progressive resizing.)
In a follow-up PR, let's discuss making the ResNet implementation public so that folks can (in their own Swift projects or notebooks) import the canonical ResNet and use it themselves.

Thank you again. This is super awesome to see.

All the best,
-Brennan

brettkoonce · 2019-05-10T21:49:37Z

re 1) Not sure if this is just my system (running semi-current tf-swift master), but the existing version in models repo takes 43 seconds to compile here. Using this code takes ~3-4 seconds to compile.
re 2) Not sure how the current codebase handles arbitrary input, but certainly happy to rework things however desired.
re 3) That is fine, but having said that it would probably be good to refactor this a bit more first. If there is a way to parameterize the BasicBlock/ResidualIdentityBlock logic then this file could be cut in half again and everything reduced to a single ResNet class, which I think would be simpler for people new to computer vision. This torchvision logic is nice: https://github.com/pytorch/vision/blob/50d54a82d1479ffb6dd7469ed05fccdf290a1d84/torchvision/models/resnet.py#L216

Let me know!

rxwei · 2019-05-10T22:03:13Z

re 1) Not sure if this is just my system (running semi-current tf-swift master), but the existing version in models repo takes 43 seconds to compile here. Using this code takes ~3-4 seconds to compile.

This is expected because the compiler is bugged. The implementation detail is that we've turned off a part of the Swift compiler that optimizes large structures in order to suppress other bugs. We are actively working on this and expect this to be fixed within a week or two.

jekbradbury · 2019-05-11T02:59:03Z

2. could you make imageSize optional

This would be a good change. The reason imageSize exists at all is to distinguish between ResNets designed for CIFAR image sizes and those designed for ImageNet sizes; my impression is that the ImageNet-sized networks also work fine for images of other similar sizes.

brettkoonce · 2019-05-12T21:53:29Z

I can work on coming up with a way to add an input type to specify cifar, imagenet and generic variants. In the same vein, can come back to refactoring things more down the road.

ResNet/ResNet.swift

rxwei · 2019-05-14T00:44:51Z

ResNet/ResNet.swift

+        case resNet152
+    }
+
+    init(kind: Kind, type: InputKind) {


If kind conflicts with the first argument label, you can call it inputKind:. Relatedly, would DataKind be a better name for the type? If so, dataKind: would be a good label name. Same for other initializers that take DataKind.

moved to: inputKind: Kind, dataKind: DataKind, let me know what you think!

rebuild resnet using blocks

be0fc5e

rxwei requested a review from jekbradbury May 9, 2019 04:02

saeta approved these changes May 9, 2019

View reviewed changes

jekbradbury approved these changes May 11, 2019

View reviewed changes

use InputType to determine filter sizes/class counts

a7c3047

rxwei reviewed May 13, 2019

View reviewed changes

ResNet/ResNet.swift Outdated Show resolved Hide resolved

ResNet/ResNet.swift Outdated Show resolved Hide resolved

ResNet/ResNet.swift Outdated Show resolved Hide resolved

ResNet/ResNet.swift Outdated Show resolved Hide resolved

naming/formatting tweaks

6ed2af6

brettkoonce force-pushed the resnet-block-cleanup branch from 56371e4 to 6ed2af6 Compare May 14, 2019 00:09

rxwei reviewed May 14, 2019

View reviewed changes

brettkoonce added 2 commits May 13, 2019 20:26

clarify enum names

cd2f911

match input naming scheme

bfece34

brettkoonce mentioned this pull request Jun 18, 2019

rebuild resnet block based approach #170

Merged

brettkoonce closed this Jun 18, 2019

pschuh pushed a commit to pschuh/swift-models that referenced this pull request Jul 30, 2019

Added support for a 'Tensor.gathering(where:)'. (tensorflow#156)

0e3b3f4

brettkoonce mentioned this pull request Dec 12, 2019

Why is ResNet fixed to only two datasets? #253

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

rebuild resnet using blocks #156

rebuild resnet using blocks #156

Uh oh!

brettkoonce commented May 9, 2019

Uh oh!

saeta left a comment •

edited

Loading

Uh oh!

brettkoonce commented May 10, 2019

Uh oh!

rxwei commented May 10, 2019

Uh oh!

jekbradbury commented May 11, 2019

Uh oh!

brettkoonce commented May 12, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rxwei May 14, 2019

Uh oh!

brettkoonce May 14, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rebuild resnet using blocks #156

rebuild resnet using blocks #156

Uh oh!

Conversation

brettkoonce commented May 9, 2019

Uh oh!

saeta left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

brettkoonce commented May 10, 2019

Uh oh!

rxwei commented May 10, 2019

Uh oh!

jekbradbury commented May 11, 2019

Uh oh!

brettkoonce commented May 12, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rxwei May 14, 2019

Choose a reason for hiding this comment

Uh oh!

brettkoonce May 14, 2019

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

saeta left a comment •

edited

Loading