This repository has been archived by the owner on Jul 1, 2023. It is now read-only.

Implement more layers that are available in Keras #54

Open
rxwei opened this issue Mar 11, 2019 · 34 comments
Labels: enhancement, help wanted

Comments

@rxwei
Contributor

rxwei commented Mar 11, 2019

No description provided.

@rxwei added the enhancement and help wanted labels on Mar 11, 2019
@tanmayb123
Contributor

@rxwei Quick question on this: Do we also want to add layers like Add, Subtract, Multiply, Concatenate, etc.? Two things to note:

  1. If we do, we could just sequence these operations instead of having to run the operations on the Tensors.
  2. How would a layer handle an arbitrary number of inputs? I'm aware of how to send multiple pre-defined inputs using a struct (see the sketch below), but could you just pass an array of Tensors (i.e. [Tensor<Scalar>]) for many inputs?
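For reference, a minimal sketch of the fixed-arity, struct-based approach mentioned in point 2 (the type and property names are made up for illustration, and it follows the applied(to:) style used elsewhere in this thread; an array-based input would need Array to conform to Differentiable first):

struct AddInput<Scalar: TensorFlowFloatingPoint>: Differentiable {
    // Two pre-defined tensor inputs wrapped in a single Differentiable value.
    var first: Tensor<Scalar>
    var second: Tensor<Scalar>
}

struct Add<Scalar: TensorFlowFloatingPoint>: Layer {
    // Element-wise addition of the two wrapped inputs.
    func applied(to input: AddInput<Scalar>) -> Tensor<Scalar> {
        return input.first + input.second
    }
}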

@rxwei
Contributor Author

rxwei commented Mar 18, 2019

Hi @tanmayb123, let’s not add those operator-like layers for now and defer them to further discussion. Ideally, we would want functions to be able to conform to the layer protocol, but that’s currently impossible in Swift, so we’ll probably need a single wrapper type that can turn any differentiable function into a layer.

As for arbitrary-length inputs/outputs, there’s ongoing work on making Array conform to Differentiable. It should land within the next week or so.

@rxwei
Contributor Author

rxwei commented Mar 20, 2019

Random thought: We can define layer wrappers for common arities (unary and binary), and define a layer(_:) factory function for turning any Differentiable-to-Differentiable function into a layer.

Rough sketch:

public struct ClosureLayer<T: Differentiable, U: Differentiable>: Layer {
    @noDerivative
    public var function: @differentiable (T) -> U
    public func applied(to input: T) -> U {
        return function(input)
    }
}

public struct ClosureLayer2<T: Differentiable, U: Differentiable, V: Differentiable>: Layer {
    ...
}

func layer<T: Differentiable, U: Differentiable>(_ function: @differentiable (T) -> U) -> ClosureLayer<T, U> {
    return ClosureLayer(function: function)
}

...

Then, you'll be able to use functions in sequenced(in:through:).

input.sequenced(in: context, through: conv, maxPool, layer(sin), ...)

What's better: You can now use a trailing closure to create anonymous layers!

let myLayer = layer { (input: Tensor<Float>) in
    sin(cos(input)) + someParameter
}

We have plans and designs for differentiating w.r.t. closure captures. Therefore, you will even be able to differentiate through this layer and optimize someParameter.

@Shashi456
Contributor

@rxwei could we go ahead and make a list of what is available and what's to be done, so that we get a clearer picture of which layers need to be added? #53 and #52 are also layer-adding issues, so making one issue that references all of them might make the task easier.

@Shashi456
Contributor

Shashi456 commented Mar 28, 2019

The implementations include:

Convolution Layer :

  • Conv 1D Layer
  • Conv 2D Layer
  • Conv 3D Layer
  • SeparableConv 1D
  • SeparableConv 2D
  • Depthwise Convolution 2D
  • Convolution 2D Transpose
  • Convolution 3D Transpose
  • Deconvolution 2D
  • Deconvolution 3D
  • UpSampling 1D
  • UpSampling 2D
  • UpSampling 3D
  • ZeroPadding 1D
  • ZeroPadding 2D
  • ZeroPadding 3D
  • Cropping 1D
  • Cropping 2D
  • Cropping 3D

Pooling :

  • MaxPool 1D
  • MaxPool 2D
  • AvgPool 1D
  • AvgPool 2D
  • AvgPool 3D
  • MaxPool 3D
  • GlobalMaxPool 1D
  • GlobalMaxPool 2D
  • GlobalMaxPool 3D
  • GlobalAvgPool 1D
  • GlobalAvgPool 2D
  • GlobalAvgPool 3D

Normalization :

  • BatchNorm
  • LayerNorm

Embedding :

  • Embedding

Recurrent :

  • RNN
  • LSTM
  • LSTMCell
  • GRU
  • GRUCell
  • SimpleRNN
  • SimpleRNNCell
  • StackedRNNCell

Core :

  • Dense
  • Flatten
  • Reshape
  • Dropout
  • Masking
  • SpatialDropout 1D
  • SpatialDropout 2D
  • SpatialDropout 3D

Recursive Neural Networks #68

Activation Layers :

  • ReLU
  • ELU
  • Leaky ReLU

There are a few more layers in Core and Activations, and a few more classes such as the merge classes (Add, Concatenate, Dot, Minimum, Maximum, etc.); there are also convolutional recurrent layers and the noise and locally-connected classes. The ones above are the important ones for now IMO, and we can focus on implementing those.

Activation layers will be added as functions; refer to the discussion above.

@rxwei is this okay for a start? I'll make a more comprehensive list in a while.
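For a sense of what many of these look like, here is a minimal, illustrative sketch of a global average pooling layer (assuming an NHWC input layout and the callAsFunction-based Layer API used later in this thread; the name is provisional):

public struct GlobalAvgPool2D<Scalar: TensorFlowFloatingPoint>: Layer {
    public init() {}

    @differentiable
    public func callAsFunction(_ input: Tensor<Scalar>) -> Tensor<Scalar> {
        // Average over the height and width axes, keeping batch and channels.
        return input.mean(squeezingAxes: 1, 2)
    }
}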

@tanmayb123
Contributor

@Shashi456 that’s a great list - thanks for compiling it. Just two things:

  1. Dropout has been implemented, but it’s not checked on your list.
  2. According to what Richard and I discussed above, I thought we weren’t planning on creating layers like activation layers? Rather, we’d just pass values through the functions, or pass the functions to layers as activation arguments (as you can do right now).

@Shashi456
Contributor

@tanmayb123 Alright, I will remove the activation layers. But is that for sure? Weren't they made layers to make the process more intuitive in the first place?

@tanmayb123
Contributor

@rxwei what do you think?

@rxwei
Contributor Author

rxwei commented Mar 28, 2019

@Shashi456 Thanks a lot for listing these! Looks good to me. I'd suggest starting with the non-recurrent ones first.

@aman-bhu

@rxwei, I am willing to contribute. Can I implement one of the above-listed layers?

@rxwei
Contributor Author

rxwei commented Apr 20, 2019

Absolutely! What would you like to implement?

@aman-bhu

I am planning to take the Conv 3D layer.

@rxwei
Contributor Author

rxwei commented Apr 20, 2019

Sounds great. Look forward to your PR.

@Shashi456
Contributor

@rxwei @dan-zheng I wanted to ask if it'd be possible to add more aliases for the different kinds of layers in the repo?
For example GlobalAvgPooling = GlobalAveragePooling, etc.,
and maybe also for the losses,
like meanSquaredError = MSE and sigmoidCrossEntropy = XENT.

@rxwei
Contributor Author

rxwei commented Apr 22, 2019

IMO it is ideal to stick with one set of names for consistency in all our models and example code. Currently we are leaning towards consistency with Keras. This will ensure we have overall consistency in our recommendations, while the user has the full freedom to define any aliases they want in their libraries.
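For example, a user-side alias can be a one-liner (purely illustrative; MaxPool2D is the existing layer name, while the alias name is made up):

typealias MaxPooling2D<Scalar: TensorFlowFloatingPoint> = MaxPool2D<Scalar>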

@Shashi456
Contributor

Shashi456 commented May 23, 2019

#130 shows that UpSampling3D doesn't work. We are currently looking at ways to fix it. One way to do it is to take an approach inspired by the Keras implementation of the same.

Solved.

@dan-zheng
Member

#130 shows that upsampling doesn't work. We are currently looking at ways to fix it. One way to do it is, to take an approach inspired by the Keras Implementation of the same.

To be precise, only UpSampling3D doesn't work, because its implementation creates 8-D tensors, which are too high-dimensional for broadcasting.
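One rank-friendly alternative (a sketch only, mirroring the repeat-elements idea from the Keras implementation; the helper name is hypothetical) is to repeat along one axis at a time by stacking copies and reshaping, which peaks at rank 6 for a 5-D input:

func repeatedElements<Scalar: TensorFlowFloatingPoint>(
    _ x: Tensor<Scalar>, count: Int, alongAxis axis: Int
) -> Tensor<Scalar> {
    // Stack `count` copies on a new axis right after `axis`, then flatten
    // that new axis back into `axis`, repeating each element `count` times.
    let stacked = Tensor(stacking: Array(repeating: x, count: count), alongAxis: axis + 1)
    var newShape = x.shape
    newShape[axis] *= count
    return stacked.reshaped(to: newShape)
}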

@lakshya-sky
Contributor

Hi @Shashi456,
when will SeparableConv2D be available? I'd like to implement MobileNet using S4TF.

@Shashi456
Contributor

@Dash2507, sometime next week. I'm working on it locally right now, I'll push it once I'm done with the other PRs.

@Shashi456
Contributor

@rxwei just a quick question: the convolutional layers also include a zero-padding layer, but we already have a padded function. Do I write the layers anyway? I'm just trying to avoid redundancy, since they would be wrappers that just call this function.

@rxwei
Contributor Author

rxwei commented Jun 16, 2019

We already have such layers; Reshape, for example. Adding a layer wrapper for each function is definitely not ideal and would complicate our API surface. Instead of putting a lot of work into implementing those wrapper layers, I'd suggest trying to define a Function (or Lambda) layer that takes an arbitrary differentiable function and uses it inside callAsFunction(_:). Essentially, it's going to look like this:

public struct Function<InputScalar: TensorFlowFloatingPoint, OutputScalar: TensorFlowFloatingPoint>: Layer {
    public typealias Input = Tensor<InputScalar>
    public typealias Output = Tensor<OutputScalar>
    public var body: @differentiable (Input) -> Output
    public init(body: @differentiable (Input) -> Output) {
        self.body = body
    }
    public func callAsFunction(_ input: Input) -> Output {
        body(input)
    }
}

With this, you can turn any closure to a layer:

let tanhLayer = Function<Float, Float>(body: tanh)
let reshapeLayer = Function<Float, Float> { x in x.reshaped(to: [10, 10]) }
let paddingLayer = Function<Float, Float> { x in x.padded(forSizes: [(0, 1)], with: 0) }

Would you like to prototype this?

@Shashi456
Contributor

Alright, I'll get a PR up later today.

@jon-tow
Contributor

jon-tow commented Jun 17, 2019

I've attempted an implementation of an Embedding layer but am running into problems with the Layer protocol's input type requirements. Given that an Embedding layer consumes tensors of indices (UInt/Int), there's no way to satisfy the differentiability of callAsFunction(_:). Is there a workaround for this?

@dan-zheng I've noticed an implementation of a Differentiable Embedding struct in the GPT-2 model found in the swift-models repo (GPT-2 Transformer). This doesn't conform to the Layer protocol but could we bring it into the API since it's quite useful for NLP tasks?

@Shashi456
Contributor

@jon-tow did you also define a VJP for your embedding layer?

@rxwei
Contributor Author

rxwei commented Jun 17, 2019

I've attempted an implementation of an Embedding layer but am running into problems with the Layer protocol's input type requirements. Given that an Embedding layer consumes tensors of indices (UInt/Int), there's no way to satisfy the differentiability of callAsFunction(_:). Is there a workaround for this?

For now, you can define a nested Input structure and mark the vocabulary property as @noDerivative. Something like:

struct Embedding<Scalar: TensorFlowFloatingPoint> {
    struct Input: Differentiable {
        @noDerivative var vocabulary: Tensor<Int32>
    }
    func callAsFunction(_ input: Input) -> Tensor<Scalar> {
        ...
    }
}

@jon-tow
Contributor

jon-tow commented Jun 17, 2019

Hey @Shashi456. Yup. It just wouldn't compile as it relied on the Raw.gather(params:, atIndices:) function which requires a BinaryInteger for the second argument. Thanks @rxwei I'll give it a try.

@eaplatanios
Contributor

eaplatanios commented Jun 17, 2019 via email

@rxwei
Contributor Author

rxwei commented Jun 17, 2019

Specifying @differentiable(wrt: self) is not possible yet because the Layer protocol requires both input and self to be differentiable. There are definitely a lot of ways to resolve this, e.g. defining a separate protocol that Layer inherits from and making that protocol only require self to be differentiable. However, that requires some non-trivial thunking-related engineering right now.
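A rough sketch of what such a parent protocol could look like (the name and exact shape are assumptions, not a committed design):

// Hypothetical: only `self` is required to be differentiable; `Layer` could
// then refine this by additionally requiring `Input: Differentiable`.
public protocol Module: Differentiable {
    associatedtype Input
    associatedtype Output: Differentiable

    @differentiable(wrt: self)
    func callAsFunction(_ input: Input) -> Output
}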

@rxwei
Contributor Author

rxwei commented Jun 17, 2019

It just wouldn't compile as it relied on the Raw.gather(params:, atIndices:) function which requires a BinaryInteger for the second argument.

Hope we can merge #151 so that you can use gathering(atIndices:alongAxis:).
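Once that's available, a sketch of the embedding forward pass (illustrative only; the names and initializer here are not a final API) could look like:

struct Embedding<Scalar: TensorFlowFloatingPoint>: Layer {
    struct Input: Differentiable {
        @noDerivative var indices: Tensor<Int32>
    }

    // The trainable embedding matrix, one row per vocabulary entry.
    var weight: Tensor<Scalar>

    init(embeddings: Tensor<Scalar>) {
        weight = embeddings
    }

    @differentiable
    func callAsFunction(_ input: Input) -> Tensor<Scalar> {
        // Look up one row of `weight` per index.
        return weight.gathering(atIndices: input.indices, alongAxis: 0)
    }
}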

@jon-tow
Contributor

jon-tow commented Jun 17, 2019

Richard's advice resolved the compiler issues I had before regarding input types. Thanks for the suggestion @eaplatanios.
The only issue left seems to be differentiating gathering. I'll keep an eye out for that merge. Appreciate the help folks!

@bartchr808
Contributor

Hey @jon-tow! Actually, we were mistaken: #156 already added gathering(atIndices:alongAxis:), so you should have access to it! 😄

@jon-tow
Contributor

jon-tow commented Jun 17, 2019

@bartchr808 I had some tests passing and everything seemed okay. I was wondering what was going on! Thanks for letting me know :). I'll submit a PR sometime today.

@Shashi456
Contributor

@rxwei So I've been working on the Function layer we were talking about the other day:

public struct Function<InputScalar: TensorFlowFloatingPoint, OutputScalar: TensorFlowFloatingPoint>: Layer {
    public typealias Input = Tensor<InputScalar>
    public typealias Output = Tensor<OutputScalar>
    public typealias Body = @differentiable (Input) -> Output

    @noDerivative public let body: Body

    public init(body: @escaping Body) {
        self.body = body
    }

    @differentiable
    public func callAsFunction(_ input: Input) -> Output {
        return body(input)
    }
}

Does this look right? I run into an error that the type doesn't conform to the Layer protocol and that a call function is needed. As far as I understand, for a struct to conform to a protocol you need to define all of the protocol's requirements, somewhat like abstract classes. Any thoughts on where I might be going wrong?

@tanmayb123
Contributor

public struct Function<InputScalar: TensorFlowFloatingPoint, OutputScalar: TensorFlowFloatingPoint>: Layer {
    public typealias Input = Tensor<InputScalar>
    public typealias Output = Tensor<OutputScalar>
    public typealias Body = @differentiable (Input) -> Output

    @noDerivative public let body: Body

    public init(body: @escaping Body) {
        self.body = body
    }

    @differentiable
    public func callAsFunction(_ input: Input) -> Output {
        return body(input)
    }

    @differentiable
    public func call(_ input: Input) -> Output {
        return callAsFunction(input)
    }
}

That compiles for me.
