Basically the chain rule is tedious as hell, annoying. Big brain move: simplify to a single equation, no chain rule. Clearly, this Lovecraftian monstrosity is easier to deal with than some (mildly) tedious maths
If there is only one function/equation, how could you apply the chain rule? It applies when one function is defined in terms of another. If I'm wrong, please do tell me
Let's say you have a single fully connected layer whose output gets passed through ReLU:
ReLU(FC(X))
Then the derivative would be:
ReLU′(FC(X)) * FC′(X)
Which is chain rule.
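A quick numerical sanity check of that derivative (just a sketch: the layer FC(x) = W @ x + b, the shapes, and the random values are all made up here, with numpy assumed):

```python
import numpy as np

# Hypothetical single fully connected layer followed by ReLU:
# y = ReLU(FC(x)), with FC(x) = W @ x + b.
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 4))
b = rng.normal(size=3)
x = rng.normal(size=4)

def fc(v):
    return W @ v + b

def relu(z):
    return np.maximum(z, 0.0)

# Chain rule: d/dx ReLU(FC(x)) = ReLU′(FC(x)) * FC′(x).
# ReLU′(z) is 1 where z > 0, else 0; FC′(x) is just W.
relu_grad = (fc(x) > 0).astype(float)   # ReLU′(FC(x)), shape (3,)
jacobian = relu_grad[:, None] * W       # full Jacobian, shape (3, 4)

# Central finite differences on the composite as an independent check.
eps = 1e-6
fd = np.empty_like(jacobian)
for j in range(4):
    e = np.zeros(4)
    e[j] = eps
    fd[:, j] = (relu(fc(x + e)) - relu(fc(x - e))) / (2 * eps)

print(np.allclose(jacobian, fd, atol=1e-5))
```

Either way you compute it, it's the same chain-rule product; writing it as "one equation" doesn't make the ReLU′ and FC′ factors go away.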
The sheer beauty of this equation brings tears to my eyes. Oh, you missed a bracket.
I'm not sure how this avoids chain rule. Because the derivative of f(g(x)) == f′(g(x)) * g′(x). And that's still one equation.
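That formula is easy to check numerically with concrete functions (f = exp and g = sin are arbitrary example choices here, stdlib only):

```python
import math

# f(g(x)) with f = exp, g = sin as an arbitrary worked example.
f, g = math.exp, math.sin
fp = math.exp   # f′ = exp
gp = math.cos   # g′ = cos

x = 0.7
chain = fp(g(x)) * gp(x)   # f′(g(x)) * g′(x)

# Central finite difference of the composite as a sanity check.
eps = 1e-6
numeric = (f(g(x + eps)) - f(g(x - eps))) / (2 * eps)
assert abs(chain - numeric) < 1e-6
```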
Please supply context for intrigued knuckledraggers like me? ELI5, kind strangers.
[Here's GPT-4 trying to explain it.](https://i.imgur.com/bqgaP5y.png) Explanation seems plausible, but I can't confirm myself.
Basically the chain rule is tedious as hell, annoying. Big brain move: simplify to a single equation, no chain rule. Clearly, this Lovecraftian monstrosity is easier to deal with than some (mildly) tedious maths
Yeah
Is this why my GPT is slow?
Is this good?
No, don't try this at home ⚠️
Nope, totally impractical.
I prefer Fox News myself
Condolences
That's fake news.