Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Dr. Jeff Beck

๐Ÿ‘ค Speaker
455 total appearances

Appearances Over Time

Podcast Appearances

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

The only thing I don't like about test time training is the vast majority of the training that is done.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

So in a traditional energy-based model, you always find the minimum with respect to the latent variables, these extra weights, which in the case of test time training is the subset of weights that you're allowed to change during test time.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

When you do the training for a traditional energy-based model,

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

You're allowed to make those changes throughout the entire course of training.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

The way that we're often doing test time training these days is we just do regular old neural network learning.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

And then finally, when we get to the deployment phase, then we suddenly turn on these additional latents, which are basically some of the weights of the network, and we do an additional bit of learning at that point.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

Now, again, not an expert here, but this seems unwise to me.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

And the reason it seems unwise is because you didn't train the original network with that on.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

You trained it in a completely supervised way.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

Now, I'm sure that people are aware of this and it's been addressed in the literature, but I'm not personally aware of that.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

I don't think that's how it's used in practice.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

My take on it is that an energy-based model and a Bayesian model have a lot in common, right?

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

In many ways, like energy, I mean, well, literally in physics, right, energy is log probability, right?

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

Now, of course, there's the normalization factor that you don't need to worry about if you're just minimizing energy.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

In a Bayesian framework, that's like saying, well, I'm not actually going to treat some of these latent variables in a probabilistic way.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

I'm just going to do maximum or map estimation on some of my variables and just be okay with that.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

That's one way to interpret the relationship between an energy-based model and a properly Bayesian model.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

There's a happy medium here, though, right?

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

And the happy medium is you can still treat it as if it's, you know, you don't have to just minimize the energy function.

Machine Learning Street Talk (MLST)
VAEs Are Energy-Based Models? [Dr. Jeff Beck]

You can calculate the curvature down there, too, do a Laplace approximation and call yourself a Bayesian again, right?