Ray Fernando
👤 PersonAppearances Over Time
Podcast Appearances
And that's the key that you want to put in there. Similar to Grok Cloud, you just go ahead and hit create API key. So once you go to console.grok.com, There's an API key section here. And then you'll want to hit create API key. And that'll pop up a dialog with those API keys. And so that endpoint will look something like this over here.
And that's the key that you want to put in there. Similar to Grok Cloud, you just go ahead and hit create API key. So once you go to console.grok.com, There's an API key section here. And then you'll want to hit create API key. And that'll pop up a dialog with those API keys. And so that endpoint will look something like this over here.
So that'll be, if we hit configure, api.grok.com slash openai slash v1. And then you put your key in there. And you don't have to do anything with these IDs. These will be pooled directly from that endpoint. So whatever models you have available will be there. And so now when you hit the plus sign, you'll see like this nice list of models from fireworks. So there'll be the fireworks one.
So that'll be, if we hit configure, api.grok.com slash openai slash v1. And then you put your key in there. And you don't have to do anything with these IDs. These will be pooled directly from that endpoint. So whatever models you have available will be there. And so now when you hit the plus sign, you'll see like this nice list of models from fireworks. So there'll be the fireworks one.
So that'll be, if we hit configure, api.grok.com slash openai slash v1. And then you put your key in there. And you don't have to do anything with these IDs. These will be pooled directly from that endpoint. So whatever models you have available will be there. And so now when you hit the plus sign, you'll see like this nice list of models from fireworks. So there'll be the fireworks one.
So account slash fireworks. You can play with any one of those. And then the other ones that are just with the normal name are from Grok. So they have those as available for there. So you can you can play with a lot of these models, which is nice and compare them. And then the ones at the bottom are the ones from Olamo.
So account slash fireworks. You can play with any one of those. And then the other ones that are just with the normal name are from Grok. So they have those as available for there. So you can you can play with a lot of these models, which is nice and compare them. And then the ones at the bottom are the ones from Olamo.
So account slash fireworks. You can play with any one of those. And then the other ones that are just with the normal name are from Grok. So they have those as available for there. So you can you can play with a lot of these models, which is nice and compare them. And then the ones at the bottom are the ones from Olamo.
And it's a little show like, you know, the colon latest is kind of how you can tell. And if you hover over them, you'll see like some additional information over the parameter count, what quantization level it is. So Q4 means it's quantized to four bits. And that also has a play in its intelligence. Obviously, the higher level of quantization, you know, means more memory. So it's like 32 bit.
And it's a little show like, you know, the colon latest is kind of how you can tell. And if you hover over them, you'll see like some additional information over the parameter count, what quantization level it is. So Q4 means it's quantized to four bits. And that also has a play in its intelligence. Obviously, the higher level of quantization, you know, means more memory. So it's like 32 bit.
And it's a little show like, you know, the colon latest is kind of how you can tell. And if you hover over them, you'll see like some additional information over the parameter count, what quantization level it is. So Q4 means it's quantized to four bits. And that also has a play in its intelligence. Obviously, the higher level of quantization, you know, means more memory. So it's like 32 bit.
16, uh, all the way down. Um, so the, like the, the lower the number, the like not less intelligence, but you may not get the output that you want is expected. So that's kind of part of that process. It's a lot of different things here, but I think, uh, the most important thing is just, um, yeah. How, how do you host this locally, how to start playing around with it?
16, uh, all the way down. Um, so the, like the, the lower the number, the like not less intelligence, but you may not get the output that you want is expected. So that's kind of part of that process. It's a lot of different things here, but I think, uh, the most important thing is just, um, yeah. How, how do you host this locally, how to start playing around with it?
16, uh, all the way down. Um, so the, like the, the lower the number, the like not less intelligence, but you may not get the output that you want is expected. So that's kind of part of that process. It's a lot of different things here, but I think, uh, the most important thing is just, um, yeah. How, how do you host this locally, how to start playing around with it?
Um, and that's kind of like a really good primer to get started for doing these models and stuff. Yeah.
Um, and that's kind of like a really good primer to get started for doing these models and stuff. Yeah.
Um, and that's kind of like a really good primer to get started for doing these models and stuff. Yeah.
Yeah. There is an app called Apollo. Have you heard of that? Apollo.
Yeah. There is an app called Apollo. Have you heard of that? Apollo.
Yeah. There is an app called Apollo. Have you heard of that? Apollo.