A few others:
Awesome to see a quantized model released. I’ll have to test the 4-bit version later… but what I’m really waiting for are smaller, quantized models of DeepSeek v3.
bullerwins/DeepSeek-V3-GGUF
Q3, Q4, Q5, Q8, BF16
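(Side note: if you just want one quant level from that repo, a minimal huggingface_hub sketch like this should do it. The `*Q4_K_M*` filename pattern is a guess on my part, so check the repo’s actual file names first.)

```python
# Minimal sketch: pull just one quant level from the repo with huggingface_hub.
# The "*Q4_K_M*" pattern is an assumption; match it to the repo's real file names.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="bullerwins/DeepSeek-V3-GGUF",
    allow_patterns=["*Q4_K_M*"],   # only the 4-bit split GGUF files
    local_dir="DeepSeek-V3-GGUF",
)
```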
Awesome, but damn, like 400GB even with 4-bit quantization. No way I’ll be running that, but still pretty cool. Might make the model more accessible and keep costs down.
And I hadn’t even looked in the last few days, but these all seem pretty recent.
Thanks.
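For a rough sanity check of that ~400GB figure: DeepSeek-V3 is 671B total parameters, so at roughly 4.5 effective bits per weight (4-bit weights plus scales/metadata overhead, my assumption for a Q4_K-style quant) you land right in that ballpark:

```python
# Back-of-envelope check of the ~400GB figure. Assumes DeepSeek-V3's
# published 671B total parameters and ~4.5 effective bits/weight for a
# Q4_K-style quant (4-bit weights plus scales/metadata).
total_params = 671e9
bits_per_weight = 4.5
size_gb = total_params * bits_per_weight / 8 / 1e9
print(f"~{size_gb:.0f} GB")  # ~377 GB, in line with the ~400GB quoted
```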
Yeah DeepSeek V3 is just too big haha
I wish they had a 32b/70b parameter version of it or something
Thanks for sharing Max! Looks awesome
> Yeah DeepSeek V3 is just too big haha
> I wish they had a 32b/70b parameter version of it or something
Same here.
The biggest problem, IMO, is that LLMs are trained on too many programming languages; there should be a deepseek-v3-typescript, deepseek-v3-python, …
I’m quite certain it would then be possible to get down to a 70b or even 32b parameter size.
Great opportunity for a game-changing fine-tuning project…
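If anyone does take a swing at it, a LoRA fine-tune on a single-language corpus is probably the cheapest way to prototype the idea. A hypothetical sketch with peft + transformers (the base model here is just a small stand-in, not a real per-language DeepSeek release, and the dataset/training loop are omitted):

```python
# Hypothetical sketch of a single-language fine-tune via LoRA with peft +
# transformers. Base model is a stand-in small code model; dataset loading
# and the training loop are left out.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = "deepseek-ai/deepseek-coder-1.3b-base"  # stand-in, swap for your target
model = AutoModelForCausalLM.from_pretrained(base)

# Attach low-rank adapters so only a tiny fraction of weights actually train.
lora = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # attention projections only
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% trainable
```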