[April 2025] Which model are you using?

ikt · 2 months ago

OpticalMoose@discuss.tchncs.de · 2 months ago

I mainly use Llama-3-8B abliterated for everyday questions, and DeepSeek-Coder-V2-Lite for programming/Linux stuff.

ikt · 1 month ago

Using DeepSeek-Coder-V2-Lite now, it’s awesome!

eurisko@lemmy.ca · 2 months ago

I find that for my projects (narrative building, tabletop RPG simulation), gemma3:14b (with low temperature) works perfectly for creating consistent psychological overviews.

Audalin@lemmy.world · 2 months ago

QwQ-32B for most questions, Llama-3.1-8B for agents. I’m looking for new models to replace them though, especially the agent one.

Want to test the new GLM models, but I’d rather wait for llama.cpp to definitely fix the bugs with them first.

weker01@sh.itjust.works · edit-2 2 months ago

GLM? I feel like every other day there is a new abbreviation :(

ikt · edit-2 1 month ago

Want to test the new GLM models

Which models are you referring to? These: https://github.com/THUDM/GLM-4 ?

Audalin@lemmy.world · 1 month ago

Those are the ones, the 0414 release.

SmokeyDope@lemmy.world · edit-2 2 months ago

I have been using DeepHermes daily. I think CoT reasoning is so awesome and such a game changer! It really helps the model give better answers, especially for hard logical problems. But I don’t want it all the time, especially on an already slow model. Being able to turn it on and off without switching models is awesome. Mistral 24B DeepHermes is relatively uncensored, powerful, and not painfully slow on my hardware. A high quant of Llama 3.1 8B DeepHermes fits entirely in my 8 GB of VRAM.

weker01@sh.itjust.works · 2 months ago

Fallen Gemma. The writing style is really good and it can keep relatively persistent personalities. On the other hand it’s stupid af compared to other recent models and even the vanilla Gemma 3.