jwitthuhn's comments

Love the concept of BSOL, might give that a try.

The difference is that SSPL is a license that was written in bad faith with the explicit intent that people can't comply with the parts about running a hosted service.

https://www.mongodb.com/legal/licensing/server-side-public-l...

See section 13, "Offering the Program as a Service". To comply with that you need to release all software used "to make the Program or modified version available as a service" under the SSPL.

For something like Redis this includes: Redis itself, the OS you are using to host Redis, the drivers and firmware for the hardware you are using to host Redis, and more. Also your whole deployment stack, up to and including the mouse driver you use to click the "deploy" button.

It is an absurd condition that effectively makes section 13 say "you can't offer a hosted version", and the OSI and FSF are right to reject that fig leaf of "it is like the AGPL, but more".


It sort of can, but all non-Adobe software I know of, even commercial stuff like Affinity Photo, has spotty support for some PSD features.

Basically, any given PSD will certainly load correctly in Photoshop, but you're rolling the dice if you want to load it into anything else. More so if you are using newer features.


I've made a tiny ~1M-parameter model that generates random Magic: The Gathering cards. It's largely based on Karpathy's nanoGPT with a few more features added on top.

I don't have a pre-trained model to share, but you can train one yourself from the git repo, assuming you have an Apple Silicon Mac.

https://github.com/jlwitthuhn/TCGGPT
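
For a sense of scale, here's a rough parameter count for a nanoGPT-style config in that ballpark (these hyperparameters are purely illustrative, not the actual ones from the repo):

    def gpt_param_count(n_layer: int, n_embd: int, vocab_size: int) -> int:
        """Approximate parameter count for a GPT-style decoder with tied embeddings."""
        per_block = 12 * n_embd ** 2     # attention (~4*d^2) + MLP (~8*d^2), ignoring biases/norms
        embedding = vocab_size * n_embd  # token embedding, shared with the output head
        return n_layer * per_block + embedding

    # A small char-level config lands around 1M parameters
    print(gpt_param_count(n_layer=4, n_embd=128, vocab_size=512))  # ~850k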


In general you can just use the parameter count to figure that out.

A 70B model at 8 bits per parameter would mean 70 GB, 4 bits is 35 GB, etc. But that is just the raw weights; you also need some RAM for the data passing through the model, and the OS eats up some, so add about a 10-15% buffer on top of that to make sure you're good.

Also, the quality falls off pretty quickly once you start quantizing below 4 bits, so be careful with that, but at 3-bit a 70B model should run fine in 32GB of RAM.
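
As a rough sketch of that math (the 15% overhead is just the rule-of-thumb buffer from above, not an exact figure):

    def estimate_ram_gb(params_billions: float, bits_per_param: float, overhead: float = 0.15) -> float:
        """Rough RAM estimate: raw weights plus a buffer for activations and the OS."""
        weights_gb = params_billions * bits_per_param / 8  # e.g. 70B at 4-bit -> 35 GB
        return weights_gb * (1 + overhead)

    for bits in (8, 4, 3):
        print(f"70B at {bits}-bit: ~{estimate_ram_gb(70, bits):.0f} GB")
    # 70B at 8-bit: ~80 GB, 4-bit: ~40 GB, 3-bit: ~30 GB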


Does 70B mean there are 70 billion weights and biases in the model?


For me it is consistency. I control the model and the software so I know a local LLM will remain exactly the same until I want to change it.

It also avoids the trouble of a hosted LLM deciding to double its price overnight; costs are very predictable.


For anyone else looking for the weights, which as far as I can tell are not linked in the article:

Base model: https://huggingface.co/Zyphra/Zamba2-7B

Instruct tuned: https://huggingface.co/Zyphra/Zamba2-7B-Instruct


I couldn't find any GGUF files yet. Looking forward to trying it out when they're available.


It seems that Zamba 2 isn't supported yet; the previous model's issue is here:

Feature Request: Support Zyphra/Zamba2-2.7B #8795

https://github.com/ggerganov/llama.cpp/issues/8795


What can be used to run it? I had imagined Mamba-based models need different inference code/software than other models.


If you look in the `config.json`[1] it shows `Zamba2ForCausalLM`. You can do inference with a version of the transformers library that supports that architecture.

The model card states that you have to use their fork of transformers.[2]

1. https://huggingface.co/Zyphra/Zamba2-7B-Instruct/blob/main/c...

2. https://huggingface.co/Zyphra/Zamba2-7B-Instruct#prerequisit...
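
A minimal sketch of what that looks like, assuming their fork exposes the standard Auto classes (the prompt and generation settings here are just illustrative; check the model card for the exact install command and usage):

    # Assumes Zyphra's transformers fork is installed per the model card instructions.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "Zyphra/Zamba2-7B-Instruct"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="auto",          # requires the accelerate package
        torch_dtype=torch.bfloat16,
    )

    inputs = tokenizer("Write a haiku about state space models.", return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=100)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))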


To run GGUF files? LM Studio, for one. I think Recurse on macOS as well, and probably some others.


As another commenter said, this has no GGUF because it's partially Mamba-based, which is unsupported in llama.cpp.


Dev of https://recurse.chat/ here, thanks for mentioning! Right now we're focusing on features like shortcuts/floating windows, but we'll look into supporting this in time. To add to the llama.cpp support discussion, it's also worth noting that llama.cpp does not yet support GPU inference for Mamba models: https://github.com/ggerganov/llama.cpp/issues/6758


GPT4All is a good and easy way to run GGUF models.


Mamba-based stuff tends to take longer to become available.


Looking forward to them taking similar action against WordPress.com for upselling access to plugins, a core WordPress feature.


I'm no trademark lawyer, but isn't offering "WordPress" hosting fine as long as you are genuinely using the WordPress software? As I understand it, that is purely nominative use.


I see that sentiment largely coming from developers who, I think, misunderstand the freedom that the GPL is protecting.

The GPL focuses on the user's freedom to modify any software they are using to better suit their own needs, and it does a great job of it.

The people saying that it is less free than BSD/MIT/Apache are looking at it from a developer's perspective. The GPL does deliberately limit a developer's freedom to include GPL code in a proprietary product, because that would restrict the user's freedom to modify the code.

