Switzerland government release full FOSS LLM under Apache 2.0, argue for AI as Public Utility

Cooper8@feddit.online · 2 months ago

Switzerland government release full FOSS LLM under Apache 2.0, argue for AI as Public Utility

☂️-@lemmy.ml · edit-2 1 month ago

deleted by creator

CookieOfFortune@lemmy.world · 2 months ago

Yes. Although they don’t host the dataset binaries.

E_coli42@lemmy.world · 2 months ago

Is this hosted somewhere? Maybe distributed? I would love a privacy respecting distributed LLM chatbot.

xcjs@programming.dev · edit-2 2 months ago

In case you’re not aware, there are a decent number of open weight (and some open source) large language models.

The Ollama project makes it very approachable to download and use these models.

Xylight‮@lemdro.id · edit-2 2 months ago

Ollama has taken a bad turn lately (such is the nature of VC backed software). Maybe recommend ~~kobold.cpp~~ jan.ai for LLM noobs instead

xcjs@programming.dev · 2 months ago

I’m keeping an eye on Ollama’s service offerings - I don’t think they’re in enshittification territory yet, but I definitely share the concern.

I still don’t believe the other LLM engines out there have reached an equivalent ease of use compared to Ollama, and I still recommend it for now. If nothing else, it can be a stepping stone to other solutions for some.

lepinkainen@lemmy.world · 2 months ago

Or just llama.cop they finally got an UI added

Xylight‮@lemdro.id · 2 months ago

That’s what I use and also the backend of the aforementioned software, but it’s still complicated for people to set up.

I should also mention Jan, it makes things super easy and it also has a very nice GUI

xcjs@programming.dev · 2 months ago

Jan is another great recommendation!

mudkip@lemdro.id · 2 months ago

there is nothing wrong with ollama it runs models fast and easy add a gguf and youre done unless you want to squeeze out extra performance and have time to figure out your exact flags then use llama cpp otherwise ollama just works for 99 percent of people

Xylight‮@lemdro.id · 2 months ago

4.5/10 bait

mudkip@lemdro.id · 2 months ago

if you send me a video of you completing the bussin level on geometry dash ill send you 10$

Xylight‮@lemdro.id · 2 months ago

handcam necessary or just screen recording w clicks

mudkip@lemdro.id · 2 months ago

screen recording 110715909

PandaInSpace@kbin.earth · 2 months ago

Other than Apertus, are there any truly open source models - mainly what I want to know is models that list their training data publicly to ensure no theft of art and stuff. (i replied to your comment as you seem to know about these models, I have no clue abou this stuff)

xcjs@programming.dev · edit-2 2 months ago

Deepseek R1 and OpenThinker are two more examples. There’s also SmolLM, which I believe also open sources its training data and ensures proper licensing for it.

E_coli42@lemmy.world · 14 days ago

I tried Deepseek R1 7B on my MacBook M3 Pro but it is shit compared to ChatGPT unfortunately

xcjs@programming.dev · edit-2 14 days ago

There are some factors to consider. Some of the Deepseek quants are based on Llama 3, whereas others are based on Qwen Reasoning.

You’re also not going to get the same quality of the full ChatGPT experience comparing a 7B parameter model to a 500B+ model like ChatGPT.

Regardless, it’s difficult to run the actual Deepseek R1 model as there’s not a true quantization or distillation of the original model.

You can also try GPT-OSS if you want an open source model comparable to ChatGPT. Once again, you’re going to have to balance the size and precision of the model with your expectations.

Cooper8@feddit.online · 2 months ago

Links in the article. Hugging Face and Swiss Telecoms host

StrixUralensis@tarte.nuage-libre.fr · 2 months ago

ABetterTomorrow@sh.itjust.works · 2 months ago

I can’t find any hardware requirements for this. What will it take to run this smoothly?

IngeniousRocks (They/She) @lemmy.dbzer0.com · edit-2 2 months ago

8b parameter models are relatively fast on 3rd gen RTX hardware with at least 8gigs of vram, CPU inferencing is slower and requires boatloads of ram but is doable on older hardware. These really aren’t designed to run on consumer hardware, but the 8b model should do fine on relatively powerful consumer hardware.

If you have something that would’ve been a high end gaming rig 4 years ago, you’re good.

If you wanna be more specific, check huggingface, they have charts. If you’re using linux with nvidia hardware you’ll be better off doing CPU inferencing.

Edit: Omg y’all I didn’t think I needed to include my sources but this is quite literally a huge issue on nvidia. Nvidia works fine on linux but you’re limited to whatever VRAM is on your video card, no RAM sharing. Y’all can disagree all you want but those are the facts. Thays why AMD and CPU inferencing are more reliable, and allow for higher context limits. They are not faster though.

Sources for nvidia stuff https://github.com/NVIDIA/open-gpu-kernel-modules/discussions/618

https://forums.developer.nvidia.com/t/shared-vram-on-linux-super-huge-problem/336867/

https://github.com/NVIDIA/open-gpu-kernel-modules/issues/758

https://forums.opensuse.org/t/is-anyone-getting-vram-backed-by-system-memory-with-nvidia-drivers/185902

ABetterTomorrow@sh.itjust.works · 2 months ago

Thanks for the reply. Never been on the HF site and doing it on mobile of the first time I seem lost. I couldn’t find it but I’m sure I will.

Jakeroxs@sh.itjust.works · 2 months ago

Disagree on Linux nvidia support, it works fine

IngeniousRocks (They/She) @lemmy.dbzer0.com · 2 months ago

deleted by creator

General_Effort@lemmy.world · 2 months ago

For fastest inference, you want to fit the entire model in VRAM. Plus, you need a few GB extra for context.

Context means the text (+images, etc) it works on. That’s the chat log, in the case of a chatbot, plus any texts you might want summarized/translated/ask questions about.

Models can be quantized, which is a kind of lossy compression. They get smaller but also dumber. As with JPGs, the quality loss is insignificant at first and absolutely worth it.

Inference can be split between GPU and CPU, substituting VRAM with normal RAM. Makes it slower, but you’ll probably will still feel that it’s smooth.

Basically, it’s all trade-offs between quality, context size, and speed.

frongt@lemmy.zip · 2 months ago

Apertus was developed with due consideration to Swiss data protection laws, Swiss copyright laws, and the transparency obligations under the EU AI Act. Particular attention has been paid to data integrity and ethical standards: the training corpus builds only on data which is publicly available. It is filtered to respect machine-readable opt-out requests from websites, even retroactively, and to remove personal data, and other undesired content before training begins.

Available doesn’t mean licensed for AI training.

schnurrito@discuss.tchncs.de · 2 months ago

and yet it is still a legally unsettled question whether LLM training requires a copyright license at all; and it is my opinion that no one should want that to be the case, why would people on the Internet want to argue for an expansion of copyright law?

frongt@lemmy.zip · 2 months ago

Why would it be an expansion? If you’re using someone else’s work, why wouldn’t you need a license? If I write a book and publish it under CC-BY-NC, should Google be allowed to take my work for their commercial product without compensation or even attribution? Should Microsoft be allowed to create closed-source commercial Copilot off GPL source code?

schnurrito@discuss.tchncs.de · 2 months ago

It’s an expansion to say that LLM training constitutes a derivative work. You are of course entitled to your opinion that it should be the case; all I can say to that is that in the 2000s and 2010s nearly everyone on the Internet tended to argue for more limitations, not further expansions, of copyright law, and I wonder what happened to that attitude.

frongt@lemmy.zip · edit-2 2 months ago

Well, this being the open source community, I would expect most people here to be on the side of respecting the rights of content creators. Like I said, if I write some GPL software, I don’t think Microsoft should be able to disrespect my license just because they’re also disrespecting everyone else’s license too through automation at scale.

Edit: forgot to mention, since their product is wholly dependent on the other works, that’s the very definition of a derivative work. While you could argue it’s transformative, it certainly fails the other tests for fair use.

General_Effort@lemmy.world · 2 months ago

I find it very unexpected. It used to be understood that IP laws favor monopolies. EG I don’t remember the OS community being on the side of Oracle in https://en.wikipedia.org/wiki/Google_LLC_v._Oracle_America,_Inc.

Maybe it just passed me by.

No1@aussie.zone · edit-2 2 months ago

deleted by creator

NeatNit@discuss.tchncs.de · 2 months ago

Obligatory nitpick: open weights ≠ open source. For it to be open source, they need to release the training data as well as all the parameters they used in training it.

Cooper8@feddit.online · 2 months ago

Please read the article before commenting.

“The model is named Apertus – Latin for “open” – highlighting its distinctive feature: the entire development process, including its architecture, model weights, and training data and recipes, is openly accessible and fully documented.”

NeatNit@discuss.tchncs.de · 2 months ago

Thanks… I have downvoted my own comment in shame. Godspeed!

Cooper8@feddit.online · 2 months ago

a gentleperson and a scholar

verdi@feddit.org · 2 months ago

Props to the humility!

hoshikarakitaridia@lemmy.world · 2 months ago

Madlad

illusionist@lemmy.zip · 2 months ago

This https://github.com/swiss-ai/pretrain-data ?

Switzerland government release full FOSS LLM under Apache 2.0, argue for AI as Public Utility

Switzerland government release full FOSS LLM under Apache 2.0, argue for AI as Public Utility

Apertus: a fully open, transparent, multilingual language model