I don't think any kind of "poisoning" actually works. It's well known by now that data quality is more important than data quantity, so nobody just feeds training data in indiscriminately. At best it would hamper some FOSS AI researchers that don't have the resources to curate a dataset.
lily33
There are already other providers like Deepinfra offering DeepSeek. So while the the average person (like me) couldn't run it themselves, they do have alternative options.
A server grade CPU with a lot of RAM and memory bandwidth would work reasonable well, and cost "only" ~$10k rather than 100k+...
To be fair, most people can't actually self-host Deepseek, but there already are other providers offering API access to it.
Yes, OpenAI wishes everyone else has to have authorization to do model training...
Fortunately, their ToS don't matter all that much, it's easy to use their model through a third party without ever touching them.
The point of it being open is that people can remove any censorship built into it.
The particular AI model this article is talking about is actually openly published for anyone to freely use or modify (fine-tune). There is a barrier in that it requires several hundred gigs of RAM to run, but it is public.
It's almost sure to be the case, but nobody has managed to prove it yet.
Simply being infinite and non-repeating doesn't guarantee that all finite sequences will appear. For example, you could have an infinite non-repeating number that doesn't have any 9s in it. But, as far as numbers go, exceptions like that are very rare, and in almost all (infinite, non-repeating) numbers you'll have all finite sequences appearing.
Now, if only the article explained how that killing was related to TikTok. The only relevant thing I saw was,
had its roots in a confrontation on social media.
It's says "social media", not "TokTok" though.
I'm confused, isn't Fedora atomic immutable? Shouldn't that make it stateless automatically?
https://openrouter.ai/deepseek/deepseek-r1 - offers multiple providers, so at least someone will be up (though note that most are more expensive than Deepseek themselves).