this post was submitted on 17 Dec 2024
214 points (97.8% liked)

Technology

[–] [email protected] 12 points 6 months ago (9 children)

I just want one to self-host a 70B LLM, for fuck's sake. I don't want to be forced to take out a god damned mortgage/personal loan to buy one.

[–] [email protected] 6 points 6 months ago (1 children)

I picked up a pair of old Tesla P40s. Right now I'm running a Q4 quant of Qwen 2.5 72B that fits in the combined 48GB of VRAM with 12k context. They aren't as fast as newer consumer cards, but it generates as fast as I can read while costing less than a used 3080.
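
As a rough sanity check on why that fits, here is a back-of-the-envelope sketch. Everything numeric in it is an assumption rather than something stated above: roughly 72.7B parameters, about 4.5 bits per weight for a Q4 quant (actual Q4 variants differ), 80 layers with 8 KV heads of dimension 128, an fp16 KV cache, and "12k" taken to mean 12 × 1024 tokens. The helper name `estimate_vram_gib` is made up for illustration, and real runtimes add compute buffers on top of these figures.

```python
GIB = 2**30  # bytes per GiB


def estimate_vram_gib(n_params, bits_per_weight, n_layers, n_kv_heads,
                      head_dim, context_tokens, kv_bytes_per_value=2):
    """Return (weights_gib, kv_cache_gib) for a dense transformer.

    Rough estimate only: ignores activations, scratch buffers, and
    per-tensor quantization overhead.
    """
    # Quantized weights: parameters * bits per weight, converted to bytes.
    weights_bytes = n_params * bits_per_weight / 8
    # KV cache: one K and one V vector per layer, per KV head, per token.
    kv_bytes = (2 * n_layers * n_kv_heads * head_dim
                * context_tokens * kv_bytes_per_value)
    return weights_bytes / GIB, kv_bytes / GIB


# Assumed figures for a 72B-class model at Q4 with a 12k context.
weights_gib, kv_gib = estimate_vram_gib(
    n_params=72.7e9,         # assumed parameter count
    bits_per_weight=4.5,     # assumed average for a Q4 quant
    n_layers=80,             # assumed architecture values
    n_kv_heads=8,
    head_dim=128,
    context_tokens=12 * 1024,
)
total_gib = weights_gib + kv_gib
print(f"weights ~{weights_gib:.1f} GiB, KV cache ~{kv_gib:.2f} GiB, "
      f"total ~{total_gib:.1f} GiB")
```

Under those assumptions the total lands a few GiB under 48, which is consistent with the context being capped around 12k rather than something much larger.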

[–] [email protected] 2 points 6 months ago (1 children)

Interesting. They are cooled passively, right? What's your case and cooling setup?

[–] [email protected] 2 points 6 months ago

I have a Dell PowerEdge 730, which was about $200. Its CPU shrouds perfectly match the GPU intakes, so air just flows through both from the server fans. I've seen a few 3D-printable fan mounts for jury-rigging them into a regular tower too.
