My Ever-Changing Homelab

My Ever-Changing Homelab
from pech@lemmy.world to homelab@lemmy.ml on 24 May 2026 13:44
https://lemmy.world/post/47291115

3588 3581

I consolidated my setup a bit. This is my local LLM hosting server. I took a gamble on a Chinese NVLink SXM2 mezzanine board from ebay, it was surprisingly plug and play for my dual 16GB v100s 😂. I’m also running a Tesla P40 that I repasted with liquid metal that sits at 24C idle 🥶.

#homelab

threaded - newest

SomeDudeFromSpace@lemmy.ml on 24 May 2026 13:51 next collapse

That’s great! But I’m even more curious about the green device in the background of the first pic 👀 What’s that?

pech@lemmy.world on 24 May 2026 14:47 collapse

It’s an older version of a heatset insert fixture. I used it to put threaded brass inserts into my 3d prints instead of trying to capture a nut or screwing directly into plastic.

printables.com/…/609644-stealth-press-1-heat-set-…

SomeDudeFromSpace@lemmy.ml on 28 May 2026 16:17 collapse

That’s interesting. Thanks!

judgyweevil@feddit.it on 24 May 2026 14:01 next collapse

From the first photo I thought this was an earthquake simulator with models of skyscrapers

pech@lemmy.world on 24 May 2026 17:11 collapse

😂 I can see how it looks that way lol

digdilem@lemmy.ml on 24 May 2026 14:58 next collapse

Nice, dude. Love anyone who learns by doing.

pech@lemmy.world on 24 May 2026 17:10 collapse

It’s been a fascinating experience so far and I feel like I’m only touching the surface. I’m exploring some of the various memory harnesses to hook into Hermes Agent and see how it learns with extended use.

whatiswrongwithyou@lemmy.ml on 24 May 2026 16:57 next collapse

What kind of model and space limitations are you under with v100s?

Some of the most interesting computer music I’ve heard in years was composed on them but idk if it’s worth getting into a whole new generation of hardware if it can only really do that.

pech@lemmy.world on 24 May 2026 17:09 collapse

Currently I’m running a Q6K quant of Hermes 4 14B with a 32K context window via llama.cpp that works pretty well. Generation output is a comfy ~50tok/sec. These v100s are 16GB each, but there are 32GB versions available too.

I’m running everything via NixOS and have to do package overrides to get inference engines to build with the right CUDA versions.

My goal is to get a cohesive environment set up for Hermes Agent to learn my system/lab/network and help my grow it over time.

Overall, I’m happy with them. The mezzanine board is good quality, I’m using PTM sheets under those massive heatsinks and some arctic p9 fans to keep them at around 60C under load.

buildmylab@lemmus.org on 01 Jun 2026 00:51 collapse

This looks amazing. Did you make it with a CNC machine? Are you planning to add an enclosure? I think it would look even better with one.

pech@lemmy.world on 01 Jun 2026 01:48 collapse

Thank you! I actually 3D printed these. I couldn’t get my qidi ifast to play nicely with my generic ABS filament, so I sliced them in half and printed them in PLA on my prusa mini. They’re joined in the middle with a keyed lock on the top and bottom and 8x 12mm M3 bolts