What’s going on on your servers?

I had to bite the bullet and buy new drives after the old ones filled up. I went for used enterprise SSDs on eBay and eventually found some that had an okay price, even though it’s been much more than last time I got some. Combined with Hetzner’s hefty price increase some month ago, my hobby has become a bit more expensive again thanks to the ever growing appetite of companies building more data centers to churn more energy.

Anyways, the drives are in, my Ansible playbook to properly encrypt them and make them available in Proxmox worked, so that was smooth (ignoring the part where I disassembled the Lenovo tiny from the rack, open it, SSD out, SSD in, close it and put it back in only to realize I put in the old ssd again).

Any changes in your hardware setups? Did the price increase make you reconsider some design decisions? Let us know!

  • myrmidex@belgae.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    16 minutes ago

    Only upgrade this week is LED lights for my rack. The rails I ordered for one of my cases did not come through, some mistake during order picking it seems, so it’s only the LEDs this weekend. Installing the lights was easier than reporting the issue through Megekko. Doubt I’ll order from there again in the future.

  • TheRagingGeek@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    8 hours ago

    so this week I was getting ready for my workday when my Son tells me CraftyController is inaccessible, so I tried to SSH into the box that the service is pinned to… nada, dead. tried to power cycle it, nada.

    now this node was a B450M-A mobo Ryzen 7 2700X platform with some hodgepodge scrap RAM I’ve had running in it(RAM birthday was 2019). I hooked it up to a mini monitor and a keyboard, but it didn’t post at all, so just a blue screen of no signal. unfortunately the B450M-A mobo didn’t feature POST debug lights, nor did it use QLED, it apparently relied on PC Speaker, and my machine wasn’t telling any tales. so since I had no real idea as to the root cause and after reseating the RAM and the GPU and fiddling with it got me nowhere, I got my partner to approve the outspend for replacement of the motherboard so that I could have actual Debug indicators.

    Thursday the ROG B550-F Gaming WIFI II mobo arrived, as did the Ryzen 9 5900XT and the Nautilus 360RS cooler. I spent the evening assembling the mobo and CPU and the GPU, the RAM, and all the related wiring. figured I would do the Cooler the next day. Yesterday I got the cooler in place with some serious hardware acrobatics. I then fired it up and Yellow LED. DRAM issue, so I unseated all of the RAM, plugging in one of the hodgepodge sets(I had 4x8GB ram sticks) neither set worked, went to just trying a single stick. of the 4 sticks only 1 was able to get past the Yellow LED and into completed POST.

    So the RAM was shot and I’m not going to run containers on a machine with only 8 GB of ram. so I ordered up some Vengeance LPX 2x16G sticks and they arrived this morning! I just finished slotting them and then wrestling with Gentoo’s understanding of where all the hardware was. it was a lot of fiddling with the gentoo kernel config, and installing the nvidia drivers, but after all of that was done, the system booted up successfully! I’ve now got it back in its residence connected up to the UPS power, about to shunt docker containers back to the newly improved machine with 2x the CPU capacity.

    Was a wild ride, but the cool part of it was when the system shat itself it was part of a 3 node Docker Swarm and I had recently migrated to a NAS for persistence of my container data. though the other 2 nodes aren’t as overbuilt as this thing, so I did have to do some memory wrangling and disabling my lower priority services in order to restore service, but I was able to ensure all necessary services were able to run during the outage, and I got some learning in regards to a couple of the services that didn’t port as cleanly as I would’ve liked. all in all fun times in system administration! lol.

  • ClydapusGotwald@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 hours ago

    Just updated my cpu to a epyc 7852 and installed an intel 310 eco. Also just cried and bought 64gb of ddr4 ram to add to it.

  • realitaetsverlust@piefed.zip
    link
    fedilink
    English
    arrow-up
    5
    ·
    8 hours ago

    Another week of not updating my proxmox from proxmox 7 which is outdated for like 2 years now because I just can’t be bothered tbh.

    Before ya’ll freak out, it’s isolated and only two containers are accessible from outside, behind a proxy. So no need to panic.

  • ampersandrew@lemmy.world
    link
    fedilink
    English
    arrow-up
    18
    ·
    edit-2
    5 hours ago

    I’ve crossed that threshold in Dunning-Kruger where I see how much I don’t know, and it’s simultaneously disheartening and stressful. But hell, what am I going to do now? Quit?

    I’m trying to properly learn VLANs and set them up so that I’ve got “self-hosted services exposed to the internet” and “everything else”. So far, the only thing I need to isolate is a NAS with Jellyfin and Komga, but I plan to add more services via a mini PC later. The thing that has made this whole journey frustrating is that every time I try to learn something, even laser targeted, I don’t get the full answer from the first thing I find, and the next answer I find introduces more complexity. I think what I need is a managed switch from my local Micro Center like a Netgear GS108Tv3, to replace the switch currently in my office. Then, if I understand correctly, I think I need to put the NAS (and eventually mini PC) on their own subnet and use VLAN rules to allow traffic to that subnet but not from that subnet to the rest of my LAN. But it’s hard to determine if I’ve even got that right.

    • irmadlad@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      10 hours ago

      every time I try to learn something, even laser targeted, I don’t get the full answer from the first thing I find, and the next answer I find introduces more complexity

      Can empathize. Read a tutorial and think ‘Well, that seems pretty straight forward’. Read another tutorial about the same topic…‘Jebus that does not seem straight forward.’

      • ampersandrew@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        10 hours ago

        It took me about 50 YouTube videos to get me to a point where I believe I now understand what a reverse proxy is and how I should use one.

  • SpikesOtherDog@ani.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    7 hours ago

    I revived my second WAP today by soldering on a serial header and reloading the firmware. Sounds badass, but I broke it myself and then did a crap job trying to improvise and revive it the first time. I had to buy the correct tools before I could try again.

    On the other hand, my SMB shares were mysteriously down this morning. Easy fix, but weird.

  • rtxn@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    10 hours ago

    Right at this moment, I’m rebuilding my homelab after a double HDD failure earlier this year.

    The previous build had a RAID 5 array of three 1TB Seagate Barracudas that I picked out of the scrap pile at work. I knew what I was getting into and only kept replaceable files on it. When one of the drives started doing the death rattle, I decided to yank some harder-to-acquire files to my 3TB desktop HDD before trying to resilver the entire array. Guess which device was the next to fail. I could mount and read it, but every operation took 2-5 minutes. SMART showed a reallocation count in the thousands. That drive contained some important files that I couldn’t replace, which were backed up to the (now dead) server. Fortunately ddrescue managed to recover damn near everything and I only lost 80 kilobytes out of the entire disk. That was a very expensive lesson that I’ve learned very cheaply.

    The new setup has a RAIDz1 pool of 3x 4TB Ironwolf disks (constrained by the available SATA sockets on the motherboard), plus a new SSD for the OS and 16GB RAM (upgraded from literally the first SSD I ever bought and 10GB mis-matched DDR3).

    Mounting it was a bit of a dilemma. The previous array was simply mounted to the filesystem from fstab and bind-mounted to the containers. I definitely wanted the storage to be managed from Proxmox’s web UI and to be able to create VDs and LXC volumes on it. Some community members helped me choose ZFS over LVM-on-RAID5. Setting up the correct permissions wasn’t as much of a headache as last time. I’ve just managed to get a Samba+NFS+HTTP file server and Jellyfin running and talking to each other. Forgejo and Nextcloud will be next.

  • Boomer Humor Doomergod@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    13 hours ago

    I’m in the unenviable position of basically doing self hosting for work and all I want to do on the weekend is get greasy under my car.

    Gonna change the oil later and replace a MAF

  • black_flag@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    6
    ·
    11 hours ago

    Everything’s fucking terrible infested with AI slop I don’t know what the fuck software to run everything is bad

    • Ooops@feddit.org
      link
      fedilink
      English
      arrow-up
      1
      ·
      9 hours ago

      If it wast just AI, but the idiotic crawlers everywhere are getting worse by the day it feels.

      I still have some ancient RPi running a basic homepage with some reverse proxies. A few weeks ago and after stopping to care about that thing years ago I realized that the access log that was just happily sitting there for years without getting to relevant sizes has suddenly grown by nearly 1GB, most of it in the last 6-8 months because I never bothered to set up logrotate.

      But hey… I wanted to test setting up Anubis for quite some time. So now I can watch them run circles in the (still experimental) honeypot feature reading pages and pages of non-sensical babbling 😂

  • IsoKiero@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    4
    ·
    11 hours ago

    Fixed headaches with my proxmox backup server. It has a SAS-controller and 4 spinning drives running backups at detached garage and the old fujitsu desktop I dug out of office dumpster pile just kept crashing. Flashed controller to IT-firmware, updated bios on motherboard and did everything else I could figure out but the system just lost the drives pretty much daily and required a hard reset. Turns out, or at least that’s my conclusion, that the PSU on the machine just didn’t have enough juice for the whole setup and that caused instability. I dug out old (2010 or so) desktop from my own pile and threw 600W PSU on the box, it’s now been stable for at least a week.

    I would’ve liked to keep the fujitsu-machine as it’s in a more compact case and couple of generations newer CPU, but that thing has propietary power supply so it was easier to swap out the whole system and just move drives from one to another. So, the current setup consumes maybe a bit more electricity, but at least it’s doing what it is supposed to.

  • TacoEvent@lemmy.zip
    link
    fedilink
    English
    arrow-up
    2
    ·
    10 hours ago

    Dug up an old RPI 3 for the sole purpose of running a bulletproof sandbox for Claude Code. At 1 gb of RAM it’s a little slow but quite tolerable for running —dangerously-skip-permissions and one shot prompts.

    Network isolated, Tailscale ACLs in, sudoless user created, and service account tokens setup with minimal permissions to GitHub. It should function almost exactly like Claude code remote. Excited to start testing it.

  • bluGill@fedia.io
    link
    fedilink
    arrow-up
    3
    ·
    12 hours ago

    I did the monthly arch upgrade and had to reboot to get some service to come up. Took an extra 5 minutes this month.

  • confusedpuppy@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    2
    ·
    12 hours ago

    Since last time, I’ve finally learned how to make rootless podman work on Alpine Linux and it’s been pretty smooth so far.

    My Pi4 is quietly running HomeAssistant and I like to leave that untouched so I don’t have to worry about pooping in the dark. I learned that the container requires root in order to access the Zigbee USB dongle through dbus so I can’t really run it as a rootless container. It’s not web facing so it’s locked down to my local network which is good enough for me.

    My Pi5 is finally up and running again. Got a new, shorter domain name, managed to get the TLS set up in one go with Caddy which was nice. Right now I just have a bunch of wiki’s hosted with Kiwix and a file server using Caddy.

    I’m putting the final touches on my series of scripts I wrote meant for automating backups. rTransfer for the actual backup, remoff for rotating backups (I plan to keep 1 backup a week, over a months time), and containers-util(work in progress) to automatically start and stop containers in preparation for a backup.

    A bit crazy but I’ve been working on this whole backup process on and off for about a year now. It’s all POSIX portable except for a few commands like rsync and podman. Once I finish the last script, I’ll set up a blog and then my server will be secure to my liking and very low maintenance (my keep-alive script I also wrote has been working better than expected).

    I also wrote a Dynamic MOTD script which updates /etc/motd with some basic information about the machine so I can get a quick look at the machine I’m ssh’ing into. I’m quite happy with how it turned out too.

    I’ve been trying to use as few programs as possible and building my own when I can. It’s been quite the adventure this past year and a bit.

  • non_burglar@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    12 hours ago

    I made the right decision to get two HP elitedesk g4 minis instead of one, and to give them both plenty of ram (before the shortage).

    I temporarily “lost” one of them after a power outage revealed that the CMOS battery was dead in one. It’s back up and running, but it was nice to be able to simply import the containers on the remaining good unit while I fixed it up.

    Test your backups! It’ll save you time and headache, plus you can feel superior to others.