AI tutorial - IT technology blog

How to Convert LLM Models to GGUF Format with llama.cpp Quantization

Running large language models locally requires shrinking their file size without destroying quality. This guide walks through the full llama.cpp pipeline: downloading a Hugging Face model, converting it to GGUF format, and quantizing it to Q4_K_M or other levels to fit consumer hardware.
Linux tutorial - IT technology blog

How I Cut My Linux Server Boot Time in Half Using systemd-analyze

After six months of running a slow-booting Ubuntu 22.04 production server, I used systemd-analyze to trace a 40-second boot time down to three root causes and fixed them. Here is the full process, from reading the blame output to choosing between disabling, masking, or delaying services.