llama.cpp : Installation

Reply to llama.cpp : Installation on Fri, 19 Jun 2026 10:53:53 GMT

farias — Fri, 19 Jun 2026 10:53:53 GMT

Mise à jours de Ollama :

# curl -fsSL https://ollama.com/install.sh | sh
>>> Cleaning up old version at /usr/local/lib/ollama
>>> Installing ollama to /usr/local
>>> Downloading ollama-linux-amd64.tar.zst
######################################################################## 100.0%
>>> Adding ollama user to render group...
>>> Adding ollama user to video group...
>>> Adding current user to ollama group...
>>> Creating ollama systemd service...
>>> Enabling and starting ollama service...
>>> NVIDIA GPU installed.

Visiblement même problème :

...
juin 19 10:51:37  ollama[2964]: time=2026-06-19T10:51:37.153Z level=INFO source=model_list_cache.go:111 msg="model list cache hydration complete" models=16 failures=0 elapsed=654.370427ms
juin 19 10:51:42  ollama[2964]: time=2026-06-19T10:51:42.591Z level=WARN source=cuda_compat.go:38 msg="NVIDIA driver too old" device="Quadro M5000" compute=5.2 driver=535 required_driver="570 or newer"
juin 19 10:51:42  ollama[2964]: time=2026-06-19T10:51:42.591Z level=WARN source=cuda_compat.go:38 msg="NVIDIA driver too old" device="Quadro M4000" compute=5.2 driver=535 required_driver="570 or newer"
juin 19 10:51:43  ollama[2964]: time=2026-06-19T10:51:43.181Z level=INFO source=types.go:32 msg="inference compute" id=1 filter_id=1 library=Vulkan compute=0.0 name=Vulkan1 description="Quadro M4000" libd>
juin 19 10:51:43  ollama[2964]: time=2026-06-19T10:51:43.181Z level=INFO source=types.go:32 msg="inference compute" id=0 filter_id=0 library=Vulkan compute=0.0 name=Vulkan0 description="Quadro M5000" libd>
...

Reply to llama.cpp : Installation on Fri, 19 Jun 2026 10:16:21 GMT

farias — Fri, 19 Jun 2026 10:16:21 GMT

Le GPU est trop ancien : https://en.wikipedia.org/wiki/CUDA#Supported_GPUs

Reply to llama.cpp : Installation on Fri, 19 Jun 2026 10:16:07 GMT

farias — Fri, 19 Jun 2026 10:16:07 GMT

Je teste la compilation CUDA :

# cd llama.cpp
# export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64
# export PATH=$PATH:$CUDA_HOME/bin
# cmake -B build -DGGML_CUDA=ON  -DCMAKE_CUDA_COMPILER=`which nvcc`
...
# cmake --build build --config Release -j 20
[  8%] Building CUDA object ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cross-entropy-loss.cu.o
[ 10%] Built target ggml-cpu
nvcc fatal   : Unsupported gpu architecture 'compute_52'

Reply to llama.cpp : Installation on Fri, 19 Jun 2026 10:11:52 GMT

farias — Fri, 19 Jun 2026 10:11:52 GMT

Visiblement pas possible de mettre une version à jours :

root@jellyfin:/home/arias# nvidia-smi
Fri Jun 19 10:10:52 2026       
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.309.01             Driver Version: 535.309.01   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  Quadro M5000                   Off | 00000000:00:10.0 Off |                  Off |
| 40%   48C    P8              14W / 150W |      4MiB /  8192MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
|   1  Quadro M4000                   Off | 00000000:00:1B.0 Off |                  N/A |
| 50%   52C    P8              14W / 120W |      4MiB /  8192MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
                                                                                         
+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|  No running processes found                                                           |
+---------------------------------------------------------------------------------------+
WARNING: infoROM is corrupted at gpu 0000:00:10.0

Reply to llama.cpp : Installation on Fri, 19 Jun 2026 10:09:36 GMT

farias — Fri, 19 Jun 2026 10:09:36 GMT

Et comme toujours perte des drivers pour NVIDIA :

# cd
root@jellyfin:~# nvidia-smi
Failed to initialize NVML: Driver/library version mismatch
NVML library version: 535.309
# apt-get purge nvidia-*
# ubuntu-drivers install
# nvidia-smi
Failed to initialize NVML: Driver/library version mismatch
NVML library version: 535.309
# reboot