<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[llama.cpp avec Vulkan]]></title><description><![CDATA[<p dir="auto">Test de la configuration :</p>
<pre><code class="language-bash"># npx --no node-llama-cpp inspect gpu
OS: Ubuntu 24.04.4 LTS (x64)
Node: 26.3.1 (x64)

node-llama-cpp: 3.18.1
Prebuilt binaries: b8390

ggml_cuda_init: failed to initialize CUDA: CUDA driver version is insufficient for CUDA runtime version
CUDA: available
ggml_cuda_init: failed to initialize CUDA: CUDA driver version is insufficient for CUDA runtime version
Vulkan: available

CUDA devices: Quadro M5000, Quadro M4000
CUDA used VRAM: 0.85% (138.88MB/15.86GB)
CUDA free VRAM: 99.14% (15.73GB/15.86GB)

Vulkan devices: Quadro M5000, Quadro M4000
Vulkan used VRAM: 1.76% (298.13MB/16.48GB)
Vulkan free VRAM: 98.23% (16.19GB/16.48GB)

CPU model: Intel(R) Xeon(R) CPU E5-2450 v2 @ 2.50GHz
Math cores: 32
Used RAM: 16.49% (2.99GB/18.14GB)
Free RAM: 83.5% (15.15GB/18.14GB)
Used swap: 0% (0B/4GB)
Max swap size: 4GB
mmap: supported

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/topic/369/llama.cpp-avec-vulkan</link><generator>RSS for Node</generator><lastBuildDate>Mon, 22 Jun 2026 16:12:37 GMT</lastBuildDate><atom:link href="https://lemmy.cyber-neurones.org/topic/369.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 19 Jun 2026 14:16:21 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 17:49:39 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://huggingface.co/unsloth/Qwen3.5-2B-GGUF/resolve/main/Qwen3.5-2B-Q4_0.gguf?download=true" rel="nofollow ugc">https://huggingface.co/unsloth/Qwen3.5-2B-GGUF/resolve/main/Qwen3.5-2B-Q4_0.gguf?download=true</a></p>
]]></description><link>https://lemmy.cyber-neurones.org/post/933</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/933</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 17:49:39 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 17:46:13 GMT]]></title><description><![CDATA[<p dir="auto">Le meilleur modèle semble être <a href="https://huggingface.co/Qwen/Qwen3.5-2B" rel="nofollow ugc">https://huggingface.co/Qwen/Qwen3.5-2B</a> pour mes cartes.</p>
]]></description><link>https://lemmy.cyber-neurones.org/post/932</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/932</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 17:46:13 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 17:29:07 GMT]]></title><description><![CDATA[<p dir="auto">Mon fichier service :</p>
<pre><code class="language-bash"># systemctl status llama-server
● llama-server.service - Llama Server
     Loaded: loaded (/etc/systemd/system/llama-server.service; disabled; preset: enabled)
     Active: active (running) since Fri 2026-06-19 17:27:42 UTC; 29s ago
   Main PID: 37413 (llama-server)
      Tasks: 41 (limit: 94224)
     Memory: 91.7M (peak: 91.7M)
        CPU: 3.103s
     CGroup: /system.slice/llama-server.service
             └─37413 /usr/local/bin/llama-server --model /models/qwen2.5-1.5b-instruct-q4_k_m.gguf --host 0.0.0.0 --port 8080

juin 19 17:27:42 jellyfin systemd[1]: Started llama-server.service - Llama Server.
root@jellyfin:/home/arias/llama.cpp/build# cat /etc/systemd/system/llama-server.service
[Unit]
Description=Llama Server
After=network.target

[Service]
Type=simple
User=root
WorkingDirectory=/home/XXXX/llama.cpp
Environment="NVM_BIN=/root/.nvm/versions/node/v26.3.1/bin"
Environment="LD_LIBRARY_PATH=:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64"
Environment="VULKAN_VERSION=1.4.350.1"
ExecStart=/usr/local/bin/llama-server \
  --model /models/qwen2.5-1.5b-instruct-q4_k_m.gguf \
  --host 0.0.0.0 --port 8080
Restart=on-failure
RestartSec=5s
StandardOutput=file:/tmp/llama-server.stdout.log
StandardError=file:/tmp/llama-server.stderr.log

[Install]
WantedBy=multi-user.target

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/931</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/931</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 17:29:07 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 17:15:16 GMT]]></title><description><![CDATA[<p dir="auto">Test en ligne de commande :</p>
<pre><code class="language-bash"># llama-server -m /models/qwen2.5-1.5b-instruct-q4_k_m.gguf --host 0.0.0.0
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/930</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/930</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 17:15:16 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 17:08:36 GMT]]></title><description><![CDATA[<p dir="auto">Arret de openwebui :</p>
<pre><code class="language-bash"># systemctl stop openwebui
# systemctl disable openwebui
Removed "/etc/systemd/system/multi-user.target.wants/openwebui.service".

</code></pre>
<p dir="auto">Arret de ollama :</p>
<pre><code class="language-bash"># systemctl stop ollama
# systemctl disable ollama
Removed "/etc/systemd/system/default.target.wants/ollama.service".

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/929</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/929</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 17:08:36 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:41:33 GMT]]></title><description><![CDATA[<p dir="auto">Petit test :</p>
<pre><code class="language-bash"># make install
# ldconfig -v
#  llama-bench -m  /models/qwen2.5-1.5b-instruct-q4_k_m.gguf
ggml_vulkan: Found 2 Vulkan devices:
ggml_vulkan: 0 = Quadro M5000 (NVIDIA) | uma: 0 | fp16: 0 | bf16: 0 | warp size: 32 | shared memory: 49152 | int dot: 1 | matrix cores: none
ggml_vulkan: 1 = Quadro M4000 (NVIDIA) | uma: 0 | fp16: 0 | bf16: 0 | warp size: 32 | shared memory: 49152 | int dot: 1 | matrix cores: none
| model                          |       size |     params | backend    | ngl |            test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | --------------: | -------------------: |
| qwen2 1.5B Q4_K - Medium       | 934.69 MiB |     1.54 B | Vulkan     |  -1 |           pp512 |         53.48 ± 0.42 |
| qwen2 1.5B Q4_K - Medium       | 934.69 MiB |     1.54 B | Vulkan     |  -1 |           tg128 |         63.55 ± 0.73 |

build: 5fd2dc2c4 (9721)</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/928</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/928</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:41:33 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:31:41 GMT]]></title><description><![CDATA[<p dir="auto">La commande pour le build :</p>
<pre><code class="language-bash"># cmake --build build --config Release -j
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/927</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/927</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:31:41 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:30:48 GMT]]></title><description><![CDATA[<p dir="auto">Nouveau build :</p>
<pre><code class="language-bash">#  cmake -B build -DGGML_VULKAN=ON -DCMAKE_BUILD_TYPE=Release
CMAKE_BUILD_TYPE=Release
-- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF
-- CMAKE_SYSTEM_PROCESSOR: x86_64
-- GGML_SYSTEM_ARCH: x86
-- Including CPU backend
-- x86 detected
-- Adding CPU backend variant ggml-cpu: -march=native 
-- Found Vulkan: /usr/lib/x86_64-linux-gnu/libvulkan.so (found version "1.4.313") found components: glslc glslangValidator 
-- Vulkan found
-- GL_KHR_cooperative_matrix supported by glslc
-- GL_NV_cooperative_matrix2 supported by glslc
-- GL_NV_cooperative_matrix_decode_vector not supported by glslc
-- GL_EXT_integer_dot_product supported by glslc
-- GL_EXT_bfloat16 supported by glslc
-- Including Vulkan backend
-- ggml version: 0.15.2
-- ggml commit:  5fd2dc2c4
-- Found OpenSSL: /usr/lib/x86_64-linux-gnu/libcrypto.so (found version "3.0.13")  
-- Performing Test OPENSSL_VERSION_SUPPORTED
-- Performing Test OPENSSL_VERSION_SUPPORTED - Success
-- OpenSSL found: 3.0.13
-- Generating embedded license file for target: llama-app
-- Configuring done (5.0s)
-- Generating done (0.6s)

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/926</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/926</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:30:48 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:29:45 GMT]]></title><description><![CDATA[<p dir="auto">La boulette j’ai pas pris la bonne version… on recommance :</p>
<pre><code class="language-bash">rm  /etc/apt/sources.list.d/lunarg-vulkan-jammy.list 
wget -qO- https://packages.lunarg.com/lunarg-signing-key-pub.asc | sudo tee /etc/apt/trusted.gpg.d/lunarg.asc
sudo wget -qO /etc/apt/sources.list.d/lunarg-vulkan-noble.list http://packages.lunarg.com/vulkan/lunarg-vulkan-noble.list
sudo apt update
sudo apt install vulkan-sdk
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/925</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/925</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:29:45 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:27:21 GMT]]></title><description><![CDATA[<p dir="auto">Installation de SDK Vulkan :</p>
<pre><code class="language-bash"># wget -qO- https://packages.lunarg.com/lunarg-signing-key-pub.asc | sudo tee /etc/apt/trusted.gpg.d/lunarg.asc
# sudo wget -qO /etc/apt/sources.list.d/lunarg-vulkan-jammy.list http://packages.lunarg.com/vulkan/lunarg-vulkan-jammy.list
# sudo apt update
# sudo apt install vulkan-sdk
</code></pre>
<p dir="auto">Mais erreur :</p>
<pre><code class="language-bash"># sudo apt install vulkan-sdk
Lecture des listes de paquets... Fait
Construction de l'arbre des dépendances... Fait
Lecture des informations d'état... Fait      
Certains paquets ne peuvent être installés. Ceci peut signifier
que vous avez demandé l'impossible, ou bien, si vous utilisez
la distribution unstable, que certains paquets n'ont pas encore
été créés ou ne sont pas sortis d'Incoming.
L'information suivante devrait vous aider à résoudre la situation : 

Les paquets suivants contiennent des dépendances non satisfaites :
 crashdiagnosticlayer : Dépend: libyaml-cpp0.7 (&gt;= 0.7.0) mais il n'est pas installable
E: Impossible de corriger les problèmes, des paquets défectueux sont en mode « garder en l'état ».

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/924</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/924</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:27:21 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:22:26 GMT]]></title><description><![CDATA[<p dir="auto">Tentative de build :</p>
<pre><code class="language-bash"># cmake -B build -DGGML_VULKAN=ON -DCMAKE_BUILD_TYPE=Release
-- The C compiler identification is GNU 13.3.0
-- The CXX compiler identification is GNU 13.3.0
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /usr/bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
CMAKE_BUILD_TYPE=Release
-- Found Git: /usr/bin/git (found version "2.43.0") 
-- The ASM compiler identification is GNU
-- Found assembler: /usr/bin/cc
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE  
-- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF
-- CMAKE_SYSTEM_PROCESSOR: x86_64
-- GGML_SYSTEM_ARCH: x86
-- Found OpenMP_C: -fopenmp (found version "4.5") 
-- Found OpenMP_CXX: -fopenmp (found version "4.5") 
-- Found OpenMP: TRUE (found version "4.5")  
-- Including CPU backend
-- x86 detected
-- Adding CPU backend variant ggml-cpu: -march=native 
CMake Error at /usr/share/cmake-3.28/Modules/FindPackageHandleStandardArgs.cmake:230 (message):
  Could NOT find Vulkan (missing: glslc) (found version "1.3.275")
Call Stack (most recent call first):
  /usr/share/cmake-3.28/Modules/FindPackageHandleStandardArgs.cmake:600 (_FPHSA_FAILURE_MESSAGE)
  /usr/share/cmake-3.28/Modules/FindVulkan.cmake:600 (find_package_handle_standard_args)
  ggml/src/ggml-vulkan/CMakeLists.txt:9 (find_package)


-- Configuring incomplete, errors occurred!


</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/923</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/923</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:22:26 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:21:59 GMT]]></title><description><![CDATA[<p dir="auto">Test :</p>
<pre><code class="language-bash"># vulkaninfo | grep -i "deviceName"
'DISPLAY' environment variable not set... skipping surface info
error: XDG_RUNTIME_DIR is invalid or not set in the environment.
	deviceName        = Quadro M5000
	deviceName        = Quadro M4000
	deviceName        = llvmpipe (LLVM 20.1.2, 256 bits)
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/922</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/922</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:21:59 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:20:10 GMT]]></title><description><![CDATA[<p dir="auto">Ajout pour la compilation :</p>
<pre><code class="language-bash"># sudo apt install libvulkan-dev vulkan-tools glslang-tools  cmake build-essential git
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/921</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/921</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:20:10 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:19:43 GMT]]></title><description><![CDATA[<p dir="auto">Actuellement on est :</p>
<pre><code class="language-bash">Downloading Vulkan SDK version 1.4.350.1
</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/920</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/920</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:19:43 GMT</pubDate></item><item><title><![CDATA[Reply to llama.cpp avec Vulkan on Fri, 19 Jun 2026 14:17:46 GMT]]></title><description><![CDATA[<p dir="auto">Lancement de l’installation :</p>
<pre><code class="language-bash">sudo apt update
sudo apt install curl wget xz-utils

export VULKAN_VERSION="$(curl -fsSL https://vulkan.lunarg.com/sdk/latest/linux.txt)"                                                                                      

echo "Downloading Vulkan SDK version ${VULKAN_VERSION}"
curl --progress-bar "https://sdk.lunarg.com/sdk/download/${VULKAN_VERSION}/linux/vulkan_sdk.tar.xz" -o "/opt/vulkan-sdk.tar.xz"

echo "Installing Vulkan SDK to /opt/vulkan-sdk"
rm -rf "/opt/vulkan-sdk" &amp;&amp; mkdir -p "/opt/vulkan-sdk"
tar -Jxf "/opt/vulkan-sdk.tar.xz" --strip-components=1 -C "/opt/vulkan-sdk"
rm -f "/opt/vulkan-sdk.tar.xz"

echo "Adding Vulkan SDK environment variables to shell profiles"
([ ! -f "$HOME/.bashrc" ] || grep -qxF "source /opt/vulkan-sdk/setup-env.sh" "$HOME/.bashrc") || (echo "source /opt/vulkan-sdk/setup-env.sh" &gt;&gt; "$HOME/.bashrc")
([ ! -f "$HOME/.zshrc" ] || grep -qxF "source /opt/vulkan-sdk/setup-env.sh" "$HOME/.zshrc") || (echo "source /opt/vulkan-sdk/setup-env.sh" &gt;&gt; "$HOME/.zshrc")
source /opt/vulkan-sdk/setup-env.sh

</code></pre>
]]></description><link>https://lemmy.cyber-neurones.org/post/919</link><guid isPermaLink="true">https://lemmy.cyber-neurones.org/post/919</guid><dc:creator><![CDATA[farias]]></dc:creator><pubDate>Fri, 19 Jun 2026 14:17:46 GMT</pubDate></item></channel></rss>