Run local inference without cloud costs

local-llm-bridgeskillsetup L3★3

What it does

Route bounded tasks to local Gemma LLM

Best for

Sub-second bounded tasks using local Gemma without cloud latency

Inputs

Outputs

Requires

Preconditions

llama-server running on localhost:8089

Failure modes

llama-server not running; command fails

Trust signals