Evidence¶
“설계” → “검증된 시스템”
This page answers the single question a skeptical reviewer asks:
“does this actually run?” Each row links to a reproducible
artefact (captured .pccx trace, Vivado utilisation report, or
board-log excerpt) so the numbers can be independently verified.
When a measurement is not yet in hand the row is pending with the gating task explicitly called out — never a speculative figure.
Measured (reproducible)¶
Metric |
Value |
Source |
Reproducer |
|---|---|---|---|
Sail model type-check |
clean |
|
|
pccx-core test suite |
7/7 ISA + 16 analyzer tests |
|
|
|
bit-exact |
|
|
Sphinx zero-warning build |
EN + KO |
|
|
Golden-diff regression gate (self-calibrated) |
8 / 8 steps + 128 / 128 steps within ±15 % |
|
|
Pending (board / synth)¶
Metric |
Status |
Gate |
|---|---|---|
End-to-end Gemma-3N E4B decode tok/s |
pending board run |
§4.1 RTL dispatcher + Global_Scheduler wiring (Hardware Architecture) |
KV260 resource usage (LUT / DSP / URAM / BRAM) |
pending Vivado impl |
|
Timing closure @ 400 MHz core / 250 MHz AXI |
pending Vivado impl |
Gate as above |
Layer-by-layer golden-model diff (vs PyTorch reference) |
pending |
Scaffold ( |
P99 decode latency under sustained load |
pending board capture |
Requires 512-token run on real DDR traffic. |
7 W TDP headroom under W4A8KV4 decode |
pending Vivado impl + board pmbus |
Gates same as resource usage. |
Baselines (for future comparison)¶
Baseline |
Target |
Method |
|---|---|---|
CPU (Ryzen 4500U, llama.cpp Q4_K_M) |
tok/s on Gemma-3N E4B |
|
GPU (RTX 4060, HF Transformers bf16) |
tok/s on Gemma-3N E4B |
PyTorch 2.4, generate() with KV cache on, batch = 1. |
On-device (pccx v002 @ KV260) |
tok/s on Gemma-3N E4B |
|
How this page gets updated¶
pccx-FPGA-NPU-LLM-kv260captures a new.pccxor Vivado report.pccx-labexports the relevant fields viapccx_analyze --json.A commit to this repo lands the numbers in the tables above, with the source link and a permanent
samples/artefact.make strictpasses, CI re-deploys the page.
No speculative numbers. Every row either links to a reproducible artefact or is marked pending with a named gate.
Cite this page¶
@misc{pccx_evidence_2026,
title = {pccx Evidence: reproducible measurement log for an open W4A8 NPU},
author = {Kim, Hyunwoo},
year = {2026},
howpublished = {\url{https://pccxai.github.io/pccx/en/docs/Evidence/index.html}},
note = {Tracks the "설계 → 검증된 시스템" closure plan. Part of pccx: \url{https://pccxai.github.io/pccx/}}
}