Terje
|
21e0cc31c4
|
MTP development: Architecture analysis and feasibility study
- mtp-development.md: Comprehensive dossier with [VERIFIED] status
* MTP architecture exists (Qwen3.5-27B layer 64)
* Performance: 0.70× baseline single-head, 0.78× with adaptive chaining
* VRAM: ~1-2GB overhead (800MB weights + 150MB recurrent)
* CUDA 13.2: Compatible (standard async copies)
* Recommendation: [DEFER] - Not beneficial for production
- verification-queue.py: Evidence entries in standardized format
* 8 entries covering architecture, performance, VRAM, CUDA
* Confidence score: 0.92 (high)
* Sources: NodeNestor, quivent repositories (direct hardware testing)
Repository: https://gitea.sverd.eu/terjejsd/hermes-profiles
|
2026-05-05 10:13:08 +00:00 |
|