What is the correlation between varying hot neuron selection thresholds in PowerInfer and token generation thr
Description
Small language models (SLMs), despite their widespread adoption in modern smart devices, have received significantly less academic attention compared to their large language model (LLM) counterparts, which are predominantly deployed in data centers and cloud environments. While researchers continue to improve the capabilities of LLMs in the pursuit of artificial general intelligence, SLM research aims to make machine intelligence more accessible, affordable, and efficient for everyday tasks. Focusing on transformer-based, decoder-only language models with 100M-5B parameters, we survey 70 state
Research goal: What is the correlation between varying hot neuron selection thresholds in PowerInfer and token generation throughput when running LLaMA-70B on the MBPP code generation dataset?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 9.2/10.
Notes
Files
paper.pdf
Files
(74.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:1e5863201dca8151d2079f494d43f5ee
|
74.8 kB | Preview Download |