Does the Python specialization in Code Llama lead to improved instruction-following performance in non-Python
Description
We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama - Python), and instruction-following models (Code Llama - Instruct) with 7B, 13B, 34B and 70B parameters each. All models are trained on sequences of 16k tokens and show improvements on inputs with up
Research goal: Does the Python specialization in Code Llama lead to improved instruction-following performance in non-Python languages when evaluated on the CodeGen-Coder benchmark, compared to the base Code Llama model?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.5/10.
Notes
Files
paper.pdf
Files
(86.3 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:a2543ed88719a53a0247624942ba9d68
|
86.3 kB | Preview Download |