Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

@Blaed@lemmy.world · 2 years ago

@drspod@lemmy.ml · 2 years ago

Is this model trained specifically for problem solving, or does it also perform as well as ChatGPT on conversational and generic text-generation tasks?

@L_Acacia@lemmy.one · 2 years ago

Specifically problem solving. ChatGPT has multiple models too; that is just hidden from the user.

0421008445828ceb46f496700a5fa6 · 2 years ago

We fine-tuned on a proprietary dataset of ~80k high-quality programming problems and solutions.

@ChrisLicht@lemm.ee · 2 years ago

Dumb question: Does one install the Python model, or access it online?

@L_Acacia@lemmy.one · edit-2 2 years ago

The best way to run a Llama model locally is with Text generation web UI. The model will most likely be quantized to 4- or 5-bit GGML / GPTQ today, which will make it possible to run on a “normal” computer.
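To give a rough sense of why quantization matters here, this is a back-of-the-envelope sketch of the memory needed for the weights alone. It assumes about 34 billion parameters and counts only the raw bits per weight. Real GGML / GPTQ files also store scales and other metadata, and inference needs extra memory on top, so treat the numbers as a lower bound. The function name is just for illustration.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at exactly `bits_per_weight` bits,
    with no per-block scales or other format overhead.
    """
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 34e9  # assumed parameter count for a "34B" model

for label, bits in [("fp16", 16), ("5-bit", 5), ("4-bit", 4)]:
    print(f"{label:>6}: ~{weight_memory_gb(N_PARAMS, bits):.1f} GB")
```

Under these assumptions the weights drop from roughly 68 GB at 16 bits to roughly 17 GB at 4 bits, which is the difference between needing server hardware and fitting on a well-equipped desktop.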

Phind might make it accessible on their website soon, but it doesn’t seem to be the case yet.

EDIT: Quantized versions are available thanks to TheBloke.

@ChrisLicht@lemm.ee · 2 years ago

You are awesome; thanks for the clue-in!

@babysharknanana@lemmy.world · 2 years ago

Exciting, but as far as I know, we can’t use LLaMA commercially. So how would one use it in a non-commercial context? Isn’t it expensive to embed such a model in free/open-source software?

@L_Acacia@lemmy.one · 2 years ago

Llama 2 now uses a license that allows for commercial use.

@babysharknanana@lemmy.world · 2 years ago

I know, but the text only talks about Llama. So is this using Llama 2?

@abhibeckert@lemmy.world · edit-2 2 years ago

Llama 2 and Llama are basically exactly the same model, except the “2” version has a more permissive license and was trained on a larger source data set. Nobody should ever use the old one, and I expect the noncommercial license was part of a contract Meta signed with someone who provided source material.

This is “CodeLlama”, which was built on Llama 2 and allows commercial use.
