DeepSeek collects keystroke data and more, storing it in Chinese servers (mashable.com)
Posted by restingboredface@sh.itjust.works to Privacy@lemmy.ml · English · 1 year ago · 295 comments
ayaya@lemdro.id · English · 1 year ago
This is mildly pedantic, but you’re not actually running DeepSeek R1; you’re running a 7B version of Qwen that’s been fine-tuned on DeepSeek R1 outputs. All of the “distilled” models are existing models fine-tuned on R1 outputs.
ZeDoTelhado@lemmy.world · 1 year ago
Nice catch. I’ll be sure to run the real thing afterwards.