where is the source code for this Model ? - what does they prodoudly say by open-source models?
#83
by
tstarksys
- opened
Is it right to say open-model(extensible model) or open-source-model ?
as per technical report(https://arxiv.org/html/2412.19437v1) says.... also making significant strides, endeavoring to close the gap with their closed-source counterparts. To further push the boundaries of open-source model capabilities,....I just want to figure out where is source code for the model is where they have implemented Multi-head Latent Attention
I am not an expert in this area, but https://huggingface.co/deepseek-ai/DeepSeek-R1/blob/main/modeling_deepseek.py