When will this be available in the Transformers library?
#87 opened about 3 hours ago
by
solwol
Cannot regenerate (blank response)
#86 opened about 4 hours ago
by
pluhong
A bug using the Hugging Face API
#85 opened about 11 hours ago
by
Kevin355
Do we need authorization to use this?
#84 opened about 12 hours ago
by
Natwar
Where is the source code for this model? What do they proudly mean by open-source models?
1
#83 opened about 16 hours ago
by
tstarksys
Zhiwang releases a DeepSeek-R1 lazy package: unzip and use
1
#81 opened about 18 hours ago
by
zwpython
model-00078-of-000163.safetensors not marked safe?
2
#80 opened about 19 hours ago
by
aborst
Create Dare
#79 opened about 21 hours ago
by
Dara996
problem with using serverless inference
1
#78 opened about 22 hours ago
by
manju2345
Some weird censorship on a non-sensitive topic
5
#77 opened 1 day ago
by
junnanwu
Upload dkfoEtm3H4bMcaI0KEJbq.1023.jpeg
#76 opened 1 day ago
by
luckysalami089
Update README.md
#75 opened 1 day ago
by
NuoNb
🚩 Report: Ethical issue(s)
#74 opened 1 day ago
by
Typeofprototype
DeepSeek-R1 falls: zw's heavily modified version, "Nine Birds"
#73 opened 1 day ago
by
zwpython
Consistency: can DeepSeek pass?
#71 opened 1 day ago
by
zwpython
Does this model support text insertion (fill in middle)?
2
#70 opened 2 days ago
by
AayushShah
Thoughts on deepseek-r1. Correct me if I'm wrong
1
#69 opened 2 days ago
by
pkms
Remove unavailable import
#68 opened 2 days ago
by
Rocketknight1
ImportError: cannot import name 'is_torch_greater_or_equal_than_1_13' from 'transformers.pytorch_utils'
8
#67 opened 2 days ago
by
bashir-abubakar
e-currency
3
#63 opened 3 days ago
by
Zhendaxie
Meet PEEPSEEK, the first meme made by DeepSeek r1
1
#61 opened 3 days ago
by
deepseeker3b56
Whale logo, transparent
#60 opened 3 days ago
by
DorianDarko2525
Meet Finley, the Whale of DeepSeek!
#59 opened 3 days ago
by
deepseekjanus
The recent hype and coins
#58 opened 3 days ago
by
Chester1111
Official DeepThink Crypto Currency
1
#56 opened 3 days ago
by
qwen-llm
Congrats, this is by far the best open-source model! Just a few steps until complete domination (feedback)
1
#54 opened 3 days ago
by
Dampfinchen
deepseek
#53 opened 3 days ago
by
denizkaya2022
Modify abbreviations in benchmark images into full name to avoid confusion
#52 opened 3 days ago
by
karminski
How to deploy DeepSeek-R1 with LMDeploy?
#48 opened 3 days ago
by
vansin
Generation fails after fine-tuning on a dataset without thinking
1
#46 opened 4 days ago
by
HuanLin
Use memory to store inactive experts
#45 opened 4 days ago
by
xm10086
Qwen 32B distilled model: when length > 8k, a certain proportion of predictions are garbled, with <think><think><think><think><think><think> appearing
5
#44 opened 5 days ago
by
daniellibin
Update LICENSE
#43 opened 5 days ago
by
town24
DeepSeek Censorship
13
#42 opened 5 days ago
by
rzgar
edit paper link to hf for easier conversations
#41 opened 6 days ago
by
clem
Upload 80b78bb2-3b7e-4a0c-a76c-93e1503c7b30.jpeg
#40 opened 6 days ago
by
Uman1
The LICENSE-MODEL file is missing??
#39 opened 6 days ago
by
spanspek
New permissions gate doesn't look valid
3
#38 opened 6 days ago
by
AdjectiveAllison
Amazing Release! Can we also have DeepSeek-R1-Zero-Qwen-32B
#37 opened 6 days ago
by
cfpark00
Question about possible R1 - lite versions 70b / 32b
#36 opened 6 days ago
by
smokestudio
Update README.md
1
#35 opened 6 days ago
by
sloshywings
Add pipeline tag
#34 opened 6 days ago
by
nielsr
Deploying production ready Deepseek R1 on your AWS with vLLM
5
#32 opened 7 days ago
by
samagra14
Create Stephy
#31 opened 7 days ago
by
Kouadio12
comfyui-deepseek-r1
#30 opened 7 days ago
by
zwpython
I can't use your model in Hugging Face Spaces
2
#29 opened 7 days ago
by
MrEscorpion
Upload IMG_2394.jpeg
#28 opened 7 days ago
by
Itsvijay12