Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
raxxor
10 months ago
|
parent
|
context
|
favorite
| on:
Questions censored by DeepSeek
That is semantics and they are strongly comparable with their input and output. Distillation is different to finetuning.
Sure, you could say that only running the 600+b model is running "the real thing"...
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
Sure, you could say that only running the 600+b model is running "the real thing"...