anemll, 5 months ago, on: Run LLMs on Apple Neural Engine (ANE)
Right. I was thinking about it: you still need batch prefill. However, Apple's Core ML Tools were failing on quantization of attention activations. With long context, prefill is still compute-bound.
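To illustrate the compute-bound point, here is a rough roofline-style sketch in Python. It is not taken from ANEMLL or Core ML Tools. The function names, the single-linear-layer cost model, and the hardware figures (`PEAK_FLOPS`, `BANDWIDTH`) are all assumptions chosen for illustration, not measured ANE numbers.

```python
def arithmetic_intensity(batch_tokens, d_model, weight_bytes=2, act_bytes=2):
    """FLOPs per byte moved for one d_model x d_model linear layer.

    Simplified model: weights are read once per batch, activations are
    read once and written once per token.
    """
    flops = 2 * batch_tokens * d_model * d_model
    bytes_moved = (d_model * d_model * weight_bytes
                   + 2 * batch_tokens * d_model * act_bytes)
    return flops / bytes_moved


def is_compute_bound(batch_tokens, d_model, peak_flops, bandwidth_bytes):
    """True when the layer's intensity reaches the hardware ridge point."""
    ridge = peak_flops / bandwidth_bytes  # FLOPs per byte at the roofline knee
    return arithmetic_intensity(batch_tokens, d_model) >= ridge


# Hypothetical hardware figures, for illustration only.
PEAK_FLOPS = 10e12   # 10 TFLOP/s (assumed)
BANDWIDTH = 100e9    # 100 GB/s (assumed)
D_MODEL = 4096       # assumed hidden size

for name, tokens in [("decode (1 token)", 1), ("prefill (512 tokens)", 512)]:
    intensity = arithmetic_intensity(tokens, D_MODEL)
    bound = "compute" if is_compute_bound(tokens, D_MODEL, PEAK_FLOPS, BANDWIDTH) else "memory"
    print(f"{name}: {intensity:.1f} FLOPs/byte, {bound}-bound")
```

Under these assumptions, single-token decode sits far below the ridge point (memory-bound), while a batched prefill amortizes each weight read over many tokens and crosses it. That is why smaller weights alone would not speed up long-context prefill in this model.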