I think the innovation is using caching to make the cost of the approach manageable. The way they implemented it, each time you create a chunk you ask the LLM to generate an atomic chunk from the whole document context. You need to do this for all tens of thousands of chunks in your data, which costs a lot. By caching the documents, you save on those costs.
I don't understand how that helps here. They're not regenerating each chunk every time; this is about caching the model's state after running a large document through it. You can only do this kind of thing if you have access to the model itself, or if the API you use provides it.
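Concretely, when the API does provide it, each per-chunk call can mark the full document as a cacheable prefix so the document tokens are only processed at full price once. A minimal sketch, assuming an Anthropic-style messages API with `cache_control` blocks; the model id, prompt wording, and function name here are illustrative, not from the thread:

```python
import anthropic

client = anthropic.Anthropic()

def contextualize_chunk(document: str, chunk: str) -> str:
    """Ask the model to situate one chunk within the full document.

    The document is placed in a cacheable system block, so repeated
    per-chunk calls against the same document can reuse the cached prefix
    instead of reprocessing the whole document every time.
    """
    response = client.messages.create(
        model="claude-3-5-haiku-latest",  # illustrative model choice
        max_tokens=300,
        system=[
            {
                "type": "text",
                "text": "Here is the full document:\n\n" + document,
                # Mark the long, repeated prefix as cacheable.
                "cache_control": {"type": "ephemeral"},
            }
        ],
        messages=[
            {
                "role": "user",
                "content": (
                    "Write a short context that situates this chunk within "
                    "the document above, for use in retrieval:\n\n" + chunk
                ),
            }
        ],
    )
    return response.content[0].text
```

Only the first call for a given document pays the full input price for the document tokens; later chunk calls that land within the cache lifetime read the cached prefix at a much lower rate, which is what makes running this over tens of thousands of chunks affordable.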