So, if I understand the approach correctly: we're essentially doing very advanced feature engineering with LLMs. We find that direct classification by LLMs performs worse than LLM feature engineering followed by decision trees. Am I right?
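To make sure I'm reading it right, here's a minimal sketch of the pipeline as I understand it, assuming a frozen sentence encoder for the embedding step and a toy text classification task (the model name and data are placeholders):

```python
# Sketch of the two-stage pipeline: LLM as feature extractor, tree as classifier.
from sentence_transformers import SentenceTransformer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

texts = ["great product", "terrible service", "works as expected", "broke in a week"]
labels = [1, 0, 1, 0]

# Step 1: "feature engineering" with an LLM -- a frozen encoder turns
# each text into a fixed-length embedding vector.
encoder = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder model
X = encoder.encode(texts)

# Step 2: a decision tree learns the task-specific mapping on top of
# those general-purpose features.
X_train, X_test, y_train, y_test = train_test_split(X, labels, test_size=0.5, random_state=0)
tree = DecisionTreeClassifier(max_depth=5, random_state=0)
tree.fit(X_train, y_train)
print(accuracy_score(y_test, tree.predict(X_test)))
```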
The finding surprises me. I would expect modern LLMs to be powerful enough to do well at the task. Given how much the data is processed before the decision trees, I wouldn't expect decision trees to add much. I can see value in this approach if you're unable to optimize the LLM. But, if you can, I think end-to-end training with a pre-trained LLM is likely to work better.
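For contrast, end-to-end fine-tuning would look roughly like this. A hedged sketch using Hugging Face Transformers; the model name, data, and hyperparameters are all illustrative assumptions:

```python
# One fine-tuning step: gradients flow through the classification head
# *and* the pre-trained encoder, so every layer adapts to the task.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)
model.train()
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

texts = ["great product", "broke in a week"]  # placeholder data
labels = torch.tensor([1, 0])

batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
loss = model(**batch, labels=labels).loss
loss.backward()
optimizer.step()
optimizer.zero_grad()
```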
Perhaps the reason this approach works well is that the LLM gives you good general-purpose language processing while the decision tree learns the specifics of the dataset, and that combination is more powerful than either component alone.
It’s the same reason LLMs don’t perform well on tabular data. (They can do fine, but usually not as well as other models.)
Performing feature engineering with LLMs and then storing the embeddings in a vector database also allows you to reuse the embeddings for multiple tasks (e.g., clustering, nearest-neighbor search).
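Something like this, for instance. The choice of FAISS as the vector store is my assumption; any vector database would do:

```python
# Embed once, reuse the same vectors for several downstream tasks.
import numpy as np
import faiss
from sklearn.cluster import KMeans
from sentence_transformers import SentenceTransformer

texts = ["refund request", "login problem", "billing question", "password reset"]
X = SentenceTransformer("all-MiniLM-L6-v2").encode(texts).astype("float32")

# Task 1: nearest-neighbor search over the stored embeddings.
index = faiss.IndexFlatL2(X.shape[1])
index.add(X)
_, neighbors = index.search(X[:1], 2)  # two nearest neighbors of the first text

# Task 2: clustering the very same vectors, no re-embedding needed.
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(neighbors, clusters)
```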
In practice, almost no one uses plain decision trees, since random forests or gradient-boosted trees perform better and are more robust.
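The swap is usually a one-liner in scikit-learn. A quick sketch on synthetic data to illustrate the comparison; the dataset and hyperparameters are arbitrary:

```python
# Compare a single tree against the ensemble methods that typically replace it.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

for clf in (
    DecisionTreeClassifier(random_state=0),
    RandomForestClassifier(n_estimators=200, random_state=0),
    GradientBoostingClassifier(random_state=0),
):
    score = cross_val_score(clf, X, y, cv=5).mean()
    print(f"{type(clf).__name__}: {score:.3f}")
```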