It would be great to have the data cleaned up a bit more. For example, Nvidia chips are spread across at least 5 entries, and not all of them start with N, which makes them hard to find. There are also some “manufacturers” like "see description" and so on. Maybe just throw it at a local LLM; it will likely clean it up nicely.
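
For the obvious cases, a plain lookup table might already go a long way before an LLM is involved. Here is a rough Python sketch of that idea (a rule-based pass, not the LLM approach). The alias and placeholder lists are hypothetical examples I made up, not values taken from the actual data, and `normalize_manufacturer` is just an illustrative name.

```python
import re

# Hypothetical alias table: these variants are illustrative guesses,
# not entries copied from the real data set.
ALIASES = {
    "nvidia": "Nvidia",
    "nvidia corp": "Nvidia",
    "nvidia corporation": "Nvidia",
    "geforce": "Nvidia",
}

# Strings that are not real manufacturers (also assumed for illustration).
PLACEHOLDERS = {"", "see description", "unknown"}


def normalize_manufacturer(raw):
    """Return a canonical manufacturer name, or None for placeholder values."""
    # Lowercase, replace punctuation with spaces, collapse whitespace.
    key = re.sub(r"[^a-z0-9 ]+", " ", raw.lower())
    key = " ".join(key.split())
    if key in PLACEHOLDERS:
        return None
    # Names without a known alias pass through unchanged (only trimmed).
    return ALIASES.get(key, raw.strip())
```

Anything that comes back as `None` or passes through unchanged could then be the small leftover set that gets handed to an LLM or reviewed by hand.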