
I agree. But my impression was that she took the fact that Google Translate rapidly decays into gibberish as an indication that it's not doing a good enough job. I don't think you can argue that, precisely because Google Translate has no capacity for interpretation.


Don't read too much into that example -- I chose it as a humorous metaphor, not a mathematical argument, and I messed up the delivery in the talk, anyway.

I'll refine the example for the next time!


I'm really interested in doing more machine learning work (my current projects, as interesting as they are, don't really require it).

I've done a few weirdo projects with NLTK, tho, and it's great fun. By "stream hacking", do you mean offloading learning sets (active or initial) and that heavy overhead into the "cloud", or am I misunderstanding the terminology?


In most data analysis work, we assume that the data resides in some database and that you have the luxury of iterating over that data as many times as you like to get to a final result.

The challenge with stream analysis is that you are dealing with a continuous stream of data where you can see each element of the stream only one time and must still be able to cluster/classify/analyze it. There are still few algorithms and tools designed explicitly for that purpose.
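To make the "see each element only one time" constraint concrete, here is a minimal sketch of a one-pass computation. It uses Welford's online algorithm for a running mean and variance, which is a standard streaming technique, though not necessarily what anyone in this thread is using. The names RunningStats and summarize are made up for illustration.

```python
from typing import Iterable


class RunningStats:
    """Running mean and sample variance via Welford's online algorithm.

    Hypothetical helper for illustration: each value is folded into the
    summary once and then discarded, so memory use stays constant.
    """

    def __init__(self) -> None:
        self.n = 0          # number of elements seen so far
        self.mean = 0.0     # running mean
        self._m2 = 0.0      # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        """Fold a single stream element into the summary."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        # Uses the mean both before and after the update (Welford's trick).
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        """Sample variance; defined as 0.0 until two elements have arrived."""
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


def summarize(stream: Iterable[float]) -> RunningStats:
    """Consume the stream exactly once and return its summary."""
    stats = RunningStats()
    for x in stream:
        stats.update(x)
    return stats


if __name__ == "__main__":
    # A generator can only be iterated once, which mimics a real stream.
    readings = (float(v) for v in [2, 4, 4, 4, 5, 5, 7, 9])
    result = summarize(readings)
    print(result.n, result.mean, result.variance)
```

The same shape (a small fixed-size state plus an update step per element) is what streaming clustering and classification methods have to fit into as well; the hard part is that most batch algorithms assume they can revisit the data.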



