Hacker News | dfrodriguez143's comments

Your submission receives no love but my one afternoon hack does... oh the humanity... lol

That is some amazing work, thanks!


Amazing! Thanks!


Thanks!


I like to use the Readability API so I don't need to look at the HTML of every single site. I wrote up an example here: http://danielfrg.github.io/blog/2013/08/20/relevant-content-...


Should work with most newer versions of any browser.

Which browser are you using?


Firefox 22 on Windows 8


Very interesting and simple improvement. Definitely will take a look at that.


I agree completely: a more complex benchmark should be done with a complete cross-validation.

Just for future reference, I did run the fitting a few times and found very similar results (within ±2%). Also, Random Forests already average over their trees, so there is probably not much to improve on for that particular algorithm.


To be honest, I don't expect the results to change, but this is the only way to attach significance to the observed differences and to ensure this wasn't a lucky shot.
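
For anyone who wants to try the cross-validation discussed here, this is a minimal sketch in plain Python. It is not the benchmark from the post: the data is synthetic, and `fit_threshold` / `predict_threshold` are hypothetical stand-ins for whatever model is being compared. The point is only that each fold gives its own score, so a difference between two algorithms can be read against the spread across folds rather than against a single run.

```python
import random
import statistics

def k_fold_indices(n_samples, k, seed=0):
    """Shuffle the indices once and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_val_scores(fit, predict, X, y, k=5, seed=0):
    """Return one accuracy per fold; every fold is held out exactly once."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        correct = sum(predict(model, X[i]) == y[i] for i in held_out)
        scores.append(correct / len(held_out))
    return scores

# Hypothetical stand-in model: a 1-D threshold midway between the class means.
# It assumes class 1 has the larger mean, which holds for the data below.
def fit_threshold(X, y):
    mean0 = statistics.mean(x for x, label in zip(X, y) if label == 0)
    mean1 = statistics.mean(x for x, label in zip(X, y) if label == 1)
    return (mean0 + mean1) / 2

def predict_threshold(threshold, x):
    return int(x > threshold)

# Synthetic two-class data, for illustration only.
rng = random.Random(42)
X = [rng.gauss(0, 1) for _ in range(100)] + [rng.gauss(2, 1) for _ in range(100)]
y = [0] * 100 + [1] * 100

scores = cross_val_scores(fit_threshold, predict_threshold, X, y, k=5)
print(f"accuracy: {statistics.mean(scores):.3f} +/- {statistics.stdev(scores):.3f}")
```

Running the same loop for two models with the same `seed` puts them on identical folds, which makes the per-fold scores directly comparable.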


Definitely a lot to read and improvements to make. I will probably do a more complete benchmark with more datasets in a later post.

Thanks for the suggestions.


You may be interested in this ICML 2006 paper, which empirically compared many standard algorithms across a combination of metrics and UCI datasets - http://www.cs.cornell.edu/~caruana/ctp/ct.papers/caruana.icm...


Yes: http://danielfrg.github.io/feeds/all.atom.xml

Gonna add a direct link from the site soon.


Definitely a native Spanish speaker here :P. Because I wrote this in an iPython notebook, it takes a little bit longer to spell-check. I will try not to be so lazy next time.

Thanks for the tips.


The only one I caught was "state of the are" in the last sentence.


Blogging with iPython notebooks has never been easier.

