Hacker Newsnew | past | comments | ask | show | jobs | submit | godmode2019's commentslogin

You need to fake users to get the network effects to start.

Use a LLM to generate comments maybe finetuned on HN

When real people start showing up they will see comments and assume its not a wasteland. As real people come reduce the bot comments.


My new TV wouldn't work unless I agreed to it recoding and uploading those recordings to it's servers which may be temporary stored while they are transcribing the audio to text for more permanent storage.

My TV is forcing me into a employment agreement where I generated data to train their models or otherwise 'improve service'.

Data is so valuable companies are risking a huge PR backlash. Data collection is the business model and I assume the same ethos will make its away into open source.


I believe 2021 was the tipping point where most text content is now AI generated, so to avoid training your LLM with other LLM output they restrict the date to 2021.


>> most text content is now AI generated

do you have any sources to back it up or is it a gut feeling?


Why would you Install any of this spyware when you can just use the web version and know everything is sandboxed as good as a browser can offer.


I suppose because there is a theory that a native application can provide a better video experience than webrtc. I've no idea whether that is true for zoom.


Because they don’t always do the same things? I’m assuming that as someone with this vocal a view, you seldom actually use the app if you can help it.


This is a very old project glad to see it get the credit it needs.

One interesting thing is if you show someone this for a random sentence and ask them want do they think. They always say how did it copy my writing. Everyone thinks it looks like their own writing. Likely because its the average of all peoples writing. Try it out


Can you confirm that was for inference? I thought that was only for training 55min on 8x v100


You are right, inference only uses one single v100 according to the paper.



Something interesting that will come from these LLM is university is all about writing papers, you get your degree by writing papers.

Now you can argue each year the value of a university degree will drop because more and more people will get LLM to write their papers for them.

The whole higher education model may need to change, its a similar paradigm shift as the invention of the graphs calculator.


Another reason these diagrams should be expressed in some sort of code is LLM currently are not well versed in architecture because its missing from the training set.


When you say browsing history you mean DNS lookups?

I wouldn't think they have access to your actual URLs as this is HTTPS, only the domain name. Or am I missing something.


That is correct. Still, it's enough to establish that you(linked with your driving licence/passport) have visited pornhub 37 times a day on the 12th of December. They dont' store exactly what you were doing, but obviously it's not like the public cares about that. And pornhub is a very mild example, let's say a website you visited pulled some resource from 4chan(website known to harbour terrorists and pedophiles! /s) or somewhere that sounds like it's a terrorist organisation. Or even hacker news(are you a hacker? that's illegal you know). You have no control over your DNS lookups(by default) so you don't actually know what gets written in those logs, nor have any way to inspect them.

And specifically because the logs are so crap and don't actually contain any information beyond the domain name, they can be used to infer pretty much anything the prosecutors might want to see. Otherwise, why even keep them?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: