My new TV wouldn't work unless I agreed to it recording me and uploading those recordings to its servers, where they may be stored temporarily while the audio is transcribed to text for more permanent storage.
My TV is forcing me into an employment agreement where I generate data to train their models or otherwise 'improve the service'.
Data is so valuable that companies are risking a huge PR backlash. Data collection is the business model, and I assume the same ethos will make its way into open source.
I believe 2021 was the tipping point after which most text content is AI generated, so to avoid training your LLM on other LLMs' output they restrict the data cutoff to 2021.
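A minimal sketch of that kind of cutoff filter, assuming documents carry a publication date (the field names here are made up for illustration):

    from datetime import date

    # Hypothetical corpus filter: keep only documents published before the
    # cutoff, on the assumption that later text is increasingly LLM-generated.
    CUTOFF = date(2021, 1, 1)

    def pre_cutoff(docs):
        return [d for d in docs if d["published"] < CUTOFF]

    corpus = [
        {"text": "older article", "published": date(2019, 6, 1)},
        {"text": "newer article", "published": date(2023, 2, 14)},
    ]
    print(len(pre_cutoff(corpus)))  # -> 1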
I suppose because there is a theory that a native application can provide a better video experience than WebRTC. I've no idea whether that is true for Zoom.
This is a very old project; glad to see it get the credit it deserves.
One interesting thing: if you show someone this for a random sentence and ask them what they think, they always say, "How did it copy my writing?" Everyone thinks it looks like their own writing, likely because it's the average of all people's writing. Try it out.
Another reason these diagrams should be expressed in some sort of code is that LLMs are currently not well versed in architecture, because it's missing from the training set.
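As a rough illustration of what "diagrams as code" could look like (the components here are made up), a diagram can be plain text that sits in a repo and could end up in a training set:

    # A made-up four-component system described as data, then emitted as
    # Graphviz DOT text -- plain text rather than a picture.
    components = ["web", "api", "queue", "db"]
    edges = [("web", "api"), ("api", "queue"), ("api", "db"), ("queue", "db")]

    def to_dot(nodes, edges):
        lines = ["digraph architecture {"]
        lines += [f'  "{n}";' for n in nodes]
        lines += [f'  "{a}" -> "{b}";' for a, b in edges]
        lines.append("}")
        return "\n".join(lines)

    print(to_dot(components, edges))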
That is correct. Still, it's enough to establish that you (linked with your driving licence/passport) visited Pornhub 37 times on the 12th of December. They don't store exactly what you were doing, but obviously it's not like the public cares about that. And Pornhub is a very mild example; let's say a website you visited pulled some resource from 4chan (a website known to harbour terrorists and pedophiles! /s) or from somewhere that sounds like a terrorist organisation. Or even Hacker News (are you a hacker? that's illegal, you know). You have no control over your DNS lookups (by default), so you don't actually know what gets written in those logs, nor have any way to inspect them.
And specifically because the logs are so crap and don't actually contain any information beyond the domain name, they can be used to infer pretty much anything the prosecutors might want to see. Otherwise, why even keep them?
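To make the point concrete: a DNS query only ever carries the hostname, never the URL path or what you did on the site, so a resolver-side log holds little more than a timestamp, a client, and a domain. A small sketch (the log line format below is invented, not any particular resolver's):

    import socket

    # A DNS lookup resolves a bare hostname to addresses. The resolver sees
    # only the hostname, so its log can record little more than something like
    #   2023-12-12T14:03:07Z  192.0.2.10  news.ycombinator.com  A
    host = "news.ycombinator.com"
    addrs = {info[4][0] for info in socket.getaddrinfo(host, 443)}
    print(host, "->", addrs)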
Use an LLM to generate comments, maybe fine-tuned on HN.
When real people start showing up they will see comments and assume it's not a wasteland. As real people arrive, reduce the bot comments.
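A rough sketch of that ramp-down, where generate_comment is a hypothetical wrapper around an LLM fine-tuned on HN-style comments and the activity threshold is an arbitrary assumption:

    import random

    def bot_comment_probability(real_comments_last_day, target=50):
        # Falls linearly toward zero as real activity approaches the target.
        return max(0.0, 1.0 - real_comments_last_day / target)

    def maybe_post_bot_comment(thread, real_comments_last_day, generate_comment):
        # generate_comment is a hypothetical callable wrapping the LLM.
        if random.random() < bot_comment_probability(real_comments_last_day):
            thread.append(generate_comment(thread))

    # Usage sketch with a stubbed-out generator:
    thread = ["Show HN: my new thing"]
    maybe_post_bot_comment(thread, real_comments_last_day=5,
                           generate_comment=lambda t: "Neat, how does it handle scale?")
    print(thread)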