Exactly. I would prefer to remain invested as I dont want to time the market. But I would prefer if I could meaningfully reduce exposure to OpenAI and the consequences of their possible downfall.
Keep in mind that the "news cycle" isn't of much use in this field. For 2025, almost all "mainstream" media was dead wrong in their takes. Remember the Deepseek r1 craze in feb25? Where nvda is dead, oai is dead and so on? Yeah... that went well. Remember all the "no more data" craze? Despite no actual researcher worth their salt saying it or even hinting at it? Remember the "hitting walls" rhetoric?
The media has been "social media'd", with everything being driven by algorithms, everything being about capturing attention at the cost of everything else. Negativity sells. FUD sells.
> Remember all the "no more data" craze? Despite no actual researcher worth their salt saying it or even hinting at it?
We ran out of fresh interesting data. A large chunk of training needs to generate its own now. Synthetic data training became a huge thing over the last year.
> Remember the "hitting walls" rhetoric?
Since then the basic training slowed down a lot and improvements are more in the agentic and thinking solutions, with lots more reinforcement training than in the past.
The fact we worked around those problems doesn't mean they weren't real. It's like people say Y2K wasn't a problem... ignoring all the work that went into preventing issues.
No, we didn't. Hassabis has been saying this for a while now, and Gemini3 is proof of that. The data is there, there are still plenty of untapped resources.
> Synthetic data training became a huge thing over the last year.
No, people "heard" about it over the last year. Synthetic data training has been a thing in model training for ~2 years already. L3 was post-trained on synthetic-only data, and was released in apr24. Research only was even earlier with the phi family of models. Again, if you're only reading the mainstream media you won't get an accurate picture of these things, as you'd get from actually working in this field, or even following good sources, read the key papers and so on.
> The fact we worked around those problems doesn't mean they weren't real.
The way the media (and some influencers in this space) have framed it over the last year is not accurate. I get that people don't trust CEOs (and for good reasons), but even amodei was saying there is no data problem in early interviews in 25.
That's very interesting! I see numbers as... The actual numbers along a line which is a bit more like stairs. The number 8 is higher up than the number 5. My own Rainman visualisation is that when I write a text I can _feel_ what others will feel when they read what I've written. Whether it's funny, sad or boring (or clever!) I can just read the passage and at the same time feel what my readers will feel.
Regarding his identity; I used to hang out on #coders on IRCNet from the mid 90's to the early 2000's. There was a guy there called MostlyHarmless and I think both the age and nationality checks out. I haven't been there for ages and I am pretty sure that the channel was password protected but it would probably take somebody just a few minutes to check around there amongst the regulars to see if they're the same guy. Right now there isn't a user online called MostlyHarmless, which there always was back then.
Thank you for the compliment about the copywriting! I did try to put effort into explaining what Mailhero actually does and to make sure that I explained why I made it instead of pretending to be somebody else. :)
I am going to add that functionality soon. You will, of course, have to click the recaptcha for each message send... I hope everybody understands why. :)
Ditto, I get that a lot too. That pregnant pause on the other end of the line after you get as far as <companyname>@...
What I have discovered as well, is that gmail/hotmail/yahoo addresses are so ubiquitous, that call centre staff get very confused when your email address doesn't end in one of the 'big three' domain names.
Also don't get me started on the problems I've had trying to explain that yes, .io is a valid domain ending for an email address...
Another thing I am working on for Mailhero is to start publicly listing sites/organizations that have leaked mail addresses. This list currently includes Coca Cola, Adobe, Neteller and others.
You can add box.net to the list. I used a one-off email address on their service and it got leaked. They were at least honest enough to confirm it did indeed happen to them:
We were recently informed by a handful of users that they had received spam email at the address associated exclusively with their Box account. We scoured our own systems and checked every possible scenario, and didn't find any evidence of our systems being compromised. Thanks to information sent to us by our customers, we were able to pin down that a third-party email service we used to send our newsletter to select users in February, March and April had been compromised.
No other information beyond email addresses were ever exposed to this vendor.
We sincerely apologize for the inconvenience this has caused our users. We continue to be committed to your privacy and keeping their confidential information safe. As a result of this issue, we're leaving this particular service provider immediately and consolidating all of our customer communication efforts within a single vendor with more robust security practices.
reply