Hacker Newsnew | past | comments | ask | show | jobs | submit | Jaxkr's commentslogin

Great project. The last time someone did this idea well they got acquired by Microsoft. Clipchamp has since been enshittified, making them ripe for disruption. The wheel continues to turn…

The author of this post could solve their problem with Cloudflare or any of its numerous competitors.

Cloudflare will even do it for free.


Cool, I can take all my self hosted stuff and stick it behind centralised enterprise tech to solve a problem caused by enterprise tech. Why even bother?


"Cause a problem and then sell the solution" proves a winning business strategy once more.


Cloudflare seems to be taking over all of the last mile web traffic, and this extreme centralization sounds really bad to me.

We should be able to achieve close to the same results with some configuration changes.

AWS / Azure / Cloudflare total centralization means no one will be able to self host anything, which is exactly the point of this post.


They don't. I'm using Cloudflare and 90%+ of the traffic I'm getting are still broken scrapers, a lot of them coming through residential proxies. I don't know what they block, but they're not very good at that. Or, to be more fair: I think the scrapers have gotten really good at what they do because there's real money to be made.


Probably more money in scraping than protection...


Cloudflare won't save you from this - see my comment here: https://news.ycombinator.com/item?id=46969751#46970522


Parent of your comment became [flagged][dead], which broke your in-context link.

A direct link works, however:

https://news.ycombinator.com/item?id=46970522


For logging, statistics etc. we have the Cloudflare bot protection on the standard paid level, ignore all IPs not from Europe (rough geolocation), and still have over twice the amount of bots that we had ~2 years ago.


The scrapers should use some discretion. There are some rather obvious optimizations. Content that is not changing is less likely to change in the future.


They don't care. It's the reason they ignore robots.txt and change up their useragents when you specifically block them.


I'm pretty sure scrapers aren't supposed to act as low key DOS attacks


I think the point of the post was how something useless (AI) and its poorly implemented scrapers is wrecking havoc in a way that’s turning the internet into a digital desert.

That Cloudflare is trying to monetise “protection from AI” is just another grift in the sense that they can’t help themselves as a corp.


you don't understand what self-hosting means. self-hosting means the site is still up when AWS and Cloudflare go down.


Roblox pays the full 30%.


Only for in app purchases of Robux. They are uniquely allowed to distribute their own app store (not based on HTML/JS applets) within the App Store, which is against the terms that every other Apple developer agrees to. And they are allowed to use a virtual currency that can be obtained elsewhere without an Apple Tax to pay for digital goods purchases inside an iOS app, bypassing the IAP system, again against the terms.


FYI the second link (about Anthropic RLHF) 404's. Here's an archive link: https://web.archive.org/web/20250428152855/https://www.surge...


On performance: 3.13 removed the GIL and added experimental first-party JIT (like PyPy).

In two years I bet we’ll be seeing v8 level performance out of CPython.


The “Faster CPython” team were let go from Microsoft because they could only produce a 1.5x speedup in four years instead of the planned 5x.

It’s wildly optimistic to now expect a 10x speedup in two years, with fewer resources.


Them being let go "because they didn't meet 5x" is hearsay. The only source for that is in social-media commentary and opinion pieces. Microsoft described the layoffs as "organizational adjustments".


That sounds awfully like a scene by scene replay of Google's Unladen Swallow experience.

https://peps.python.org/pep-3146/#rationale-implementation


At least Microsoft hired a bunch of experienced Python core developers for their effort. IIRC Unladen Swallow was just a couple of interns.


Wow, know you make me curious about the business processes at Microsoft. Did they see that they would earn more money if the interpreter had a 5x speedup, that they wouldn’t see with 1.5x? Or was it trust broken?


Instead of generating more revenue, it would drive down costs. You will need less computers to do the same amount of work if the work can be done faster.


Lower costs could open new markets, as it would allow you to charge less and still make a profit.


Depends if they are the right resources.


Depends if it’s possible.


Python is slow due to design decisions in the language. For example operator dispatch is slow without some kind of static analysis. But this is hindered by how dynamic the language is.


It's hard to make Python run fast when it pervasively uses duck typing. It makes types only resolvable at runtime. JIT is the only thing that can work here at the moment, but I think that needs to make very similar assumptions to a branch predictor, plus it needs to identify lexical regions (is that what they're called?). People here have criticised PyPy, but I've forgotten why.


Have you checked out what PyPy is already capable of?


I'd be surprised if we saw anything more than the 4x speedup from compiling Python with something like Nuitka/mypyc/etc can bring.

I also believe the JIT in v8 and Python are different, the latter relying on copy-and-patch while v8 uses a bunch of different techniques together.


Obviously dumb microbenchmark, but here's ~17x on my machine:

  $ time python -c 'sum(range(1_000_000_000))'

  real 0m19.997s
  user 0m19.992s
  sys 0m0.005s

  $ time pypy -c 'sum(range(1_000_000_000))'

  real 0m1.146s
  user 0m1.126s
  sys 0m0.020s


I think some relatively simple math JITs and compiles nicely like that, but when you start using other parts of the language heavily like you would in a real project, it averages out to about ~4x due to the object, VM and locking model, I believe. It's been a while since I've looked into this.


I would surprised to see performance as good as V8, although that would be great. As I recall the v8 team performed exceptionally well in a corporate environment that badly wanted js performance to improve, and maybe inherited some Hotspot people at the right time.

I'd be quite delighted to see, say, 2x Python performance vs. 3.12. The JIT work has potential, but thus far little has come of it, but in fairness it's still the early days for the JIT. The funding is tiny compared to V8. I'm surprised someone at Google, OpenAI et al isn't sending a little more money that way. Talk about shared infrastructure!


pypy is probably faster. Lets put effort into that. BUT the dynamic features that make python lovely are always going to limit its performance.

If you're using python because you have to then you might not like all that and might see it as something to toss out. This makes me sad.


I bet we’ll be seeing python compiled to JVM of getting JVM levels of performance. Much better than v8


There have for a long time been IronPython (CLR) and and Jython (JVM).

But, they don't have the full compatibility with CPython, so nobody really picks them up.


Jython seems to be effectively dead though - it only has 2.7 compatibility.


You are right: GraalPy (https://www.graalvm.org/python/) is where it's at these days.


JVM Python exists for the longest time now, where "exists" is purely technical. It's very cursed and bad, keeping in line with the rest of Java-adjacent stack.


Yet this "Java-adjacent stack" wipes the floor with Python and its ilk w.r.t performance and is what's actually running the world outside of some silicon valley ephemeral unicorns.



Has something changed that allows a more relaxed refcounting / less eager "gc"? Py_DECREF was what murdered any hope of performance back when we hooked up 3.3 to OMR... Well that and the complete opacity of everything implemented in C


It didn't "remove the GIL". It added an experimental free-threading mode which removes it, but is still considered experimental and not widely used in production yet.


This is an amazing product. I don't have kids yet but I would buy this for them if I did!

However, since this is Hacker News, I must say I'd probably enjoy building this myself using TTS and LLM APIs...


Build it :)

It's the most fun I have had in a long time. Building a character and having it sit on your desk and chatter/say things you don't expect.

I love absurdism humor. This hits the spot for me.


FXAIX


I think some of the power user demand is fairly inelastic. I’ve seen developers who are allergic to spending money happily drop $200/mo on those new Claude subscriptions.


Yeah but if you push the price up, given that many users will cancel their subscriptions you will end up with still a tiny market segment relative to what is necessary, in revenues, to justify the valuations purported.


It's a tricky one, there is also a lot of push right now to use AI so developers are incentivized to drop money on subscriptions. I'd have difficulty justifying 1k/month for smaller shops - but corporations will be different. If the average engineer is just 20% more productive, then that is a 30-60k value to the company.

I don't have difficulty getting to a 20% productivity gain with AI just from automating the tasks I procrastinate on or can't focus on. Likewise the ability to code a prototype overnight/over the weekend is a reasonable extension of practical working hours.

The challenge I do see is that fully AI generated code bases devolve into slop pretty fast. The productivity cutoffs are much lower compared to human engineers.


The “Cursor for Xcode” startups just got Sherlocked…


Were there really such startups? It's so obviously a bad idea..


There's Alex (https://www.alexcodes.app), YC-backed.


Like… you’d expect a company to evaluate the potential for competition, right? But these AI companies are obviously not actual companies with any business model, most are just trying to grab some investors money while they can surf the hype.

I always find this article something to get back to: https://www.inc.com/magazine/20110301/making-money-small-bus...


> In terms of value per minute spent, it’s the same tier of slop as TikTok or Instagram

Insane take. Reddit hosts deep threaded discussions on almost any topic imaginable. In its prime it was the best forum on the internet. There’s a reason people commonly add “reddit” to the end of their search queries.

Unfortunately it feels like the community has gotten much dumber after they banned third party apps and restricted API access. It’s also lost almost all of its Aaron Swartz style hacktivist culture.

Reddit, in its prime, was incredible and beloved by almost everyone I know (most of which are far outside the HN sphere)


It feels like all those hacktivists moved to Discord... Which is even more "locked away" than Reddit.

I miss the old skool php web forums.


There are still many around - most of them die because admins give up or users leave - if you actually miss them it should be easy to find some for your interests


I would love to have some directory with all kinds of active (PHP) web forums. That was the heyday of the open web for me.

Do you have any tips on how to specifically search for these forums? Without just googling for topics and browsing hours to find some. When I think about it, just googling/searching might be the only way.



Nice one, thanks!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: