
rustdeveloper · 2025-02-06T21:22:03 1738876923

Happy to suggest another web scraping API alternative I rely on: https://scrapingfish.com

xeornet · 2025-02-07T12:05:10 1738929910

What’s the chance you’re affiliated? Almost every one of your comments links to it. And curiously similar interest in Rust from the official HN page and yours. No need to be sneaky.

rustdeveloper · 2025-01-12T20:08:55 1736712535

Interesting data for sugar in products scraped from Walmart: https://scrapingfish.com/blog/scraping-walmart

swatcoder · 2025-01-12T20:17:19 1736713039

I was really excited to read this, but it's very shallow and barely presents any data at all.

Its main and mostly meaningless conclusion is that there are more SKUs in the Walmart catalog for sugar-dominated products than for any protein- or fat-dominated ones, and that online reviewers generally tend to leave better reviews for those.

Nothing about actual sales volume, promotional actions by Walmart, or monetary or nutritional share of food consumed (or even purchased), etc -- any of which might say something more impactful.

rustdeveloper · 2024-12-20T12:02:28 1734696148

And many more: https://compareproxy.com/

KomoD · 2024-12-22T19:19:11 1734895151

("Scraping Fish" owns this site but it isn't disclosed anywhere, and this guy "rustdeveloper" seems associated with them, most of his comments push that service)

rustdeveloper · 2024-12-22T19:35:40 1734896140

This is correct, my friends from Scraping Fish are hosting https://compareproxy.com to help people find a proxy for web scraping. I'm happy to "push" for Scraping Fish as I'm also a satisfied user who received a lot of help from the founders for my web scraping projects.

rustdeveloper · 2024-10-14T19:33:18 1728934398

For web scraping at scale you want to get lost in the crowd. This usually means being (or pretending to be) Chromium on Windows. Unusual browsers look suspicious, get detected, or have a very distinct fingerprint.
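
A minimal sketch of the "pretending to be Chromium on Windows" part, assuming a plain HTTP client is enough for the target site. The version in the User-Agent string, the `CHROME_WINDOWS_UA` name, and the `build_request` helper are hypothetical, chosen only for illustration. Headers alone do not change the TLS or JavaScript fingerprint, so this covers only the most basic layer.

```python
import urllib.request

# Assumed example value: a typical Chrome-on-Windows User-Agent string.
# The version number is illustrative, not a recommendation.
CHROME_WINDOWS_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
    "AppleWebKit/537.36 (KHTML, like Gecko) "
    "Chrome/120.0.0.0 Safari/537.36"
)


def build_request(url: str) -> urllib.request.Request:
    """Build (but do not send) a request carrying browser-like headers."""
    headers = {
        "User-Agent": CHROME_WINDOWS_UA,
        "Accept-Language": "en-US,en;q=0.9",
    }
    return urllib.request.Request(url, headers=headers)


# Building the request performs no network I/O.
req = build_request("https://example.com/")
```

Passing `req` to `urllib.request.urlopen` would send it with those headers instead of the default `Python-urllib` User-Agent.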

KPGv2 · 2024-10-14T20:41:49 1728938509

Indeed. I heard about a browser called Zen a couple weeks ago and installed it. Just took it for a drive yesterday, and by the end of the day, Reddit had blocked me just based on my sporadic, normal use of the site for about two hours here and there while I did other things.

I switched back to Safari and it worked normally immediately.

rustdeveloper · 2024-09-25T10:40:27 1727260827

This is terrible news :( I know it was an option for web scraping and I used it once. I’m curious what the real reason is that they took it down.

optymizer · 2024-09-25T14:00:58 1727272858

I have seen a push in the past year or so for saving storage across Google products. Caching the Internet takes a lot of storage. I suspect that's why they've removed it.

rustdeveloper · 2024-09-23T08:23:07 1727079787

Surprisingly, according to your tool, HN is neutral on “web scraping”. I noticed others also reported a bias toward neutral on other keywords.

rustdeveloper · 2024-09-22T09:50:03 1726998603

There are also SaaS products with usage-based pricing. It depends on what SaaS, software, or product it is. Different pricing models work for different things.

alanfranz · 2024-09-22T11:47:28 1727005648

But sometimes the product I need doesn't offer the right subscription for me.

rustdeveloper · 2024-09-22T09:40:25 1726998025

Actually, they do allow this. I store my photos and iPhone backup on a Synology NAS. You just have to devote some time to set it up yourself.

rustdeveloper · on Feb 20, 2024

I'm using Scraping Fish because of their pay-as-you-go pricing, as opposed to a subscription with a monthly scraping volume commitment. And they don't charge extra credits for JS rendering or residential proxies because the cost of each request is the same: https://scrapingfish.com

rustdeveloper · on Oct 16, 2023

This reminds me of https://scrapingfish.com/blog/are-most-rust-jobs-in-crypto :)