More

mstef · 2025-04-14T13:59:04 1744639144

sure, for bookmarking that's fine. but it does not help against linkrot, or censorship, or pages changing with time.

quantadev · 2025-04-14T14:13:27 1744640007

I agree. Saving web-links is a solved problem. Saving the CONTENT afaik is not solved and I've wanted a solution for YEARS, as well as the rest of humanity has needed this (including those who don't even know they need it, haha!).

Closest thing would be to "print" to PDF but that always creates a trainwreck in the PDF and is never good enough for me. Looking forward to trying Omnom if it's open source.

asciimoo · 2025-04-14T14:20:59 1744640459

It's not only open source, it is free software. Take a look at https://github.com/asciimoo/omnom - suggestions/contributions are appreciated =)

quantadev · 2025-04-14T16:57:13 1744649833

That looks like a pretty heavy-weight solution, with a lot of complexity, and I don't mean that as a criticism at all. I'm not a 'go' developer myself. I've always wanted a pure JS solution (as a browser extension, maximum of 200 lines of code) that can capture the content of a web page (doing a virtual scroll to the bottom, to capture the whole page). Since there's no perfect way to translate HTML to PDF, my idea had always been to capture the IMAGE of the page (aside from capturing keywords for DB indexing which can be done separately just for 'search' support later on).

The fly in the ointment is of course the scrolling too, because some apps have "infinite" scrolling, and so in many SPAs there's literally no such thing as "The whole page". Anway, I haven't tried your app yet, because of not-JS and not-Small, reasons, but I'm just sharing my perspective on this topic. Thanks for sharing your project!

Ametrin · 2025-04-16T19:48:06 1744832886

I recently released a Chrome extension that converts webpages to PDF. It's free, but you need to register to get a key. Unfortunately, this solution isn't client-side JavaScript; I'm using an API underneath. To be honest, I mainly created it to promote the API, but if it's useful for people, I might develop it further. Perhaps it could be useful to you in some way. I don't know your requirements, but maybe with this base in the form of this extension, it wouldn't be difficult to add something that meets your expectations, let me know. However, if you want to export a PDF from Ahrefs, for example, I'm afraid that might not be possible; currently, only basic authentication is supported. Unless maybe I could add an option like in my API to pass JavaScript code, but I also doubt that would work because Ahrefs probably has some bot protection.

edit: i forgot the link https://chromewebstore.google.com/detail/pdfbolt-web-to-pdf/...

quantadev · 2025-04-17T01:48:14 1744854494

Thanks for sharing that. Looks pretty nice!

mstef · 2025-04-14T13:12:48 1744636368

looks nifty, but does it do snapshotting?

pratio · 2025-04-14T16:43:37 1744649017

It also does highlighting which is absolutely amazing. No need to save an entire page just for a piece of text. Also, you can have opds for ebooks.

braggerxyz · 2025-04-14T14:39:56 1744641596

Yes it does. Also written in Go and pretty hackable.

mstef · 2025-04-14T12:47:25 1744634845

the difference is, that archive.ph snapshots something in headless. omnom snapshots the exact same state that your browser is displaying you. so if there is js interactions that change the dom, those will be snapshotted, unlike with archive.ph.

also lets not forget that archive.ph wraps everything in their own frame and has their own way of mangling the result. not in a bad way, it's just not the original as it would have been rendered in your browser.

viraptor · 2025-04-14T12:52:43 1744635163

Archive.ph is often used, because the "js interactions that change the dom" is typically a paywall. And you won't want it.

mstef · 2025-04-14T12:59:28 1744635568

omnom is for snapshotting, not for circumventing paywalls. i'm merely comparing the snapshot feature of the two projects. circumventing paywalls is out of scope.

your bookmarks will never linkrot away.

mstef · 2025-04-14T13:57:28 1744639048

btw it is perfectly fine to circumvent a paywall with archive.ph and then to snapshot it with omnon so your bookmark never linkrots away. also when i say "js manipulation" i also mean stuff like captchas, or dynamic documents that you change by interacting with it, or even private services like e.g. rocket chat hidden behind some barrier like http auth, or private vpn. archive.ph will never have access to what your browser might have access to.

mstef · 2025-04-14T12:17:07 1744633027

btw if you want to have a look at more than a decade of snapshots, try https://links.ctrlc.hu/ my private and membership invitation-only instance.

mstef · 2024-08-01T18:10:07 1722535807

this is a recent (1-2 months old) report on the impact of the NGI0 programme:

https://op.europa.eu/en/publication-detail/-/publication/257...

there was also a fosdem talk about this, if you prefer AV to text, finding it i leave as an exercise to the interested parties...

mstef · on June 21, 2024

besides the docs, there is also all the source code in /usr/lib/zig/std... very useful to learn zigisms and the API of the std library itself...

mstef · on June 8, 2024

what's next that urgently needs mem-safety? /bin/true?

mstef · on June 8, 2024

this must be parody. what exactly is the threatmodel where memory-safety matters for a calculator? did these devs miss the point of popping a calc.exe? surely no bc has ever been used for LPE or RCE.

jgarzik · on June 8, 2024

The entire repo is written in Rust. bc is one of 145 utils specified by POSIX.

mstef · on May 21, 2024

john scalzi: starter villain

note, if your wife doesn't like to be woken up, do not try to read this next to her in bed. you will LoL, and wake her up.

iamthirsty · on May 21, 2024

> you will LoL, and wake her up.

Ironically, I committed that mortal sin the other night around 1 am.

Guess I'll have to add this to a "morning" specific reading list.

cableshaft · on May 22, 2024

Just checked out Old Man's War from the library. First book of his I'll be reading.

mstef · on March 30, 2024

this backdoor had nothing at all to do with memory safety.