> Since pixelation does not protect the contents of the pixelated area (see e.g. https://github.com/bishopfox/unredacter), _pseudo-pixelation_ is used:
> Only colors from the fringe of the selected area are used to generate a pixelation-like effect. The interior of the selected area is not used as an input at all and hence can not be recovered.
The edges of the pixelated area are used to generate a color palette, and then each pixel is generated by randomly sampling from that palette's gradient.
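A rough sketch of how such pseudo-pixelation could work, using Pillow. The function name, cell size, and one-pixel fringe are illustrative assumptions, not the tool's actual implementation (which samples from a gradient built from the palette rather than raw fringe colors):

```python
# Sketch of pseudo-pixelation: the redacted block is built only from
# colors sampled on the border of the region, never its interior.
import random
from PIL import Image

def pseudo_pixelate(img: Image.Image, box: tuple, cell: int = 12) -> Image.Image:
    """Replace `box` (left, top, right, bottom) with cells colored by
    random picks from the region's one-pixel fringe."""
    left, top, right, bottom = box
    # Collect the fringe palette: pixels just outside the selected area.
    fringe = []
    for x in range(left, right):
        fringe.append(img.getpixel((x, max(top - 1, 0))))
        fringe.append(img.getpixel((x, min(bottom, img.height - 1))))
    for y in range(top, bottom):
        fringe.append(img.getpixel((max(left - 1, 0), y)))
        fringe.append(img.getpixel((min(right, img.width - 1), y)))
    out = img.copy()
    # Fill each cell with one random fringe color: the interior pixels
    # are never read, so nothing about them can be recovered.
    for cy in range(top, bottom, cell):
        for cx in range(left, right, cell):
            color = random.choice(fringe)
            out.paste(color, (cx, cy, min(cx + cell, right), min(cy + cell, bottom)))
    return out
```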
To make it more fun for the maths nerds and to keep them guessing, replace the underlying contents with mostly random garbage (probably not full-on obvious white noise) and then pixelize that: https://imgur.com/a/CTM4Zlv :)
I remember a protocol which required the text to be replaced with random-length output of a Markov chain text generator, and only then pixelizing.
Oh, you've spent hours on unpixelizing my secrets? Well congratulations, is the last telescope that, nor drink from shrinking nothing out and this and shutting.
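For the curious, such a generator is only a few lines. This order-1 word-level chain is just a sketch, and "corpus.txt" is a placeholder for whatever source text you feed it:

```python
# Minimal order-1 word-level Markov chain text generator (sketch).
import random
from collections import defaultdict

def build_chain(text: str) -> dict:
    chain = defaultdict(list)
    words = text.split()
    for cur, nxt in zip(words, words[1:]):
        chain[cur].append(nxt)
    return chain

def babble(chain: dict, n_words: int) -> str:
    word = random.choice(list(chain))
    out = [word]
    for _ in range(n_words - 1):
        # Restart at a random word if the current one has no successor.
        word = random.choice(chain[word]) if chain[word] else random.choice(list(chain))
        out.append(word)
    return " ".join(out)

chain = build_chain(open("corpus.txt").read())
# Random-length output, so even the length of the original text leaks.
# nothing once the garbage is pixelized.
print(babble(chain, random.randint(8, 40)))
```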
Oooh oooh I know, I know! Replace the text with strings of all-caps five-letter groups that look just like oldschool CW encrypted messages, and that'll keep the MXGJD SWLTW UODIB guessing until AMEJX OYKWJ SKYOW LKLLW MYNNE XTWLK!
Flameshot (a screenshot tool) in its newer versions (!!) uses random noise for pixelation, and colors it based on the un-noised surroundings so it blends in reasonably.
It's a nice mix of optically unobtrusive, algorithmically secure, and pleasant to look at.
Good article - one takeaway is that any redaction process which follows a fixed algorithmic sequence (convolutions, transformation filters, etc) is potentially vulnerable to a dictionary attack.
I see what you mean, but FWIW “fixed” doesn’t sufficiently constrain or describe it. For example, filling a rectangle with black or random pixels is a fixed algorithmic sequence, same might go for in-painting from the background. The redaction output simply should not be a function of the sensitive region’s pixels. The information should be replaced, not modified.
> Remember, you want to leave your visitors with NO information, not blurred information.
Blacking out text still gives attackers an idea of the length of the original, which can be useful information, especially when the original is something like a person's name. You can mitigate that by either erasing the text completely (e.g. replace it with the background color of the paper) or making the bars longer.
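A tiny illustration of the "longer bars" mitigation with Pillow; `bar_width` is an arbitrary fixed width chosen to exceed the longest text you expect, so every bar ends up the same size:

```python
# Sketch: draw a redaction bar of constant width so bar length leaks
# nothing about the length of the name underneath it.
from PIL import Image, ImageDraw

def redact_bar(img: Image.Image, text_box: tuple, bar_width: int = 400) -> None:
    left, top, right, bottom = text_box
    # Never let the bar be narrower than the text it must cover.
    width = max(bar_width, right - left)
    ImageDraw.Draw(img).rectangle((left, top, left + width, bottom), fill="black")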
Yeah I helped out a bit with Freenet before I saw what was being posted. Basically 4chan. Lots of edge lords.
But I helped because a friend dragged me to Amnesty International meetings in college and so I knew there were people who legitimately needed this shit.
Tor is the big example for me: created to give people the ability to speak freely without being tracked, and often criticized because it allows those same things for our criminals (it has to be kept in mind that the spies and dissidents that are/were using Tor are considered criminals in their own countries).
When a law is unjust it will be broken by those on the right side of history. Software can’t tell if a law is just or not.
So if you want to support suffragists or underground railroads you’re making software that breaks the law.
Really we are all breaking some law all the time. Which is how oppression works. Selective enforcement. "Give me six lines from the most innocent man and I will find in them something to damn his soul."
There is no such thing as "good" or "bad" - actions are meaningless - it's the context that makes the difference.
Example: Sex
Good when the context is consenting adults (humans)
Bad when the context is not.
Further, "One man's 'freedom fighter' is another man's 'terrorist'" - meaning context is very much in the eye of the beholder.
Couple this with the (Taoist?) fable "What luck you lost a horse", where the outcome of an event cannot really be determined immediately; it may take days, months, or years to show.
And you are left with: do we really have any idea of what is right/wrong?
So, my philosophical take is - if it leads toward healthy outcomes (ooo dripping with subjective context there...) then it's /likely/ the right thing to do.
When I spoke with an AI about this recently, the AI was quick to respond that "Recreational drug use 'feels good' at first, but can lead to a very dark outcome" - which is partly true, but also demonstrates the first point. Recreational drug use is fine (as far as I am concerned, after my 4th cup of tea) as long as the context isn't "masking" or a "crutch" (although in some cases, e.g. PTSD, drug use to help people forget is a vital tool).
Or, you do the equivalent of adding a salt: apply mosaic to it twice with two slightly different block sizes. Or apply both mosaic and swirl in random order. Or put a piece of random text over it before you mosaic it.
The main point here stands -- using something with a fixed algorithm for hashing and a knowable starting text is not secure. But there are a ton of easy fixes to add randomness to make it secure.
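As a sketch of the double-mosaic idea, with block sizes chosen at random per redaction so an attacker can't precompute a dictionary of rendered candidates (though, as the reply below notes, this alone may still leak information):

```python
# Sketch: pixelate twice with two randomly chosen, unequal block sizes.
# The randomness plays the role of a salt against dictionary attacks.
import random
from PIL import Image

def mosaic(img: Image.Image, block: int) -> Image.Image:
    small = img.resize((max(1, img.width // block), max(1, img.height // block)),
                       Image.BILINEAR)
    return small.resize(img.size, Image.NEAREST)

def randomized_mosaic(region: Image.Image) -> Image.Image:
    a, b = random.sample(range(8, 16), 2)  # two distinct block sizes
    return mosaic(mosaic(region, a), b)
```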
Surprised to see my article float up again so many years later.
I wouldn't consider a mosaic + swirl to be fully secure either, though, especially considering both of these operations may preserve the sum of all pixels, which may still leak enough information to dictionary-attack a small number of digits.
It's probably the least secure of the ones I mentioned, yes. But even so, it massively increases the search space for a dictionary attack because the attacker doesn't know which algorithm was applied first.
But yes, at the end of the day, the best bet is to just take a mosaic of a random text and place it over the text you're trying to obscure. The reason people use mosaic is because it is more aesthetic than a black box, but there is no reason it has to be a mosaic of the actual text.
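A sketch of that "mosaic of random text" approach with Pillow; the decoy string length, block size, and default font are all arbitrary choices:

```python
# Sketch: render throwaway text, pixelate it, and paste it over the
# sensitive region. The result is a function of random text only,
# never of the pixels being hidden.
import random
import string
from PIL import Image, ImageDraw

def decoy_mosaic(size: tuple, block: int = 10) -> Image.Image:
    decoy = Image.new("RGB", size, "white")
    junk = "".join(random.choices(string.ascii_letters + string.digits, k=24))
    ImageDraw.Draw(decoy).text((4, 4), junk, fill="black")  # default font
    small = decoy.resize((max(1, size[0] // block), max(1, size[1] // block)))
    return small.resize(size, Image.NEAREST)

def redact(img: Image.Image, box: tuple) -> None:
    left, top, right, bottom = box
    img.paste(decoy_mosaic((right - left, bottom - top)), (left, top))
```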
I don't think this does enough to destroy the data you're trying to hide. Each blur operation on its own is reversible; I don't see why stacking blurring operations, even if they affect different areas each time, changes things.
You should be blacking out information, to be sure, but credit card numbers are one of the very few examples where cracking makes sense, given that otherwise you don't know the pattern or the font. Assuming it's text at all.
Or the common case of redacting a name, address, or other sensitive text in a screenshot of a web page, word doc or PDF. In those, getting the font is very straightforward.
You also don't need to match the whole redacted text at once - depending on the size of the pixels, you can probably do just a few characters at a time.