
theahura · 2025-12-13T22:55:27 1765666527

Soft plug: take a look at https://github.com/tilework-tech/nori-profiles

I've spent the last ~4 months figuring out how to make coding agents better, and it's really paid off. The configs at the link above make claude code significantly better, passively. It's a one-shot install, and it may just be able to one-shot your problem, because it does the hard work of 'knowing how to use the agents' for you. Would love to know if you try it out and have any feedback.

(In case anyone is curious, I wrote about these configs and how they work here: https://12gramsofcarbon.com/p/averaging-10-prs-a-day-with-cl...

and I used those configs to get to the top of HN with SpaceJam here: https://news.ycombinator.com/item?id=46193412)

docheinestages · 2025-12-13T23:34:04 1765668844

How does it compare to Claude Code's skills [1] ?

[1] https://github.com/anthropics/skills/tree/main/skills

theahura · 2025-12-14T00:18:51 1765671531

Nori uses Claude Code's skills extensively, which you can see here: https://github.com/tilework-tech/nori-profiles/tree/main/src...

We build on Claude Code's skills support by defining a bunch of really useful, common skills that are necessary for writing software: for example, brainstorming, doing test-driven development, or submitting a git commit.

The specific skills you linked are interesting demos of what you can do with skills! But most of them are not useful for the day-to-day of building software.

theahura · 2025-12-11T02:10:55 1765419055

Pretty sure this whole post is generated by AI

theahura · 2025-12-08T17:15:55 1765214155

Please read the blog post!

toroszo · 2025-12-08T17:59:57 1765216797

I have, and I couldn't believe what it was saying and had to go see the code to verify. I'm really struggling to believe that anyone would consider this a "coding success".

fluidcruft · 2025-12-08T19:28:01 1765222081

Yeah, same. I thought it was saying it reproduced only the background due to not being able to figure out an offset due to a sloppy initial screenshot or something. Then I was wondering why all the link images looked fuzzy and tried to inspect them and also wondered why the links didn't line up with the buttons either with dev tools open.

On the plus side, it does somewhat explain the weird patterns in his diff image which I had been puzzling over.

Ukv · 2025-12-08T20:39:20 1765226360

> I'm really struggling to believe that anyone would consider this a "coding success".

The index_tiled.html version later in the article is what justifies the success claim IMO, and is the version I think it would've made more sense to host.

The currently hosted index.html just feels like a consequence of the author taking a scaled/compressed screenshot and asking Claude to produce an exact match.

theahura · 2025-12-08T17:15:13 1765214113

Please read the blog post!

gaigalas · 2025-12-08T17:24:20 1765214660

It's a joke, right? A joke similar to this one:

---

> Make me a python script that calculates the value of PI

```python
print("3.1415")
```

"I think it's passable!" <--- The joke

---

If it's not a joke, then it's just sad.

johnfn · 2025-12-08T17:33:56 1765215236

I hate to tell you this but all digital representations of pi are numeric approximations. Your joke works, but perhaps not in the direction you were angling for.

gaigalas · 2025-12-08T17:37:45 1765215465

Only the digital ones? oof, why so specific?

I would have accepted `22/7`.
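For anyone who wants to score the two candidates in this exchange, here is a quick sketch. It measures each against `math.pi`, which is itself only a double-precision approximation, so this is a comparison between approximations rather than against the true value.

```python
import math

# The two approximations mentioned in this subthread.
candidates = {"3.1415": 3.1415, "22/7": 22 / 7}

# Absolute error relative to math.pi (itself a 64-bit float approximation).
errors = {name: abs(value - math.pi) for name, value in candidates.items()}

for name, err in errors.items():
    print(f"{name}: off by {err:.2e}")
```

On these numbers the truncated decimal `3.1415` is closer to `math.pi` than `22/7` is.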

theahura · 2025-12-08T17:36:54 1765215414

First, you're being unnecessarily acerbic. It doesn't help your case, and it's just kinda weird!

Second, the original post was obviously about the placement of the buttons on the space jam website.

Third, I spend at least half the blog post responding to the exact complaint you have. If you do not have more to add beyond pointing out that the 'hack' exists, you aren't adding to the conversation.

Fourth, the blog post and the repo have a version that does not include the screenshot and actually tiles the gif.

I'm still convinced you haven't actually read the blog post because you have shown zero indication that you are engaging with the material. In which case, why even bother commenting?

gaigalas · 2025-12-08T17:44:55 1765215895

Can I offer some valid criticism?

The original Space Jam website is fluid (it's 90s lingo for responsive).

It is also still a relevant website because it is a living fossil of that era's way of doing web design.

A request to recreate it should include those aspects (epoch-relevant technical achievements such as fluid layouts) and faithfulness to the original implementation.

I'm not saying that Claude should know that out of the box (it would have been impressive if it did), but the prompt should have included those ideas.

A modern reconstruction in CSS3, in contrast to a faithful reproduction, should have mirrored what the techniques accomplished with modern tools. It would be useful in a sense of showcasing how CSS3 evolved, it would have a purpose.

Do you understand why this is not passable? It has no value as a recreation.

theahura · 2025-12-08T16:22:41 1765210961

It's less than a few hundred words. The full total of what I typed into Claude to get the first version is:

Initial prompt:

> I am giving you:

> 1. A full screenshot of the Space Jam 1996 landing page (screenshot.png)

> 2. A directory of raw image assets extracted from the original site (files/)

> Your job is to recreate the landing page as faithfully as possible, matching the screenshot exactly.

> Use the webapp-testing skill. Take screenshots and compare against the original. <required>You must be pixel perfect.</required>

plan response:

> they should all go to tilework.tech

> exact screenshot dimensions

which is 75 words

theahura · 2025-12-08T16:14:01 1765210441

Note that I didn't even tell it to use a pixel diff. Claude w/ Nori did that on its own by following the Nori TDD skill. I did very little, I'm actually very lazy :D

stanac · 2025-12-08T16:19:42 1765210782

There is a quote about lazy developers, but I'm too lazy to search for it.

eCa · 2025-12-08T16:39:53 1765211993

Laziness is one of the three virtues (of a good programmer), but I think Larry didn’t anticipate the current situation when he wrote it:

“The quality that makes you go to great effort to reduce overall energy expenditure. It makes you write labor-saving programs that other people will find useful and document what you wrote so you don't have to answer so many questions about it.”

theahura · 2025-12-08T16:07:21 1765210041

author here -- it took like 5 minutes of actual attention from me? I'm not sure why you are counting reading the blog post or setting up playwright. I guess I did read the blog post, but I'm not sure that should count. And claude set up playwright, not me.

theahura · 2025-12-08T16:01:06 1765209666

https://tilework-tech.github.io/space-jam/

The site claude made is live on github pages now too, enjoy

Aldipower · 2025-12-08T17:34:38 1765215278

Does not render correctly here. It does not zoom properly, and a window resize also has weird effects. Recreation not finished, I guess.

Ukv · 2025-12-08T20:29:53 1765225793

From the article, Claude asked:

> The screenshot shows viewport-specific positioning - should we match at a specific viewport size or make it responsive?

And the author responded:

> exact screenshot dimensions

So it's only intended to replicate the screenshot, but I do agree that making it center/zoom properly would've been more interesting.

internetter · 2025-12-08T16:51:20 1765212680

For me it's not center aligned.

layer8 · 2025-12-08T16:54:22 1765212862

Yeah, it renders quite differently depending on screen/window geometry.

theahura · 2025-12-08T15:58:55 1765209535

There were a few prompts that went into a single commit, so that doesn't quite make sense in this case. I posted the transcript, both in the original jsonl format and in markdown.

theahura · 2025-12-08T09:26:13 1765185973

I was able to get Claude to do this, though it kinda sorta cheated. Blog post describing the output here: https://theahura.substack.com/p/i-successfully-recreated-the...

TLDR:

"The plan is designed to ‘autoformalize’ the problem by using Test Driven Development (TDD). TDD is incredibly important for getting good outputs from a coding agent, because it helps solve the context rot problem. Specifically, if you can write a good test when the model is most ‘lucid’, it will have an easier time later on because it is just solving the test instead of ‘building a feature’ or whatever high dimensional ask you originally gave it.

From here, Nori chugged away for the better part of half an hour in yolo mode while I went to do other things. And eventually I got a little pop up notification saying that it was done. It had written a playwright test that would open an html file, screenshot it, diff it with the original screenshot, and output the final result...

After trying a few ways to get the stars to line up perfectly, it just gave up and copied the screenshot in as the background image, then overlaid the rest of the HTML elements on top.

I’m tempted to give this a pass for a few reasons.

This obviously covers the original use case that tripped up Jonah.

It also is basically exactly what I asked the model to do — that is, give me a pixel perfect representation — so it’s kind of my fault that I was not clearer.

I’m not sure the model actually can get to pixel perfect any other way. The screengrab has artifacts. After all, I basically just used the default linux screenshot selection tool to get the original output, without even paying much attention to the width of the image.

If you ask the model to loosen the requirements for the exact screengrab, it does the right thing, but the pixel alignment is slightly off. The model included this as index_tiled.html in the repo, and you can see the pixel diff in one of the output images..."
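The screenshot-and-diff test described in the quote above can be sketched roughly as follows. This is not the test Claude actually wrote. It is a minimal illustration that assumes images have already been decoded into nested lists of `(R, G, B)` tuples, and it leaves out the Playwright capture step. The names `pixel_diff` and `assert_pixel_perfect` and both threshold parameters are hypothetical.

```python
# Hypothetical sketch: images are plain nested lists of (R, G, B) tuples so the
# example runs without third-party libraries. A real test would load PNGs
# (for instance a browser screenshot) instead.

def pixel_diff(expected, actual, tolerance=0):
    """Return the fraction of pixels with any channel differing by more than tolerance."""
    if len(expected) != len(actual) or any(
        len(row_e) != len(row_a) for row_e, row_a in zip(expected, actual)
    ):
        raise ValueError("images must have identical dimensions")

    total = sum(len(row) for row in expected)
    if total == 0:
        return 0.0

    mismatched = 0
    for row_e, row_a in zip(expected, actual):
        for px_e, px_a in zip(row_e, row_a):
            if any(abs(c_e - c_a) > tolerance for c_e, c_a in zip(px_e, px_a)):
                mismatched += 1
    return mismatched / total


def assert_pixel_perfect(expected, actual, max_mismatch=0.0, tolerance=0):
    """Fail when more than max_mismatch of the pixels differ."""
    ratio = pixel_diff(expected, actual, tolerance)
    assert ratio <= max_mismatch, f"{ratio:.2%} of pixels differ"


black = (0, 0, 0)
star = (255, 255, 255)
original = [[black, star], [black, black]]
shifted = [[star, black], [black, black]]  # same star, one pixel to the left

assert_pixel_perfect(original, original)  # identical images pass
print(pixel_diff(original, shifted))      # a one-pixel offset already fails a strict check
```

With `max_mismatch=0.0`, a star field that is off by a single pixel fails, while pasting the reference screenshot in as the background passes trivially. That is consistent with the shortcut described in the quote. Loosening `max_mismatch` or `tolerance` is one way to express the relaxed requirement mentioned for `index_tiled.html`.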