Not all LLM-based applications are user-facing free-form chat.
If you take an LLM that makes 10 tool calls in a row for an evaluation, any reduction in unpredictable drift is welcome. The same applies to running your prompt through a DSPy optimizer. [0] There are countless other examples: basically any situation where you control the prompt, i.e. the token-level input to the LLM, so there's no fuzziness.
In that case, once you've eliminated token-level fuzziness and can guarantee you're not introducing any from your own end, you can map out a much more reliable tree or graph of your system's behavior.
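A minimal sketch of what controlling the token-level input can look like, assuming the OpenAI Python client (the model name, prompt text, and seed value are placeholders; `seed` plus `temperature=0` reduces drift but is best-effort, not a hard determinism guarantee):

```python
# Sketch: pin down as much of the token-level input as possible.
from openai import OpenAI

client = OpenAI()

def run_fixed_prompt(user_input: str) -> str:
    # Exact, versioned prompt text: no timestamps, no retrieved context
    # that changes between runs.
    resp = client.chat.completions.create(
        model="gpt-4o-mini",   # pin a specific model snapshot if possible
        temperature=0,         # greedy-ish decoding
        seed=1234,             # best-effort reproducibility, not guaranteed
        messages=[
            {"role": "system", "content": "You are a tool-calling agent. v1.3"},
            {"role": "user", "content": user_input},
        ],
    )
    return resp.choices[0].message.content
```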
> If you take an LLM that makes 10 tool calls in a row for an evaluation, any reduction in unpredictable drift is welcome
Why use ambiguous natural language for a specific technical task? I get that it's a cool trick, but surely they can come up with another input method by now?
Could you provide a link to that system prompt? Because I'm confused. I typed in "Are you smart?" and got this back:
> That’s a thoughtful question! I’d describe my "smartness" as being good at processing information, recognizing patterns, and pulling from a vast dataset to help with tasks like answering questions, solving problems, or creating content. However, I’m not "smart" in the human sense—I don’t have consciousness, emotions, or independent critical thinking. I rely entirely on my training data and algorithms.
> Think of me as a tool that can assist with creativity, analysis, or learning, but I lack the depth of human intuition, lived experience, or true understanding. If you’re curious, test me with a question or challenge — I’ll do my best! (smiley emoji)
Most of those look like context issues to me. A repo map (built with Tree-sitter, etc.) and documentation would already do wonders. Feeding 32-64k tokens of context directly into a model like Gemini 2.5 Pro is something more people should try in situations like this. Or even 128k+ tokens.
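If you want to try the repo-map idea without pulling in Tree-sitter, here's a rough stdlib-only sketch for Python repos (the function name and the ~4-chars-per-token budget heuristic are my assumptions):

```python
# Rough "repo map": one line per file listing its top-level definitions,
# so the model sees the project's shape without the full source.
import ast
from pathlib import Path

def repo_map(root: str, budget_tokens: int = 32_000) -> str:
    lines = []
    for path in sorted(Path(root).rglob("*.py")):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue
        defs = [node.name for node in tree.body
                if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef))]
        if defs:
            lines.append(f"{path}: {', '.join(defs)}")
    text = "\n".join(lines)
    # Crude budget: ~4 characters per token on average.
    return text[:budget_tokens * 4]
```

Real tools do this across many languages with Tree-sitter; this is just the cheapest version of the same idea.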
Are you using a tool other than ChatGPT? If so, check the full prompt that's being sent. It can sometimes kneecap the model.
Tools with slightly unsuitable built-in prompts/context sometimes lead to the model saying weird stuff out of the blue, rather than it being a 'baked-in' behavior of the model itself. I've seen this happen with both Gemini 2.5 Pro and o3.
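One way to check, assuming the tool lets you override its OpenAI-compatible base URL (many respect `OPENAI_BASE_URL`): point it at a tiny local logger like this sketch. It only prints and rejects requests rather than forwarding them; for real interception you'd use something like mitmproxy.

```python
# Sketch: log the full prompt a tool sends to an OpenAI-style endpoint.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

class LogHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        body = self.rfile.read(int(self.headers.get("Content-Length", 0)))
        try:
            payload = json.loads(body)
            for msg in payload.get("messages", []):
                print(f"--- {msg.get('role')} ---\n{msg.get('content')}\n")
        except json.JSONDecodeError:
            print(body[:2000])
        self.send_response(502)  # log only; this sketch doesn't forward
        self.end_headers()

HTTPServer(("127.0.0.1", 8080), LogHandler).serve_forever()
```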
A lot of people make small games just for fun, like any other creative hobby: making music, writing, drawing, 3D modeling, etc.
Actually, you can put all of those together and use them in a game. And the best part is that there's no target market, no KPIs to hit, so you can do anything!