juanre's comments | Hacker News

I built https://github.com/juanre/llmemory and I use it both locally and as part of company apps. Quite happy with the performance.

It uses PostgreSQL with pgvector for vector search, BM25-style full-text ranking for hybrid retrieval, multi-query expansion, and reranking.

(It's the first time I've shared it publicly, so I'm sure there'll be quirks.)
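
To give a flavor of the approach, here is a rough sketch of the hybrid-retrieval idea it is built on. This is illustrative only: the table, columns, and fusion constant are assumptions rather than llmemory's actual schema or API, and Postgres ts_rank_cd stands in here for proper BM25.

    # Hybrid retrieval sketch: fuse pgvector similarity with full-text rank.
    import psycopg

    def hybrid_search(conn, query, query_embedding, k=10):
        qvec = "[" + ",".join(str(x) for x in query_embedding) + "]"
        vector_hits = conn.execute(
            "SELECT id FROM chunks ORDER BY embedding <=> %s::vector LIMIT 20",
            (qvec,),
        ).fetchall()
        text_hits = conn.execute(
            "SELECT id FROM chunks "
            "WHERE tsv @@ plainto_tsquery('english', %s) "
            "ORDER BY ts_rank_cd(tsv, plainto_tsquery('english', %s)) DESC LIMIT 20",
            (query, query),
        ).fetchall()
        # Reciprocal rank fusion: reward chunks ranked highly by either retriever.
        scores = {}
        for hits in (vector_hits, text_hits):
            for rank, (chunk_id,) in enumerate(hits):
                scores[chunk_id] = scores.get(chunk_id, 0.0) + 1.0 / (60 + rank)
        return sorted(scores, key=scores.get, reverse=True)[:k]

Multi-query expansion adds a few rephrasings of the query before a step like this, and a reranker reorders the fused list at the end.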


If you want to learn how to unicycle, and you speak Spanish, you may be interested in a booklet I wrote when my kids were learning to ride: https://juanreyero.com/monociclo/

I have used the technique to teach other people, and it works surprisingly well.


Skills are also a convenient way to write self-documenting packages. They solve the problem of teaching the LLM how to use a library.

I have started experimenting with a skills/ directory in my open-source software, and I have made a plugin marketplace that just pulls them in. It works well, but I don't know how scalable it will be.

https://github.com/juanre/ai-tools
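
For concreteness, this is roughly the shape of it (a hypothetical layout, not the exact files in ai-tools):

    my-library/
      skills/
        using-my-library/
          SKILL.md      # short description of when and how to use the library
          examples.md   # small, copy-pasteable usage snippets
      src/
        my_library/
      pyproject.toml

The marketplace side then only has to pull in each repo's skills/ directory.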


I think they did, actually. (I worked at HP for 19 years, and I know the guy at their Barcelona site who started it.)


LLMRing (https://llmring.ai) is a unified interface across OpenAI, Anthropic, Google, and Ollama - same code works with all providers.

Use aliases instead of hardcoding model IDs. Your code references "summarizer", and a version-controlled lockfile maps it to the actual model. Switch providers by changing the lockfile, not your code.
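
In (hypothetical) code - this is a sketch of the pattern, not llmring's actual API or lockfile format:

    # Alias -> model indirection; the lockfile lives in version control.
    import tomllib  # Python 3.11+

    LOCKFILE = "llmring.lock"  # illustrative name

    def resolve(alias):
        with open(LOCKFILE, "rb") as f:
            return tomllib.load(f)["aliases"][alias]

    # Application code only ever mentions the alias:
    model_id = resolve("summarizer")  # e.g. "anthropic:claude-3-5-haiku"
    # Switching providers means editing the lockfile, not this code.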

Also handles streaming, tool calling, and structured output consistently across providers. Plus a human-curated registry (https://llmring.github.io/registry/) that I (try to) keep updated with current model capabilities and pricing - helpful when choosing models.

An MCP server and client are included, as well as an interactive chat that helps you define your aliases and models.

It's a thin wrapper on top of the underlying providers, so you are talking directly to them. It also comes with a Postgres-backed open-source server to store logs, costs, etc.

MIT licensed. I am using it in several projects, but it's probably not ready to be presented in polite society yet.


I built LLMRing (https://llmring.ai) for exactly this. Unified interface across OpenAI, Anthropic, Google, and Ollama - same code works with all providers.

The key feature: use aliases instead of hardcoding model IDs. Your code references "summarizer", and a version-controlled lockfile maps it to the actual model. Switch providers by changing the lockfile, not your code.

Also handles streaming, tool calling, and structured output consistently across providers. Plus a human-curated registry (https://llmring.github.io/registry/) that I keep updated with current model capabilities and pricing - helpful when choosing models.

MIT licensed, works standalone. I am using it in several projects, but it's probably not ready to be presented in polite society yet.


This is eerily close to some of the scenarios in Max Tegmark's excellent Life 3.0 [0]. Very much recommended reading. Thank you Simon.

0. https://en.wikipedia.org/wiki/Life_3.0


Yeah, I thought the same thing. I wonder if he has commented on it.


Neal Stephenson's Baroque Cycle trilogy does a great job of depicting this facet of Newton.


Location: Cambridge, UK

Remote: yes

Willing to relocate: yes

Technologies: Python, Common Lisp, JavaScript, technical leadership, systems architecture, several other programming languages, mechanical engineering.

Résumé: https://juanreyero.com/res/res.html

Email: juan@juanreyero.com

Github: https://github.com/juanre and https://github.com/awordai/aword (yes, I have also been doing a RAG implementation along with almost everyone else).

Currently Chief Engineer at a startup in Berlin making a novel 3D printer; also owner and creator of https://greaterskies.com

30 years of experience creating products and bringing them to market.


I pay for ChatGPT-4. It's helping me program noticeably better and much faster, even with mid-sized projects involving many files. This is the process that I follow:

- I write a system prompt with a succinct description of what I want to implement: "X is an online app that does Y, with these features: ..." I try to be exhaustive, and I write as I would if I were describing what I want to do to a very proficient programmer who needs to understand the problem and the solution. I save this to a prompts/0-system.txt file that will be part of the project.

- I design an architecture and define the general boundary conditions. I may ask ChatGPT for opinions, but at this stage, it's mostly based on experience. I add it to the system prompt.

- I write a description of the first chunk of functionality that I want to implement, usually a set of functions or a class, at the individual file level. For example, prompts/1-dal.txt.

- I (actually ChatGPT) wrote a very simple recursive cat script (https://github.com/juanre/catprompt) that combines several files into a single prompt (a file can include other files, etc.). I add a line to prompts/1-dal.txt pointing to the system prompt, and use catprompt to create a combined prompt that will include the overall description and the specifics of the functionality I am after (a sketch of the idea appears after these steps).

- I give this prompt to ChatGPT, and it produces the code. It's typically perfectly fine. I add it to the project, for example, as dal.py, review, and fix whatever may need fixing.

- I ask ChatGPT to create tests in the same session. This also tends to work fine.

- Then I move to the next piece of functionality, in a new session. I write another description, in another file, including the system prompt and usually the code already created.
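
For reference, the include-expansion idea is tiny. This is a minimal sketch of it; the directive syntax is made up here and this is not the actual catprompt code:

    # Recursively expand include lines so that prompts/1-dal.txt can
    # pull in prompts/0-system.txt, which can pull in other files, etc.
    from pathlib import Path

    def cat_prompt(path):
        path = Path(path)
        out = []
        for line in path.read_text().splitlines():
            if line.startswith("#include "):
                out.append(cat_prompt(path.parent / line.split(maxsplit=1)[1]))
            else:
                out.append(line)
        return "\n".join(out)

    print(cat_prompt("prompts/1-dal.txt"))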

The prompts remain: they become a description of the code (and of my intentions) for other programmers, and they can help ChatGPT write documentation as well.

I enjoy programming like this. The effort I make in writing concise descriptions helps me think more clearly. I used to love literate programming back in the 90s, and this feels very much like it. Knuth described it thus:

"Let us change our traditional attitude to the construction of programs: Instead of imagining that our main task is to instruct a computer what to do, let us concentrate rather on explaining to human beings what we want a computer to do."

Replace "human beings" with LLMs, and I think that's where we are now.


I've essentially been using ChatGPT-4 in the same way and can confirm that the code it gives typically works. If not, I decompose it into smaller pieces once and try again. If it fails there, I just write it myself.


And would you say this process saves you time compared with traditional programming? Or do you find other upsides to it?


It certainly saves me time, and I end up with better code and better documentation. I can stay thinking at a higher level, which removes cognitive friction. I usually don't want to be programming: I want the functions I need to be written and working. ChatGPT-4 takes care of that quite well.

