Hacker News new | past | comments | ask | show | jobs | submit | neoneye2's comments login

Sidenote: `Tufa Labs` team includes the `MindsAI` team of ARC-AGI fame. https://tufalabs.ai/team.html


nice!


The zip files include: WBS, SWOT, Assumptions, Criticism from experts.

It can be tried out here: https://huggingface.co/spaces/neoneye/PlanExe

If you have python skills: https://github.com/neoneye/PlanExe/tree/main


Here is a video how to use PlanExe

https://www.youtube.com/watch?v=7AM2F1C4CGI


Examples of plans outputted by PlanExe https://neoneye.github.io/PlanExe-web/

These are zip files, containing mostly json files, some markdown and a csv file. The tasks gets assigned a UUID. Is this a format that makes sense?

I'm using LlamaIndex so it's not tied to a particular LLM provider. I prefer using OpenRouter with the new Gemini 2.0 Flash. And I also like Ollama with Llama3.1 where I can inspect the logs as the code is running.

I'm using Luigi for a DAG representation of the data being exchanged between the many agents.


The 3rd place solution by Agnis Liukis, solves 40 tasks. https://www.kaggle.com/code/gregkamradt/arc-prize-2024-solut...


Can we get this person to explain thier inspiration?


As a hobby project, I did investigate the distribution of prime numbers.

The primes are somewhat evenly spaced with this transformation, I'm the author of it.

A342730: a(n) = floor((frac(e * n) + 1) * prime(n+1)).

https://oeis.org/A342730/a342730.png

Instead of e, I have tried other constants such as pi, but it doesn't look as good. I guess there is another constant that makes the distribution look even nicer.


I don't think this is anything special about primes or e - if you replace prime(n+1) with just (n+1) itself you get the same sort of patterns. But it is something to do with approximations of irrationals by rationals - you might want to look into continued fractions. Try replacing e with a rational number a/b (say 8/3 or 11/4); then you get b horizontal-ish lines, corresponding to the different remainders of n when divided by b. So the pattern you get with pi isn't "as good" because pi is famously close to 22/7.


In that case, you might get the best results with Liouville numbers.


There are 100 tasks that is hidden from the public, that is only exposed, when running on an offline computer. So the solver has no prior knowledge about what these tasks are about.

Humans can try the 800 tasks here. There is no time limit. I recommend not starting with the `expert` tasks, but instead go with the `entry` level puzzles. https://neoneye.github.io/arc/?dataset=ARC

If a model jumps to 100%, that may be a clever program or maybe the program has been trained on the 100 hidden tasks. Fchollet has 100 more hidden tasks, for verifying this.


The task is here. https://neoneye.github.io/arc/edit.html?dataset=ARC&task=11e...

There are many examples where the test is slightly OOD (out of distribution), so the solver will have to generalize.


Not sure what you mean. There's a viable answer that's marked incorrect. The examples should show the pattern well enough to eliminate possible wrong answers, correct?


IRL problems are often underspecified, and despite minor mistakes, humans manages to solve the puzzle.


Nice overview/details. Do you plan on adding more metrics?

Idea for a metric: - Number of pixels that stays the same between input/output. - Histogram changes.


Thanks, yeah lots more to look into. Just getting started! Thanks for your work. Your "Awesome ARC" page looks really helpful.


Melanie is coauthor/supervisor of ConceptARC, that can be tried here: https://neoneye.github.io/arc/?dataset=ConceptARC


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: