Giter VIP home page Giter VIP logo

arc's People

Contributors

ageron avatar alejandrodemiquel avatar beyretb avatar davenh avatar dmauskop avatar fchollet avatar gromgull avatar guyaglionby avatar jmmcd avatar kaalam avatar kokecacao avatar m3at avatar ngdelamo avatar petkovacs19 avatar skewwhiff avatar slyster avatar wizeman avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

arc's Issues

79fb03f4 test is unsolvable, water flow

Problem

ARC-Interactive

My first prediction

My reasoning.

  • It seems water can flow backwards into holes. maybe water can flow forward into a hole.

Screenshot 2024-02-25 at 02 04 43 copy

My second prediction

My reasoning.

  • Guide the water above the hole obstacle, without sending any water below the obstacle.

Screenshot 2024-02-25 at 02 09 55 copy

At this point I looked at what was the expected result.

Expected

expected

I'm quite surprised to see that the water flows into the hole. But not around the entire structure. It seems like a mistake.

There is no example among the train pairs, that shows this behavior.

Solution

Tweak one of the train pairs, so it demonstrates what to do in similar scenarios.

0d87d2a6 is ambiguous

Untitled

In all three examples, the blue lines cut through the red boxes. But in the "correct" output, a box is colored blue, despite the fact that the blue line was merely adjacent.

2dee498d train 0 input breaks the pattern

it seems to me that 2dee498d train 0 input is inconsistent with the other inputs of this task.

train 1, train 2, and test 0 consist of a simple 1x3 tiling of the output. The central block of 1x3 tiling of train 0 is reversed, thus breaking the pattern across the inputs.

issue

Apparently, 6 tasks seem to have the same implicit goal

Perhaps I'm simply misunderstanding the paper, but on page 46 it says the following regarding this repository:

  • Focus on measuring developer-aware generalization, rather than task-specific skill, by
    only featuring novel tasks in the evaluation set

It also says:

All tasks are unique, and the set of test tasks and the set of training tasks are disjoint.

I'm not entirely sure what is meant by "disjoint", "novel" or "unique" here (e.g. unique in the sense of the inputs/output pairs being different or unique in the sense that the implicit goal is different?), but I'm guessing the latter.

If so, then I think I found 6 tasks that have the same implicit goal.
They are the following:

data/training/ff805c23.json
data/training/dc0a314f.json
data/training/9ecd008a.json
data/evaluation/67b4a34d.json
data/evaluation/f4081712.json
data/evaluation/e66aafb8.json

The implicit goal can be described as something like "fill the largest homogeneously-colored rectangle with the pattern that is symmetrically opposite to its location", i.e. basically what is shown on figure 4 of the paper.

I found two of those tasks manually by chance, and to confirm that there was repetition I created a program that solves this particular algorithm and applied it to all the tasks in the public training and evaluation sets.

The program I wrote only tries to solve the task if:

  1. All the demonstration inputs have only one homogenously-colored rectangle with the largest area (i.e. it skips the task if multiple homogeneous rectangles with the same largest area are found in one of the demonstration inputs), and
  2. The demonstration outputs match (in grid size and grid contents) the output that is expected by this implicit goal.

This relatively simple skipping criteria (which seems to be required anyway, otherwise it would surely fail to solve the test input unless it matched by pure random chance) seems to successfully skip all the publicly-available tasks that are not meant to be solved with this algorithm, as the program never fails on the test inputs that it tries to solve.

If you want to, you can find the actual source code and instructions for compiling and running it here: https://github.com/wizeman/ARC/tree/solve/ocaml

Important note: the data/evaluation/e66aafb8.json file seems to be the same task that is shown on figure 4 of the paper, so if you end up deciding to delete any of these files for not being unique, I guess you wouldn't want to delete this one!

B7CB93AC.json is not unambiguous

Not exactly an issue, but most of the task in the dataset seems to have single unambiguous solutions while this and couple of others doesn't.

image
According to the rule we have to combine shapes in a way such that we get complete rectangle. Rotation is allowed. With that in mind solution like this should also be correct.
image

423a55dc test is unsolvable, skew

Problem

ARC-Interactive

The reasoning seems to be like this:

  • The bottom line of the orange shape stays where it is.
  • One line above the bottom, is moved one cell towards the left.
  • Two lines above the bottom, is moved 2 cells towards the left.
  • 3 lines above the bottom, is moved 3 cells towards the left.
  • There is no wrap around, so when the orange shape overflows the left edge, the shape doesn't reappear on the right side.

Following the reasoning, the prediction is:

predicted

However the expected output is supposed to be the following. I have highlighted the problem area.
expected copy

Solution

Replace the test output, with my prediction. So this task is solvable. Otherwise I doubt this task will be solvable.

Version ARC

@fchollet as people are now coding against this, and as "we intend to keep adding new tasks to ARC in the future", can we please version/tag this repo, ideally with release notes listing ids of new or modified tasks.

58E15B12.json has mistake in test output

image
It seems that by the rule when green and aqua lines are intersect they should produce magenta color. But in the provided test output single magenta cell is missing:
image

8FBCA751.json seems to have two correct solutions

image
The following image is an intended solution for the task. It assumes that bottom and top blocks fully connected between themselves. But if we don't assume that, following solution also seems feasible.
image

Scale and motion towards the observer

A number of the examples imply motion in 2D plane. I wonder if it would be useful to include simple examples that imply motion in 3D. That is, imagine a input grid with an object and where the output contains the same object scaled up (each pixel of input object become n*n in output, or vice versa) to simulate motion towards or away from the observer.

Untitled presentation (11)

This could also be combined with 2D motion

b230c067 make less ambiguous, so it doesn't require 2 attempts

Problem

ARC-Interactive

I used 2 attempts at solving this task. By making it more explicit, I think it can be reduced to 1 attempt.
Screenshot 2024-02-03 at 11 57 48

Attempt 1 - incorrect

At first glance, it seems the top-right object should be colored red, like this. But it's incorrect.
Screenshot 2024-02-03 at 12 20 23

Attempt 2 - correct

My second take. Objects with the identical shape gets colored blue, and odd-shapes colored red. This is correct.
Screenshot 2024-02-03 at 12 19 58

Solution

To make it less-ambiguous, I suggest rotating by 180 degrees on the first training pair, so it looks like this. This rules out the failed attempt 1.
b230c067-proposed-solution2

4852f2fa train output - move one pixel

ARC-Interactive

This issue is also reported here: volotat/ARC-Game#4

The output repeats the shape. But the repeat is inconsistent in the first training pair.

The light blue pixel in the bottom of the light blue shape, is misplaced by 1 pixel.
4852f2fa

This task has many training pairs to learn from, so no worries. The real world is rarely perfect.

310f3251 test is unsolvable, wrap around

Problem

ARC-Interactive

I was unable to solve this task.

My reasoning.

  • Maintain same repeating tile.
  • For every non-black pixel, insert a red pixel at relative position -1, -1.

My predicted solution

predicted

The expected solution

expected

Difference

annotated

My mistake was that I assumed maintain same repeating tile, thus assuming that the red pixels would wrap around the edge.

Solution

Add a train pair, that demonstrates that the tiles are non-repeating.

Adding customised tasks ?

In the app, would it be possible to let us create our own tasks ?
I guess such a tool was used to create the tasks, could it be possible to share it ?

Choose File not responding

The ARC testing interface does not load a file if it was loaded before selecting a random task.

To replicate, perform the following twice:

  • Click "Random task"
  • Click "Choose File" and select a file (must be same file both times)

The last step should load the selected file, but instead does nothing.

Abstraction and Reasoning Corpus --> bigbench?

Hi There!

This is a really cool corpus, @fchollet :-)

I'm wondering if a version of this task could be adapted to this ICLR workshop, centered on the construction of a big set of sequence-to-sequences benchmarks?:

https://iclr.cc/Conferences/2021/Schedule?showEvent=2147
https://github.com/google/BIG-bench/

I think I could adapt it in a few hours, and it could be a good candidate for addition, but I wanted to check to see if that would be okay with you.

Jack

Recursion Error After Certain Sequence of Operations

While porting the ARC testing framework to an OpenAI Gym environment, my random operation testing revealed an issue with the Fill Operation. Given a certain combination of operations, the environment runs into a recursion error.

This has been validated in both my Python implementation and the JS web interface of ARC testing (which was ported straight from the JS environment).

The commands on each line correspond to either fill or edit, followed by the row and col (indexed by 0) and the color symbol selections from the bottom pallete (also indexed by 0).

EDIT 0 2 1
EDIT 0 2 2
FILL 0 1 5
EDIT 2 1 5
EDIT 0 1 6
FILL 2 1 3
FILL 2 1 4
FILL 2 1 2
EDIT 0 2 3
FILL 0 0 6
EDIT 0 0 3
FILL 0 1 5
EDIT 2 0 3
> maximum recursion depth exceeded in comparison

20818e16 and b0f4d537 has mistype

my model told me that:
evaluation/20818e16.json ['train'][2]['output'] - size(9,8) => (8,8)
evaluation/b0f4d537.json ['train'][0]['output'] - middle line need move 1px righter
seems its true =)

Vary background color

@fchollet An intelligent agent should be able to recognize focal objects versus background, even if background color might vary among training examples.

There are various examples of deep-learned systems that recognize B&W digits well but fail spectacularly when colors are simply inverted. I would like to see an intelligent ARC-solver that is robust to this.

All the ARC examples I have seen so far have black backgrounds. Thus, I propose that ARC is augmented with examples with different background colors, sometimes yellow, sometimes white etc.

ARC in real world use

Hi,

Is there a plan to make a software which would convert real world scenes/tasks to/from ARC tasks?

I guess this is not trivial to do but it would be worth as AI researchers would be much more interested in ARC if they saw that ARC helps different general AI algos to be tested in real world usage via the converter + ARC.

br
Lajos

9def23fe train[0] output

There are two extra red pixel in the first train output. As I understand in this task the red square emits horizontal and vertical rays unless there is a non-red and non-black pixel that prevents it.
Screenshot from 2020-05-26 06-33-38

Train Task a9f96cdd.json - Problem with One Point

The fourth training output includes the red point from the input. From the other 3 training/output pairs and the test input/output pair, it seems the task is to add the 4 colored points centered around the red point and then remove the red point (make it black).

image

Maintaining ARC

Since this repo seems unmaintained at the moment, with quite a few pending PRs and open issues relating to bugs in the tasks -- is anyone aware of another repo which applies all of these fixes?

dc433765.json train 5 == test 0

It seems that dc433765's train 5 == test 0:

import json
filename = "../ARC/data/training/dc433765.json"
with open(filename, "r+") as jsonFile:
    data = json.load(jsonFile)
print(data['train'][5]['input'] == data['test'][0]['input'])
print(data['train'][5]['output'] == data['test'][0]['output'])

produces

True
True

dc433765

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.