What would you classify the following objects under?

Likely some form of chair or place to sit. Collection is obvious.

What about these following objects:

You would think of them as fuel sources, also quite obvious. But notice how the wood stump falls under both collections.

This is what can make classification for Evolutionary Robotics. In neuroscience/philosophy we can think of this hierarchy under the Rhizome hierarchy. Also similar to Obsidians tag clustering methods. Also reminiscent of Cortical Columns in the Neocortex.

Minimal Cognition

For a CTRNN experimental agent working towards passability experiments. Agent moves horizontally while a wall with an adjustable aperture falls from above. The rays of the agent’s proximity sensors are shown.

Our fitness function will look like:

where:

for an opening too narrow for the agent to pass through and:

for an aperture wide enough for the agent for pass through, and is the final horizontal separation between the center of the agent and the center of the aperture at the end of the trial. This fitness measure assigns near zero fitness to incorrect options.

After Training

After performing an evolutionary algorithm, deleting poor performers and making random modifications.

Strategy

Below graphs how the apertures would move in relation to the agent. Note that while the apertures fall at a consistent rate, the horizontal movement is due to the agent moving from side to side. Left graph is for apertures 1 unit smaller than the agent, right side is 1 unit larger apertures. Time starts at top and increases downward.


New Experiment

Agent has now been modified for the task of self/nonself discimination. Agent is now stationary, but can swing an “arm”, which also interacts with its proximity sensors. Goal was to see if it could understand the variable it controlled and how it affected its sensors.

Fitness Function

where:

and is the angular error at the end of the trial.

After Training

As one would expect, model seems to align arm with object. Final angular position of the hand is plotted against the final angle of the object.

Also plotted is arm trajectories over time of the best self/nonself discrimination agent catching objects at the midline from initial hand positions at either the left or right edge of the visual field. The trajectories are shaded according to the initial angular position of the object as indicated at the top of the plot.


Selective Attention

We change the goal yet again. The agent now has to select which target to focus on, as there are now two ones coming at once. It still has to catch both, but it has to select which one to catch first.

Experimental setup for selective attention experiments. The agent can move horizontally while 2 objects fall from above. the initial positions and velocities of the two objects are constrained so that the agent as a reasonable chance of catching both.

Fitness Function

where:

  • is the horizontal distance between objects ‘s final positions and agents final position.

PO: passing objects’ problem OP: object permanence problem

Arising Problems

Behavior of the best selective attention agent. Example motion plots are shown for different combinations of passing objects (PO) and object permanence (OP).


A short Segment on Gradient Descent

Beyond actual evolutionary algorithms to train our robots, we also have the option of Gradient Descent to tune our parameters.

We seek to calculate a behavioral loss, a function which calculates the amount of error in prediction vs actual. We then use Chain rule to calculate the derivative at that point in the loss function, such that Gradient Descent can take a step in the direction of loss minimization. This is the forward pass.

Backward pass is when we actually tune the neurons with our new information.

In the robotics perspective, remember that weights are not purely for modifying the neuronal weights of the robot, but also to alter parts of the body itself. Perhaps we can tune the mass of the front leg, as this could also be to blame for a nonzero loss.