Skip to content

Near32/SymbolicBehaviourBenchmark

Repository files navigation

SymbolicBehaviourBenchmark

SymbolicBehaviourBenchmark (S2B) is a suite of OpenAI Gym-compatible multi-agent reinforcement learning environments built around referential games, designed to operationalise and empirically evaluate the symbolic behaviour traits introduced by Santoro et al. (2021) — namely receptivity, constructivity, malleability, and separability. The benchmark itself is proposed and described in:

Denamganai, K., Missaoui, S., & Walker, J. A. (2022). Meta-Referential Games to Learn Compositional Learning Behaviours. arXiv preprint arXiv:2207.08012. https://arxiv.org/abs/2207.08012

default_env


Overview

Human beings use compositionality to generalise from past experiences to novel ones — decomposing experiences into fundamental atomic components that can be recombined in novel ways. We refer to behaviours making use of this ability as Compositional Learning Behaviours (CLBs). Building artificial agents capable of exhibiting CLBs is a central challenge towards agents that can collaborate with human beings.

S2B is designed to investigate agents' abilities to exhibit CLBs in a domain-agnostic setting. Taking inspiration from the language emergence and grounding framework of referential games, it proposes a meta-learning extension — Meta-Referential Games — as the vehicle for this investigation. The benchmark exposes fine-grained control over vocabulary size, sentence length, latent structure, distractor count, and sampling strategy, enabling systematic evaluation of the symbolic behaviour traits characterised by Santoro et al. (2021): receptivity, constructivity, malleability, and separability.


Branches

Branch Description
main Stable release
S2B-LM Adaptation of S2B for language models — removes numerical processing as a confounding factor and incorporates chain-of-thought scaffolding to actively elicit Compositional Learning Behaviours (CLBs), as described in Denamganai (2025), "On Compositional Learning Behaviours in Formal Mathematics"

To use the S2B-LM variant:

git clone -b S2B-LM https://www.github.com/Near32/SymbolicBehaviourBenchmark
pip install -e ./SymbolicBehaviourBenchmark/

Installation

Via repository clone

git clone https://www.github.com/Near32/SymbolicBehaviourBenchmark
pip install -e ./SymbolicBehaviourBenchmark/

Usage

gym must be installed. The following example instantiates an environment configured to evaluate receptivity and constructivity:

import gym
import symbolic_behaviour_benchmark

env = gym.make(
    "SymbolicBehaviourBenchmark-ReceptiveConstructiveTestEnv-v0",
    nbr_communication_rounds=1,
    vocab_size=6,
    max_sentence_length=3,
    descriptive=True,
    nbr_latents=3,
    min_nbr_values_per_latent=2,
    max_nbr_values_per_latent=5,
    nbr_object_centric_samples=4,
    nbr_distractors=0,
    use_communication_channel_permutations=True,
    allow_listener_query=False,
    provide_listener_feedback=True,
    sampling_strategy="component-focused-4shots",
    domain="SCS",
)

Citation

If you use this benchmark in your research, please cite:

@article{denamganai2022meta,
  title={Meta-Referential Games to Learn Compositional Learning Behaviours},
  author={Denamgana{\"\i}, Kevin and Missaoui, Sondess and Walker, James Alfred},
  journal={arXiv preprint arXiv:2207.08012},
  year={2022}
}

About

Suite of OpenAI Gym-compatible multi-agent reinforcement learning environment to benchmark for behavioral traits pertaining to symbolic behaviours (primarily: being receptive, constructive, malleable, and separable) using referential games.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors