CoEvolution

This repository aims to implement multiple co-evolution algorithms designed for simulated reinforcement learning.

All work is one as part of an Internship at l'Institut des Systèmes Intelligents et de Robotique (ISIR), Sorbonne Université, Paris. The main objective of the related internship is to develop a new algorithm based on Quality-Diversity, co-evolving agents and environnements in order to find policies that generalize better.

Key-words : Reinforcement Learning, Co-Evolution, Evolution algorithms, Quality-Diversity

Implemented algorithms

Currently, the repository includes the full implementations of:

NSGA-II (Deb, K. et al 2002)
POET Enhanced (Wang, R. et al. 2020))
Evolution Strategies (Salimans et al., 2017)
A new NSGA-II inspired co-evolution algorithm

As well as a structure for co-evolution built up in a test/learner fashion:

IPCA structure (De Jong, E. D. 2004)

However, no algorithm was currently developped according to this structure.

More details can be found in the "Algorithms" section of the wiki.

Dependencies

Python 3.6 is required, due to the frequent use of f-strings. The package f2format (https://github.com/pybpc/f2format) may come in handy.

Main package dependencies are as follow :

Numpy / Matplotlib / Scipy
Ipyparallel (https://github.com/ipython/ipyparallel)
Gym (https://github.com/openai/gym)

Sci-kit Learn and Keras also appear in the code, although they are not used by default.

Some environments in the repository also need they own packages, including but not limited to :

Neat-python (https://github.com/CodeReclaimers/neat-python)
PyFastSim (https://github.com/alexendy/pyfastsim)

Details can be found the "Environments" section of the wiki.

Quick-start

The main configuration file, Parameters.py uses differed imports to synchronise parts of the code and to make modulation easier. The main parts that may need to be changed often are environments, agents and optimizers. Theses three can be anything inherited from abstract classes defined in ./ABC, defaults are :

gym BipedalWalkerV2 with CPPN-drawn landscapes
Numpy fully-connected NN with tanh activation
Adam optimizer

In order to run any of the main algorithms, one needs to start an ipyparallel cluster beforehand, which needs to be accessible in the same folder as the file that needs to run. As an exemple, running POET Enhanced with default arguments and a local cluster of 32 process :

ipcluster start -n 32

python POET_Main.py

Arguments can be written directly in the shell (--arg %d), arguments informations can be displayed with --h or --help.

It is possible to resume any execution with the argument --resume_from *folder*, loading the last indexed Iteration_%d file, archive if needed and the file Hyperparameters.json containing previous execution arguments.

Name		Name	Last commit message	Last commit date
Latest commit History 212 Commits
ABC		ABC
Algorithms		Algorithms
AnalysisTools		AnalysisTools
Objects		Objects
Results		Results
Utils		Utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Main_ES.py		Main_ES.py
Main_ipca.py		Main_ipca.py
Main_nnsga.py		Main_nnsga.py
Main_nsga2.py		Main_nsga2.py
Main_poet.py		Main_poet.py
Parameters.py		Parameters.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CoEvolution

Implemented algorithms

Dependencies

Quick-start

About

Uh oh!

Languages

License

jeremyaqp/CoEvolution

Folders and files

Latest commit

History

Repository files navigation

CoEvolution

Implemented algorithms

Dependencies

Quick-start

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages