This code is meant to serve as a starting point for future research on learning to select with imitation learning and/or reinforcement learning. The pipeline is organized into five steps:
- Generate instances | (data/{problem}/instances/{instance_type}_{difficulty}/instance_*.lp)
- Generate solutions | (data/{problem}/instances/{instance_type}_{difficulty}/instance_*-*.sol)
(a) Remove infeasible instances and instances with fewer than 100 nodes explored (see the filtering sketch after this list).
- Generate samples | (data/{problem}/samples/{instance_type}_{difficulty}/sample_*.pkl)
(a) For each [instance, solutions] set, generate [state, action] pairs from the oracle in nodesel_oracle.py.
(b) The state includes both nodes of the comparison, with a representation defined in extract.py.
(c) The action is sampled using the Sampler class and encoded as 0 (left) or 1 (right); the resulting sample format is sketched after this list.
- Train model RL/IL | (experiments/{problem}/04_train_il/{seed}_{timestamp}/best_params_il_{mode}.pkl)
| (actor/{problem}/{model_id}.pkl)
(a) MLP policy: input is the concatenation [branching_features, node_features, global_features] (sketched after this list).
(b) GNN policy: [not fully implemented]
- Evaluate policies | (experiments/{problem}/05_evaluate/{seed}_{timestamp}/{experiment_id}_results.csv)
(a) Evaluates on all available [test] and [transfer] instances, with results averaged over 5 runs.
(b) Evaluation results are reported as geometric means with geometric standard deviations (see the sketch after this list).
(c) Results can be recalculated based on the csv files using postprocessing.py.
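
The filtering rule from step 2a, as a minimal sketch assuming PySCIPOpt; solve_and_filter and min_nodes are hypothetical names, and the actual scripts may store several solutions per instance (instance_*-*.sol):

```python
from pyscipopt import Model

def solve_and_filter(instance_path, sol_path, min_nodes=100):
    """Solve one instance and keep its solution only if the instance is
    feasible and the search explored at least min_nodes nodes."""
    model = Model()
    model.hideOutput()
    model.readProblem(instance_path)
    model.optimize()

    # Drop infeasible instances and instances solved in fewer than min_nodes nodes.
    if model.getStatus() != "optimal" or model.getNNodes() < min_nodes:
        return False
    model.writeBestSol(sol_path)
    return True
```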
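A hypothetical illustration of the [state, action] sample layout from step 3; the real oracle logic lives in nodesel_oracle.py, the node representation in extract.py, and the helper and field names below are assumptions:

```python
import pickle

def save_sample(sample_path, left_state, right_state, action):
    """Persist one [state, action] pair for imitation learning.

    left_state / right_state: feature representations of the two nodes
    in the comparison (see extract.py for the actual representation).
    action: 0 if the oracle prefers the left node, 1 for the right.
    """
    sample = {
        "state": (left_state, right_state),
        "action": action,  # sampled via the Sampler class in the repo
    }
    with open(sample_path, "wb") as f:
        pickle.dump(sample, f)
```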
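A minimal sketch of what the MLP policy from step 4a could look like, written here in PyTorch; the class name, layer sizes, and in_dim are assumptions rather than the repo's actual architecture:

```python
import torch
import torch.nn as nn

class MLPPolicy(nn.Module):
    """Binary node-comparison policy over the concatenated
    [branching_features, node_features, global_features] vector."""

    def __init__(self, in_dim, hidden=64):  # dimensions are assumptions
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 2),  # logits for actions {0: left, 1: right}
        )

    def forward(self, x):
        return self.net(x)

# Usage: probabilities over {0: left, 1: right}
# policy = MLPPolicy(in_dim=92)  # 92 is a placeholder feature count
# probs = torch.softmax(policy(features), dim=-1)
```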
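The aggregation from step 5b, sketched with NumPy; postprocessing.py is the authoritative implementation that recomputes these statistics from the CSV files:

```python
import numpy as np

def geo_mean(values):
    """Geometric mean: exponential of the mean of the logs."""
    return float(np.exp(np.mean(np.log(values))))

def geo_std(values):
    """Geometric standard deviation: exponential of the std of the logs."""
    return float(np.exp(np.std(np.log(values))))
```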
Implemented reward signals: global tree size, primal bound improvement, and optimality-bound difference.
- Global tree size: -1 for each step
- Primal bound improvement: (new GUB - old GUB) / gap, where GUB is the global upper bound
- Optimality-bound difference: -1 if the node's lower bound exceeds the optimal objective value (LB > Opt), else 0
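
The three signals written out as simple functions, a sketch under assumed variable names (gub: global upper bound, lb: node lower bound, opt: optimal objective value); the in-repo definitions may differ in scaling or sign conventions:

```python
def tree_size_reward():
    """Global tree size: a constant -1 per step, so the undiscounted
    return equals the negated size of the search tree."""
    return -1.0

def primal_bound_reward(old_gub, new_gub, gap):
    """Primal bound improvement: change in the global upper bound,
    normalized by the gap."""
    return (new_gub - old_gub) / gap

def optimality_bound_reward(lb, opt):
    """Optimality-bound difference: penalize a node whose lower bound
    already exceeds the optimal objective value."""
    return -1.0 if lb > opt else 0.0
```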