GitHub - michaelzhiluo/mesa-safe-rl

Description

Implementation of MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance. The repo builds on top of Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones. The main file is main.py in the root directory. MESA's meta-learning takes place in run_multitask.py. The core SAC implementation can be found in sac.py and is built on the implementation from https://github.com/pranz24/pytorch-soft-actor-critic. This file also implements constraint critic training for Recovery RL.

Reproducing Experiments

To reproduce experiments (1) download data from \todo{insert} link and place it in a folder called data/ in the root directory. The commands to run the experiments are in

Name		Name	Last commit message	Last commit date
Latest commit History 497 Commits
config		config
env		env
learning_to_adapt		learning_to_adapt
.gitignore		.gitignore
DotmapUtils.py		DotmapUtils.py
MPC.py		MPC.py
README.md		README.md
VisualRecovery.py		VisualRecovery.py
__init__.py		__init__.py
analyze_runs.py		analyze_runs.py
analyze_runs_michael.py		analyze_runs_michael.py
constraint.py		constraint.py
format.sh		format.sh
gen_cartpole_demos.py		gen_cartpole_demos.py
gen_dynamic_shelf_demos.py		gen_dynamic_shelf_demos.py
gen_dynamic_shelf_long_demos.py		gen_dynamic_shelf_long_demos.py
gen_pointbot0_demos.py		gen_pointbot0_demos.py
gen_pointbot1_demos.py		gen_pointbot1_demos.py
gen_pointbot_demos.py		gen_pointbot_demos.py
gen_pointbot_demos1.py		gen_pointbot_demos1.py
gen_reach_shelf_demos.py		gen_reach_shelf_demos.py
main.py		main.py
make_legend.py		make_legend.py
mesa_exps.txt		mesa_exps.txt
model.py		model.py
obstacle.py		obstacle.py
optimizers.py		optimizers.py
plotting_utils.py		plotting_utils.py
replay_memory.py		replay_memory.py
run.sh		run.sh
run_multitask.py		run_multitask.py
sac.py		sac.py
utils.py		utils.py
video_recorder.py		video_recorder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

Reproducing Experiments

About

Releases

Packages

Contributors 9

Languages

michaelzhiluo/mesa-safe-rl

Folders and files

Latest commit

History

Repository files navigation

Description

Reproducing Experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages