:orphan:
.. _examples:
===================
Gallery of examples
===================
.. raw:: html
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_plot_kernels_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_plot_kernels.py`
.. raw:: html
Plot kernel functions
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_adastop_example_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_adastop_example.py`
.. raw:: html
Compare PPO and A2C on Acrobot with AdaStop
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_plot_writer_wrapper_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_plot_writer_wrapper.py`
.. raw:: html
Record reward during training and then plot it
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_comparison_agents_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_comparison_agents.py`
.. raw:: html
Compare Bandit Algorithms
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_example_venv_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_example_venv.py`
.. raw:: html
Using multiple virtual environments with rlberry
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_plot_smooth_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_plot_smooth.py`
.. raw:: html
Illustration of plotting tools on Bandits
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_plot_agent_manager_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_plot_agent_manager.py`
.. raw:: html
A demo of Experiment Manager
.. raw:: html
.. only:: html
.. image:: /auto_examples/images/thumb/sphx_glr_plot_checkpointing_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_plot_checkpointing.py`
.. raw:: html
Checkpointing
.. raw:: html
.. toctree::
:hidden:
/auto_examples/plot_kernels
/auto_examples/adastop_example
/auto_examples/plot_writer_wrapper
/auto_examples/comparison_agents
/auto_examples/example_venv
/auto_examples/plot_smooth
/auto_examples/plot_agent_manager
/auto_examples/plot_checkpointing
Illustration of rlberry environments
====================================
.. raw:: html
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_chain_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_chain.py`
.. raw:: html
A demo of Chain environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_mountain_car_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_mountain_car.py`
.. raw:: html
A demo of MountainCar environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_apple_gold_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_apple_gold.py`
.. raw:: html
A demo of AppleGold environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_acrobot_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_acrobot.py`
.. raw:: html
A demo of Acrobot environment with RSUCBVIAgent
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_gridworld_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_gridworld.py`
.. raw:: html
A demo of Gridworld environment with ValueIterationAgent
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_twinrooms_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_twinrooms.py`
.. raw:: html
A demo of TwinRooms environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_old_gym_compatibility_wrapper_old_acrobot_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_old_gym_compatibility_wrapper_old_acrobot.py`
.. raw:: html
A demo of OldGymCompatibilityWrapper with old_Acrobot environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_rooms_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_rooms.py`
.. raw:: html
A demo of rooms environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_pball_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_pball.py`
.. raw:: html
A demo of PBall2D environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_springcartpole_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_springcartpole.py`
.. raw:: html
A demo of SpringCartPole environment with DQNAgent
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_video_plot_atari_freeway_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_video_plot_atari_freeway.py`
.. raw:: html
A demo of ATARI Freeway environment with DQNAgent
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_example_atari_atlantis_vectorized_ppo_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_example_atari_atlantis_vectorized_ppo.py`
.. raw:: html
A demo of ATARI Atlantis environment with vectorized PPOAgent
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_env/images/thumb/sphx_glr_example_atari_breakout_vectorized_ppo_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_env_example_atari_breakout_vectorized_ppo.py`
.. raw:: html
A demo of ATARI Breakout environment with vectorized PPOAgent
.. raw:: html
Illustration of rlberry agents
==============================
.. raw:: html
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_ppo_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_ppo.py`
.. raw:: html
A demo of PPO algorithm in PBall2D environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_vi_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_vi.py`
.. raw:: html
A demo of ValueIteration algorithm in Chain environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_rsucbvi_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_rsucbvi.py`
.. raw:: html
A demo of RSUCBVI algorithm in MountainCar environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_a2c_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_a2c.py`
.. raw:: html
A demo of A2C algorithm in PBall2D environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_demo_SAC_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_demo_SAC.py`
.. raw:: html
SAC (Soft Actor-Critic)
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_mbqvi_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_mbqvi.py`
.. raw:: html
A demo of MBQVI algorithm in Gridworld environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_rs_kernel_ucbvi_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_rs_kernel_ucbvi.py`
.. raw:: html
A demo of RSKernelUCBVIAgent algorithm in Acrobot environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_dqn_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_dqn.py`
.. raw:: html
A demo of DQN algorithm in CartPole environment
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_agents/images/thumb/sphx_glr_video_plot_mdqn_thumb.jpg
:alt:
:ref:`sphx_glr_auto_examples_demo_agents_video_plot_mdqn.py`
.. raw:: html
A demo of M-DQN algorithm in CartPole environment
.. raw:: html
Illustration of bandits in rlberry
==================================
.. raw:: html
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_bandits/images/thumb/sphx_glr_plot_ucb_bandit_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_bandits_plot_ucb_bandit.py`
.. raw:: html
UCB Bandit cumulative regret
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_bandits/images/thumb/sphx_glr_plot_exp3_bandit_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_bandits_plot_exp3_bandit.py`
.. raw:: html
EXP3 Bandit cumulative regret
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_bandits/images/thumb/sphx_glr_plot_TS_bandit_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_bandits_plot_TS_bandit.py`
.. raw:: html
Comparison of Thompson sampling and UCB on Bernoulli and Gaussian bandits
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_bandits/images/thumb/sphx_glr_plot_compare_index_bandits_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_bandits_plot_compare_index_bandits.py`
.. raw:: html
Comparison subplots of various index-based bandit algorithms
.. raw:: html
.. only:: html
.. image:: /auto_examples/demo_bandits/images/thumb/sphx_glr_plot_mirror_bandit_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_demo_bandits_plot_mirror_bandit.py`
.. raw:: html
A demo of Bandit BAI on a real dataset to select mirrors
.. raw:: html
.. toctree::
:hidden:
:includehidden:
/auto_examples/demo_env/index.rst
/auto_examples/demo_agents/index.rst
/auto_examples/demo_bandits/index.rst
.. only:: html
.. container:: sphx-glr-footer sphx-glr-footer-gallery
.. container:: sphx-glr-download sphx-glr-download-python
:download:`Download all examples in Python source code: auto_examples_python.zip `
.. container:: sphx-glr-download sphx-glr-download-jupyter
:download:`Download all examples in Jupyter notebooks: auto_examples_jupyter.zip `
.. only:: html
.. rst-class:: sphx-glr-signature
`Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_