Information Theoretic Trust Regions for Gradient Descent

  • Subject: Imitation Learning, Trust-Region Optimization
  • Type: Master's Thesis
  • Supervisor: Philipp Becker, Maximilian Hüttenrauch
  • Person in Charge: Philipp Dahlinger
  • Add on: Link