Property | Value
rdf:type
rdfs:label
  • Reinforcement learning
owl:sameAs
prov:wasDerivedFrom
skos:prefLabel
  • Reinforcement learning
skos:altLabel
  • deep reinforcement learning
  • reinforcement
  • RL
  • Reinforcement_learning#Algorithms_for_control_learning
  • :Reinforcement learning#Direct policy search
  • Reinforcement learning#Inverse reinforcement learning
  • Learning algorithms
  • Reinforcement
  • Reinforcement Learning
  • Reinforcement Loop
  • Reinforcement learning#Direct policy search
  • Reinforcement learning#Temporal difference methods
  • Reinforcement machine learning
  • Reinforcement_learning#Direct_policy_search
  • Reinforcement_learning#Safe reinforcement learning
  • action reinforcement
  • approximate dynamic programming
  • eligibility traces
  • exploration/exploitation
  • inverse reinforcement learning
  • learnt
  • policy gradient
  • reinforced
  • reinforced learning
  • reinforcement learning
  • reinforcement learning agent
  • reinforcement-learning
  • self-trained
  • single-agent reinforcement learning
  • state value function
  • Reinforcement_learning#Comparison_of_reinforcement_learning_algorithms
  • Reinforcement_learning#Partially supervised reinforcement learning (PSRL)
  • Reinforcement_learning#Associative_reinforcement_learning
is clgo:academicDiscipline of