Property | Value |
rdf:type | |
rdfs:label | |
owl:sameAs | |
prov:wasDerivedFrom | |
skos:prefLabel | |
skos:altLabel | - deep reinforcement learning
- reinforcement
- RL
- Reinforcement_learning#Algorithms_for_control_learning
- :Reinforcement learning#Direct policy search
- Reinforcement learning#Inverse reinforcement learning
- Learning algorithms
- Reinforcement
- Reinforcement Learning
- Reinforcement Loop
- Reinforcement learning#Direct policy search
- Reinforcement learning#Temporal difference methods
- Reinforcement machine learning
- Reinforcement_learning#Direct_policy_search
- Reinforcement_learning#Safe reinforcement learning
- action reinforcement
- approximate dynamic programming
- eligibility traces
- exploration/exploitation
- inverse reinforcement learning
- learnt
- policy gradient
- reinforced
- reinforced learning
- reinforcement learning
- reinforcement learning agent
- reinforcement-learning
- self-trained
- single-agent reinforcement learning
- state value function
- Reinforcement_learning#Comparison_of_reinforcement_learning_algorithms
- Reinforcement_learning#Partially supervised reinforcement learning (PSRL)
- Reinforcement_learning#Associative_reinforcement_learning
|
is clgo:academicDiscipline of | |