Talk:End-to-end reinforcement learning: Difference between revisions
Revision as of 17:00, 27 October 2021
This redirect does not require a rating on Wikipedia's content assessment scale. It is of interest to the following WikiProjects:
Merge with article on deep reinforcement learning?
The term "end-to-end reinforcement learning" is just another way to refer to "deep reinforcement learning", but deep RL is the more formal term. This article is actually better at giving examples of deep RL, but the deep RL page is more informative and descriptive. I think these articles should be merged. Anair13 (talk) 19:59, 24 November 2020 (UTC)
Went ahead and merged Anair13 (talk) 02:30, 1 December 2020 (UTC)
Can I ask why this was restored, especially without discussion, after I had started a discussion on it? I don't think there is any formal distinction between deep reinforcement learning and "end-to-end reinforcement learning". If this page is to be kept, I would challenge someone to find a citation saying that there is such a distinction, and to make that distinction clear on this page. Both terms just vaguely mean reinforcement learning with function approximation to handle raw inputs. Moreover, this page is unbalanced towards the work of one author, Katsunari Shibata (perhaps a violation of Wikipedia:Neutral_point_of_view), to the exclusion of much more famous and foundational work. Anair13 (talk) 17:00, 27 October 2021 (UTC)