Jump to content

Talk:End-to-end reinforcement learning: Difference between revisions

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Content deleted Content added
BattyBot (talk | contribs)
Anair13 (talk | contribs)
Line 11: Line 11:


Went ahead and merged [[User:Anair13|Anair13]] ([[User talk:Anair13|talk]]) 02:30, 1 December 2020 (UTC)
Went ahead and merged [[User:Anair13|Anair13]] ([[User talk:Anair13|talk]]) 02:30, 1 December 2020 (UTC)

Can I ask why this was restored? Especially without discussion, after I had started a discussion on it? I don't think there is any formal distinction between [[deep reinforcement learning]] and "end-to-end reinforcement learning", and I would challenge someone to find a citation saying that there is in order to keep this page and also to make the distinction clear on this page. They both just vaguely mean reinforcement learning with function approximation to handle raw inputs. Moreover, this page is unbalanced towards the work of one author, Katsunari Shibata (perhaps a violation of [[Wikipedia:Neutral_point_of_view]]), at the exclusion of a lot more famous and foundational work. [[User:Anair13|Anair13]] ([[User talk:Anair13|talk]]) 17:00, 27 October 2021 (UTC)

Revision as of 17:00, 27 October 2021

Merge with article on deep reinforcement learning?

The term "end-to-end reinforcement learning" is just another way to refer to "deep reinforcement learning" but deep RL is the more formal term. This article is actually better in giving examples of deep RL but the deep RL page is more informative/descriptive. I think these articles should be merged. Anair13 (talk) 19:59, 24 November 2020 (UTC)[reply]

Went ahead and merged Anair13 (talk) 02:30, 1 December 2020 (UTC)[reply]

Can I ask why this was restored? Especially without discussion, after I had started a discussion on it? I don't think there is any formal distinction between deep reinforcement learning and "end-to-end reinforcement learning", and I would challenge someone to find a citation saying that there is in order to keep this page and also to make the distinction clear on this page. They both just vaguely mean reinforcement learning with function approximation to handle raw inputs. Moreover, this page is unbalanced towards the work of one author, Katsunari Shibata (perhaps a violation of Wikipedia:Neutral_point_of_view), at the exclusion of a lot more famous and foundational work. Anair13 (talk) 17:00, 27 October 2021 (UTC)[reply]