Talk:End-to-end reinforcement learning

This redirect does not require a rating on Wikipedia's content assessment scale.
It is of interest to the following WikiProjects:

Articles for creation

	This redirect was reviewed by member(s) of WikiProject Articles for creation. The project works to allow users to contribute quality articles and media files to the encyclopedia and track their progress as they are developed. To participate, please visit the project page for more information.Articles for creationWikipedia:WikiProject Articles for creationTemplate:WikiProject Articles for creationAfC articles
	This redirect was accepted from this draft on 4 April 2017 by reviewer SwisterTwister (talk · contribs).

Engineering

This redirect is within the scope of WikiProject Engineering, a collaborative effort to improve the coverage of engineering on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.EngineeringWikipedia:WikiProject EngineeringTemplate:WikiProject EngineeringEngineering articles

Science

This redirect is within the scope of WikiProject Science, a collaborative effort to improve the coverage of Science on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.ScienceWikipedia:WikiProject ScienceTemplate:WikiProject Sciencescience articles

Computer science

This redirect is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of Computer science related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.Computer scienceWikipedia:WikiProject Computer scienceTemplate:WikiProject Computer scienceComputer science articles

This redirect has been automatically rated by a bot or other tool because one or more other projects use this class. Please ensure the assessment is correct before removing the |auto= parameter.

Things you can help WikiProject Computer science with:

Here are some tasks awaiting attention:

Article requests :
- Requested articles/Applied arts and sciences/Computer science, computing, and Internet
Cleanup :
- Computer science articles needing attention
- Computer science articles needing expert attention
Copyedit :
- Computing
Expand :
- Computer science
Infobox :
- Computer science articles without infoboxes
Maintain :
- Timeline of computing 2020–present
Photo :
- Find pictures for the biographies of computer scientists (see List of computer scientists)
- Computing articles needing images
Stubs :
- Computer science stubs
Unreferenced :
- WikiProject Computer science/Unreferenced BLPs
Project-related :
- Tag all relevant articles in Category:Computer science and sub-categories with {{WikiProject Computer science}}

Merge with article on deep reinforcement learning?

The term "end-to-end reinforcement learning" is just another way to refer to "deep reinforcement learning" but deep RL is the more formal term. This article is actually better in giving examples of deep RL but the deep RL page is more informative/descriptive. I think these articles should be merged. Anair13 (talk) 19:59, 24 November 2020 (UTC)[reply]

Went ahead and merged Anair13 (talk) 02:30, 1 December 2020 (UTC)[reply]

Can I ask why this was restored? Especially without discussion, after I had started a discussion on it? I don't think there is any formal distinction between deep reinforcement learning and "end-to-end reinforcement learning", and I would challenge someone to find a citation saying that there is in order to keep this page and also to make the distinction clear on this page. They both just vaguely mean reinforcement learning with function approximation to handle raw inputs. Moreover, this page is unbalanced towards the work of one author, Katsunari Shibata (perhaps a violation of Wikipedia:Neutral_point_of_view), at the exclusion of a lot more famous and foundational work. Anair13 (talk) 17:00, 27 October 2021 (UTC)[reply]