
User:Jtumina/sandbox


Overview[edit]

A biological neural network, responsible for all computations done by the brain, is composed of a group or groups of chemically connected or functionally associated neurons. A single neuron may be connected to many other neurons and the total number of neurons and connections in a network may be extensive. Connections, called synapses, are usually formed from axons to dendrites. Apart from the electrical signaling, there are other forms of signaling that arise from neurotransmitter diffusion.[1]


Artificial intelligence, cognitive modeling, and neural networks are information processing paradigms inspired by the way biological neural systems process data. Artificial intelligence and cognitive modeling try to simulate some properties of biological neural networks:

Rather than programming them, we train neural networks by example. As children need to know nothing about comparative physiology to recognize their mothers, programmers need not provide neural networks with quantitative descriptions of objects being recognized, nor sets of logical criteria to distinguish such objects from similar objects.

— P. D. Wasserman and T. Schwartz, "Neural networks. II. What are they and why is everybody so interested in them now?", IEEE Expert, vol. 3, no. 1

Neural networks that share these properties with the brain are able to perform tasks such as pattern recognition. Even when some neurons stop working, the network as a whole can continue to function, which makes it reliable.[1]

In the artificial intelligence field, artificial neural networks have been applied successfully to speech recognition, image analysis and adaptive control, in order to construct software agents (in computer and video games) or autonomous robots.

Historically, digital computers evolved from the von Neumann model, and operate by executing explicit instructions, with memory accessed by a number of processors. On the other hand, the origins of neural networks are based on efforts to model information processing in biological systems. Unlike the von Neumann model, neural network computing does not separate memory and processing.

Neural network theory has served both to better identify how the neurons in the brain function and to provide the basis for efforts to create artificial intelligence.

History[edit]

The preliminary theoretical basis for contemporary neural networks was independently proposed by Alexander Bain (1873) and William James (1890). In their work, both thoughts and body activity resulted from interactions among neurons within the brain.

[Image caption: Computer simulation of the branching architecture of the dendrites of pyramidal neurons.]

For Bain, every activity led to the firing of a certain set of neurons. When activities were repeated, the connections between those neurons strengthened. According to his theory, this repetition was what led to the formation of memory. The general scientific community at the time was skeptical of Bain’s theory because it required what appeared to be an inordinate number of neural connections within the brain. It is now apparent that the brain is exceedingly complex and that the same brain “wiring” can handle multiple problems and inputs.

James’s theory was similar to Bain’s; however, James suggested that memories and actions resulted from electrical currents flowing among the neurons in the brain. His model, by focusing on the flow of electrical currents, did not require individual neural connections for each memory or action.

C. S. Sherrington (1898) conducted experiments to test James’s theory. He ran electrical currents down the spinal cords of rats. However, instead of demonstrating an increase in electrical current as projected by James, Sherrington found that the electrical current strength decreased as the testing continued over time. Importantly, this work led to the discovery of the concept of habituation.

McCulloch and Pitts (1943) created a computational model for neural networks based on mathematics and algorithms. They called this model threshold logic. The model paved the way for neural network research to split into two distinct approaches. One approach focused on biological processes in the brain and the other focused on the application of neural networks to artificial intelligence.

In the late 1940s psychologist Donald Hebb created a hypothesis of learning based on the mechanism of neural plasticity that is now known as Hebbian learning. Hebbian learning is considered to be a 'typical' unsupervised learning rule and its later variants were early models for long-term potentiation. These ideas started being applied to computational models in 1948 with Turing's B-type machines.
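A minimal sketch of the Hebbian idea in Python is given below: the connection between two units is strengthened in proportion to the product of their activities, so connections between units that are repeatedly active together grow. The activity values and learning rate are illustrative assumptions, not taken from Hebb's original formulation.

```python
def hebbian_update(weights, x, y, eta=0.1):
    """One Hebbian step: w_i += eta * x_i * y (co-active units strengthen)."""
    return [w + eta * xi * y for w, xi in zip(weights, x)]

weights = [0.0, 0.0, 0.0]
x = [1.0, 0.0, 1.0]   # presynaptic activities (two units active, one silent)
y = 1.0               # postsynaptic unit fires

for _ in range(5):    # repeated co-activation strengthens the active connections
    weights = hebbian_update(weights, x, y)

print(weights)        # approximately [0.5, 0.0, 0.5]: only co-active connections grew
```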

Farley and Clark (1954) first used computational machines, then called calculators, to simulate a Hebbian network at MIT. Other neural network computational machines were created by Rochester, Holland, Habit, and Duda (1956).

Rosenblatt (1958) created the perceptron, an algorithm for pattern recognition based on a two-layer learning computer network using simple addition and subtraction. With mathematical notation, Rosenblatt also described circuitry not in the basic perceptron, such as the exclusive-or circuit, a circuit whose mathematical computation could not be processed until after the backpropagation algorithm was created by Werbos (1975).

Neural network research stagnated after the publication of machine learning research by Minsky and Papert (1969). They discovered two key issues with the computational machines that processed neural networks. The first issue was that single-layer neural networks were incapable of processing the exclusive-or circuit. The second significant issue was that computers were not sophisticated enough to effectively handle the long run time required by large neural networks. Neural network research slowed until computers achieved greater processing power. Also key in later advances was the backpropagation algorithm which effectively solved the exclusive-or problem (Werbos 1975).
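To make the exclusive-or point concrete, the following sketch (Python with NumPy) trains a small network with one hidden layer by backpropagation and plain gradient descent; such a network can learn XOR even though a single-layer network cannot represent it. The layer sizes, learning rate, and iteration count are arbitrary illustrative choices, not values from the historical papers.

```python
import numpy as np

# XOR truth table: not linearly separable, so a single-layer perceptron cannot
# learn it, but one hidden layer trained by backpropagation can.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)   # input -> hidden
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)   # hidden -> output

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for _ in range(5000):
    # forward pass
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)

    # backward pass: gradients of the squared error through the sigmoids
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)

    # gradient-descent weight updates
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0)

print(np.round(out).ravel())   # should approach [0. 1. 1. 0.] once training converges
```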

In the early 1980s, research in neural networks and their applications began to resurface. John Hopfield wrote two papers in this time which brought many scientists into neural network research.[2] The parallel distributed processing of the mid-1980s became popular under the name connectionism. The text by Rumelhart and McClelland (1986) provided a full exposition on the use of connectionism in computers to simulate neural processes. IEEE International Conference on Neural Networks (1987) was the first conference with neural networks as the topic. Also in the late 1980s, many journals were founded such as the INNS journal Neural Networks (1988), and Neural Computation (1989).[2]

Neural networks and artificial intelligence[edit]

Main article: Artificial neural network

A neural network (NN), in the case of artificial neurons called an artificial neural network (ANN) or simulated neural network (SNN), is an interconnected group of natural or artificial neurons that uses a mathematical or computational model for information processing based on a connectionist approach to computation. In most cases an ANN is an adaptive system that changes its structure based on external or internal information that flows through the network.

In more practical terms, neural networks are non-linear statistical data modeling or decision-making tools. They can be used to model complex relationships between inputs and outputs when a more basic control structure, such as Boolean logic, cannot be used.[3]

An artificial neural network involves a network of simple processing elements (artificial neurons) which can exhibit complex global behavior, determined by the connections between the processing elements and element parameters. Artificial neurons were first proposed in 1943 by Warren McCulloch, a neurophysiologist, and Walter Pitts, a logician, who first collaborated at the University of Chicago.

One classical type of artificial neural network is the recurrent Hopfield network.
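As a rough illustration of the Hopfield idea, the sketch below stores two ±1 patterns with a Hebbian outer-product rule and then recovers one of them from a corrupted cue by repeated thresholded updates. The pattern contents and the synchronous update schedule are illustrative assumptions.

```python
import numpy as np

# Store +/-1 patterns in a symmetric weight matrix via the Hebbian
# outer-product rule, then recall a stored pattern from a noisy cue
# by iterating sign(W @ state).
patterns = np.array([
    [ 1, -1,  1, -1,  1, -1],
    [ 1,  1,  1, -1, -1, -1],
])

W = sum(np.outer(p, p) for p in patterns).astype(float)
np.fill_diagonal(W, 0)          # no self-connections

def recall(state, steps=10):
    state = state.copy()
    for _ in range(steps):      # synchronous updates for simplicity
        state = np.where(W @ state >= 0, 1, -1)
    return state

cue = patterns[0].copy()
cue[0] = -cue[0]                # flip one unit to corrupt the cue
print(recall(cue))              # settles back to the first stored pattern
```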

The concept of a neural network appears to have first been proposed by Alan Turing in his 1948 paper Intelligent Machinery, in which he called them "B-type unorganised machines".

The utility of artificial neural networks lies in the fact that they can be used to infer a function from observations when the complexity of the data might make it impossible to come up with a function by hand.[1] Neural networks can also be used to learn representations of the input that capture the salient characteristics of the input distribution, e.g., see the Boltzmann machine (1983), and more recently, deep learning algorithms, which can implicitly learn the distribution function of the observed data.

When neural networks learn, each artificial neuron receives inputs and applies a weight to each of them. The weighted inputs are then summed and the sum is compared to a threshold in the neuron. The neuron outputs 0 if the sum is below the threshold, and 1 if the sum is greater. Over time and training, the weights and the threshold value are adjusted to make the network as accurate as possible.[1]
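The description above corresponds to a single threshold unit. A minimal sketch of such a unit, trained with the classic perceptron-style update (adjusting the weights and threshold after each mistake) on a toy AND task, is shown below; the data, learning rate, and number of passes are illustrative assumptions.

```python
# A single threshold unit: output 1 if the weighted sum of inputs exceeds the
# threshold, else 0.  Training nudges the weights and threshold after every
# misclassified example -- the classic perceptron rule.
def predict(weights, threshold, inputs):
    total = sum(w * x for w, x in zip(weights, inputs))
    return 1 if total > threshold else 0

# Toy task: learn logical AND (illustrative data only).
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

weights, threshold, lr = [0.0, 0.0], 0.0, 0.1
for _ in range(20):                      # a few passes over the data
    for inputs, target in data:
        error = target - predict(weights, threshold, inputs)
        weights = [w + lr * error * x for w, x in zip(weights, inputs)]
        threshold -= lr * error          # moving the threshold acts like a bias

print([predict(weights, threshold, x) for x, _ in data])   # expected: [0, 0, 0, 1]
```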

The tasks to which artificial neural networks are applied tend to fall within a few broad categories.

Application areas of ANNs include nonlinear system identification and control (vehicle control, process control), game-playing and decision making (backgammon, chess, racing), pattern recognition (radar systems, face identification, object recognition), sequence recognition (gesture, speech, handwritten text recognition[4]), medical diagnosis, financial applications, data mining (or knowledge discovery in databases, "KDD"), visualization and e-mail spam filtering.

Criticism[edit]

A common criticism of neural networks, particularly in their applications in robotics, is their requirement for a large diversity of training for real-world operation. This is not surprising, since any learning machine needs sufficient representative examples in order to capture the underlying structure that allows it to generalize to new cases. Dean Pomerleau, in his research presented in the paper "Knowledge-based Training of Artificial Neural Networks for Autonomous Robot Driving," uses a neural network to train a robotic vehicle to drive on multiple types of roads (single lane, multi-lane, dirt, etc.). A large amount of his research is devoted to (1) extrapolating multiple training scenarios from a single training experience, and (2) preserving past training diversity so that the system does not become over-trained (if, for example, it is presented with a series of right turns, it should not learn to always turn right).[5] Because neural networks learn only from the training data they are given, they may encounter inputs unlike anything seen during training, in which case the network may have no appropriate output for the input it receives.[3]

A. K. Dewdney, a former Scientific American columnist, wrote in 1997, "Although neural nets do solve a few toy problems, their powers of computation are so limited that I am surprised anyone takes them seriously as a general problem-solving tool." (Dewdney, p. 82)

Arguments for Dewdney's position are that implementing large and effective software neural networks requires committing substantial processing and storage resources. While the brain has hardware tailored to the task of processing signals through a graph of neurons, simulating even a highly simplified form on von Neumann architecture may compel a neural network designer to fill many millions of database rows for its connections, which can consume vast amounts of computer memory and disk space. Furthermore, the designer of neural network systems will often need to simulate the transmission of signals through many of these connections and their associated neurons, which often demands large amounts of CPU processing power and time. While neural networks often yield effective programs, they too often do so at the cost of efficiency (they tend to consume considerable amounts of time and money).

Arguments against Dewdney's position are that neural nets have been successfully used to solve many complex and diverse tasks, ranging from autonomously flying aircraft to detecting credit card fraud[citation needed].

Technology writer Roger Bridgman also commented on Dewdney's statements about neural nets.

In response to this kind of criticism, one should note that although it is true that analyzing what has been learned by an artificial neural network is difficult, it is much easier to do so than to analyze what has been learned by a biological neural network. Furthermore, researchers involved in exploring learning algorithms for neural networks are gradually uncovering generic principles which allow a learning machine to be successful. For example, Bengio and LeCun (2007) wrote an article on local versus non-local learning and on shallow versus deep architectures.

Other criticism has come from advocates of hybrid models (combining neural networks and symbolic approaches), who believe that the intermixing of these two approaches can better capture the mechanisms of the human mind (Sun and Bookman, 1990).

References[edit]

  1. ^ a b c d Krogh, Anders (1 February 2008). "What are artificial neural networks?". Nature Biotechnology. 26 (2): 195–197. doi:10.1038/nbt1386. ISSN 1087-0156.
  2. ^ a b Yadav, Neha; Yadav, Anupam; Kumar, Manoj. An Introduction to Neural Network Methods for Differential Equations. Dordrecht. ISBN 9789401798167. OCLC 904248956.
  3. ^ a b "Adaptive algorithms". Machine Design. 69: 144. Apr 17, 1997 – via ProQuest Central.
  4. ^ Wasserman, P.D.; Schwartz, T. (1988). "Neural networks. II. What are they and why is everybody so interested in them now?". IEEE Expert. 3 (1): 10–15. doi:10.1109/64.2091. ISSN 0885-9000.
  5. ^ Pomerleau, D.A. (1993). Knowledge-Based Training of Artificial Neural Networks for Autonomous Robot Driving. Boston, MA: Springer. pp. 19–43. ISBN 978-1-4613-6396-5.