Instantaneously trained neural networks

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Instantaneously trained neural networks are feedforward artificial neural networks that create a new hidden neuron node for each novel training sample. The weights to this hidden neuron separate out not only this training sample but others that are near it, thus providing generalization.[1][2] This training can be done in a variety of ways and the most popular network in this family is called the CC4 network where the separation is done using the nearest hyperplane that can be written down instantaneously. These networks use unary coding for an effective representation of the data sets.[3]

Instantaneously trained neural networks have been proposed as models of short term learning and used in web search, and financial time series prediction applications.[4] They have also been used in instant classification of documents[5] and for deep learning and data mining.[6][7]

As in other neural networks, their normal use is as software, but they have also been implemented in hardware using FPGAs[8] and by optical implementation.[9]

CC4 network[edit]

In the CC4 network, which is a three-stage network, the number of input nodes is one more than the size of the training vector, with the extra node serving as the biasing node whose input is always 1. For binary input vectors, the weights from the input nodes to the hidden neuron (say of index j) corresponding to the trained vector is given by the following formula:

w_{ij} = \begin{cases}
   -1, & \mbox{for } x_i = 0\\
                         +1, & \mbox{for } x_i = 1\\
  r+s-1, & \mbox{for } i = n+1
\end{cases}

where r is the radius of generalization and s is the Hamming weight (the number of 1s) of the binary sequence. From the hidden layer to the output layer the weights are 1 or -1 depending on whether the vector belongs to a given output class or not. The neurons in the hidden and output layers output 1 if the weighted sum to the input is 0 or positive and 0, if the weighted sum to the input is negative:

y = \left\{ \begin{matrix} 1 & \mbox{if } \sum x_i \ge 0\\ 0 & \mbox{if } \sum x_i< 0\end{matrix} \right.

Other networks[edit]

In feedback networks the Willshaw network as well as the Hopfield network are able to learn instantaneously. But these networks are plagued with spurious memories.[10]

References[edit]

  1. ^ Kak, S. On training feedforward neural networks. Pramana, vol. 40, pp. 35-42, 1993
  2. ^ Kak, S. New algorithms for training feedforward neural networks. Pattern Recognition Letters 15: 295-298, 1994.
  3. ^ Kak, S. On generalization by neural networks, Information Sciences 111: 293-302, 1998.
  4. ^ Kak, S. Faster web search and prediction using instantaneously trained neural networks. IEEE Intelligent Systems 14: 79-82, November/December 1999.
  5. ^ Zhang, Z. et al., TextCC: New feedforward neural network for classifying documents instantly. Advances in Neural Networks ISNN 2005. Lecture Notes in Computer Science 3497: 232-237, 2005.
  6. ^ Zhang, Z. et al., Document Classification Via TextCC Based on Stereographic Projection and for deep learning, International Conference on Machine Learning and Cybernetics, Dalin, 2006
  7. ^ Schmidhuber, J. Deep Learning in Neural Networks: An Overview, arXiv:1404.7828, 2014 http://arxiv.org/abs/1404.7828
  8. ^ Zhu, J. and G. Milne, Implementing Kak Neural Networks on a Reconfigurable Computing Platform, Lecture Notes in Computer Science Volume 1896: 260-269, 2000.
  9. ^ Shortt, A., J.G. Keating, L. Moulinier, C.N. Pannell, Optical implementation of the Kak neural network, Information Sciences 171: 273-287, 2005.
  10. ^ Ponnath, A. Instantaneously trained neural networks. 2006. http://arxiv.org/pdf/cs/0601129.pdf