Saturday, January 11, 2014

Forms of Learning
Several classification of learning systems are possible based on the above components as
follows:
Goal/Task/Target Function:
Prediction: To predict the desired output for a given input based on previous input/output
pairs. E.g., to predict the value of a stock given other inputs like market index, interest
rates etc.
Categorization: To classify an object into one of several categories based on features of
the object. E.g., a robotic vision system to categorize a machine part into one of the
categories, spanner, hammer etc based on the parts’ dimension and shape.
Clustering: To organize a group of objects into homogeneous segments. E.g., a satellite
image analysis system which groups land areas into forest, urban and water body, for
better utilization of natural resources.
Planning: To generate an optimal sequence of actions to solve a particular problem. E.g.,
an Unmanned Air Vehicle which plans its path to obtain a set of pictures and avoid
enemy anti-aircraft guns.
Models:
• Propositional and FOL rules
• Decision trees
• Linear separators
• Neural networks
• Graphical models
• Temporal models like hidden Markov models
Learning Rules:
Learning rules are often tied up with the model of learning used. Some common rules are
gradient descent, least square error, expectation maximization and margin maximization.
Version 1 CSE IIT, Kharagpur
Experiences:
Learning algorithms use experiences in the form of perceptions or perception action pairs
to improve their performance. The nature of experiences available varies with
applications. Some common situations are described below.

Supervised learning: In supervised learning a teacher or oracle is available which
provides the desired action corresponding to a perception. A set of perception action pair
provides what is called a training set. Examples include an automated vehicle where a set
of vision inputs and the corresponding steering actions are available to the learner.

Unsupervised learning: In unsupervised learning no teacher is available. The learner only
discovers persistent patterns in the data consisting of a collection of perceptions. This is
also called exploratory learning. Finding out malicious network attacks from a sequence
of anomalous data packets is an example of unsupervised learning.

Active learning: Here not only a teacher is available, the learner has the freedom to ask
the teacher for suitable perception-action example pairs which will help the learner to
improve its performance. Consider a news recommender system which tries to learn an
users preferences and categorize news articles as interesting or uninteresting to the user.
The system may present a particular article (of which it is not sure) to the user and ask
whether it is interesting or not.

Reinforcement learning: In reinforcement learning a teacher is available, but the teacher
instead of directly providing the desired action corresponding to a perception, return
reward and punishment to the learner for its action corresponding to a perception.
Examples include a robot in a unknown terrain where its get a punishment when its hits
an obstacle and reward when it moves smoothly.
In order to design a learning system the designer has to make the following choices
based on the application.

No comments:

Post a Comment