Information Gain in Decision Trees

Information Gain In Decision Trees

In information theory and machine learning, information gain is an alternative synonym for Kullback–Leibler divergence.

In particular, the information gain about a random variable X obtained from an observation that a random variable A takes the value A=a is the Kullback-Leibler divergence DKL(p(x | a) || p(x | I)) of the prior distribution p(x | I) for x from the posterior distribution p(x | a) for x given a.

The expected value of the information gain is the mutual information I(X; A) of X and A — i.e. the reduction in the entropy of X achieved by learning the state of the random variable A.

In machine learning this concept can be used to define a preferred sequence of attributes to investigate to most rapidly narrow down the state of X. Such a sequence (which depends on the outcome of the investigation of previous attributes at each stage) is called a decision tree. Usually an attribute with high information gain should be preferred to other attributes.

Read more about Information Gain In Decision Trees:  General Definition, Formal Definition, Drawbacks, Constructing A Decision Tree Using Information Gain

Famous quotes containing the words information, gain, decision and/or trees:

    I was brought up to believe that the only thing worth doing was to add to the sum of accurate information in the world.
    Margaret Mead (1901–1978)

    Competition is. In every business, no matter how small or how large, someone is just around the corner forever trying to steal your ideas and build his success out of your imagination, struggling after that which you have toiled endless years to secure, striving to outdo you in each and every way. If such a competitor would work as hard to originate as he does to copy, he would much more quickly gain success.
    Alice Foote MacDougall (1867–1945)

    A good decision is based on knowledge and not on numbers.
    Plato (c. 427–347 B.C.)

    It were a blessed sight to see
    That child become a willow tree,
    His brother trees among.
    He’d be four times as tall as me,
    And live three times as long.
    Catherine Maria Fanshawe (1765–1834)