COMPARISON OF FUZZY AND CRISP CLASSIFICATION TREES USING GINI INDEX, CHI-SQUARE STATISTIC AND THE GAIN RATIO

  • Type: Project
  • Department: Philosophy
  • Project ID: PHI0184
  • Access Fee: ₦5,000 ($14)
  • Pages: 85 Pages
  • Format: Microsoft Word
  • Views: 1.1K
  • Report This work

For more Info, call us on
+234 8130 686 500
or
+234 8093 423 853

ABSTRACT

Discriminant (classification) analysis is a classification problem where a new individual is allocated into one of known populations or classes based on the measured characteristics of the individual. Different models are used in allocating the new individual into one of the populations (classes). Some of the models depend on the underlying distribution of the populations, hese are known as parametric models. If the model does not depend on any underlying distribution it is known as a distribution free or non parametric model. In this work a distribution free model known as classification tree is used. A classification tree is a presentation of edges and nodes. It is a model that is used to assign an individual to one of many classes or populations. At each node a test is applied on a value of one of the attributes (variables) of the individual. The individual moves to the next node (child node) along an edge depending on the result of the test. The attribute, on which the test is applied, is known as the splitting attribute and the value the splitting value. Tests are carried out at each node until it is not possible to carry out more tests. The final nodes are known as terminal or leaf nodes. Classification is done at the terminal nodes by assigning all the individuals on that node to a class. If the splitting value is a fuzzy value, then the tree is known as a fuzzy classification tree otherwise the tree is known as a crisp classification tree. When there are only two possible answers to the test at each node, the resulting tree is known as a binary tree. Classification trees have been used to model many situations. These include speech recognition, data mining and market surveys among others. In this study the performance of crisp and fuzzy classification trees was compared. The performance was based on probabilities of correct allocation and probabilities of misclassification. Simulated data and real data were used. Data was simulated using R and the real data was obtained from machine learning repository. Gini Index, Chi-Square Statistic and Gain Ratio impurity measures were applied to both the simulated data and real data. The performance of Gini Index, Chi-Square Statistic and Gain Ratio impurity measures was also compared. Finally the performance of the trees using varied sample sizes was compared. It was found that for the simulated data, fuzzy classification tree performed better than the crisp classification tree when all the three impurity measures were applied. It was found that the Gini Index and Chi-Square Statistic impurity measures were appropriate as impurity measures for the data used in the study and gave similar results. However the Gain Ratio impurity measure did not perform as well as the other two impurity measures. It was also found that there was no significant difference in the probabilities of misclassification irrespective of different sample sizes in the populations

COMPARISON OF FUZZY AND CRISP CLASSIFICATION TREES USING GINI INDEX, CHI-SQUARE STATISTIC AND THE GAIN RATIO
For more Info, call us on
+234 8130 686 500
or
+234 8093 423 853

Share This
  • Type: Project
  • Department: Philosophy
  • Project ID: PHI0184
  • Access Fee: ₦5,000 ($14)
  • Pages: 85 Pages
  • Format: Microsoft Word
  • Views: 1.1K
Payment Instruction
Bank payment for Nigerians, Make a payment of ₦ 5,000 to

Bank GTBANK
gtbank
Account Name Obiaks Business Venture
Account Number 0211074565

Bitcoin: Make a payment of 0.0005 to

Bitcoin(Btc)

btc wallet
Copy to clipboard Copy text

Details

Type Project
Department Philosophy
Project ID PHI0184
Fee ₦5,000 ($14)
No of Pages 85 Pages
Format Microsoft Word

Related Works

ABSTRACT Discriminant (classification) analysis is a classification problem where a new individual is allocated into one of known populations or classes based on the measured characteristics of the individual. Different models are used in allocating the new individual into one of the populations (classes). Some of the models depend on the... Continue Reading
ABSTRACT In chapter one, the general introduction and important definitions were well stated. In chapter two, the chi-square distribution and chi-square goodness-of-fit test were treated. In chapter three, the chi-square test of independence was highlighted. Finally, in chapter four, we considered the application of chi-square test of independence... Continue Reading
ABSTRACT     This research work was focused on the EFFECT OF LACK OF PRACTICAL TOOLS ON THE ACADEMIC PERFORMANCE OF STUDENTS in The Ibarapa Polytechnic, Eruwa.     The objectives and purpose of the research work is to determine whether factors such as lack of practical tools, lack of enough practical lecturers, improvising of practical tools... Continue Reading
ABSTRACT     This research work was focused on the EFFECT OF LACK OF PRACTICAL TOOLS ON THE ACADEMIC PERFORMANCE OF STUDENTS in The Ibarapa Polytechnic, Eruwa.     The objectives and purpose of the research work is to determine whether factors such as lack of practical tools, lack of enough practical lecturers, improvising of practical tools... Continue Reading
Health related quality of life is one of the growing subjects used for evaluating health conditions among patients who suffer from specific ailment.It has been assumed that identification of the variables is able to mirror the one's general health conditions.However,formulating the extent of contribution is multiple variable towards overall health... Continue Reading
ABSTRACT In our present society, wireless communication has become most prevalent and mobile units (hand held devices /GSM/ Computers) are one of the most common equipments for conveying messages wirelessly. Owing to the fact that most youths in our society today on the average make use of at least one wireless device that communicate with other... Continue Reading
ABSTRACT In our present society, wireless communication has become most prevalent and mobile units (hand held devices /GSM/ Computers) are one of the most common equipments for conveying messages wirelessly. Owing to the fact that most youths in our society today on the average make use of at least one wireless device that communicate with other... Continue Reading
ABSTRACT Condition monitoring of Electrical power equipment has attracted considerable attention for years. The aim of this work is to use Fuzzy logic (FL) Tool Box in building a simulation system that will diagnose all kinds of incipient faults, phase to phase fault and overloading in a transformer and monitor its conditions. Current and rate of... Continue Reading
Health related quality of life is one of the growing subjects used for evaluating health conditions among patients who suffer from specific ailment.It has been assumed that identification of the variables is able to mirror the one's general health conditions.However,formulating the extent of contribution is multiple variable towards overall... Continue Reading
ABSTRACT Condition monitoring of Electrical power equipment has attracted considerable attention for years. The aim of this work is to use Fuzzy logic (FL) Tool Box in building a simulation system that will diagnose all kinds of incipient faults, phase to phase fault and overloading in a transformer and monitor its conditions. Current and rate of... Continue Reading
Call Us
whatsappWhatsApp Us