Modification of information gain measure to select the best group of attributes in a data set for a binary decision tree inducer

dc.contributor.authorRathnayaka, N.S.
dc.contributor.authorWijerathna, J. K.
dc.date.accessioned2022-01-28T04:15:12Z
dc.date.available2022-01-28T04:15:12Z
dc.date.issued2015
dc.description.abstractClassification is one of the frequently used techniques in data mining processes which can be applied to accurately predict the target class for each case in a data set. The Decision tree (DT) algorithms are one of the powerful classification and prediction methods which facilitate decision making in sequential decision making for a given dataset (Han & Kamber, 2006; Bramer, 2007). The major strengths of the DT algorithms are their ability to generate understandable rules, to handle both numerical and categorical attributes and also provide a clear indication of which attributes are most salient for prediction or classification (Kangaiammal, 2013). ID3 and C4.5 are multi splitting algorithms and developed by J. Ross Quinlan in1986 and 1993 respectively. That can be used to Entropy, information gain (IG) and Gain ratio as attribute selection measures. These measurements can be utilized to make the binary decision tree to reduce the complexity of the decision tree. If the algorithm identifies more than one attributes with equal IG in the data set, then it will select the initial attribute as a splitting node of a tree. This attribute may not be the best attribute for decision making when it is compared with the other attributes of equal IG. Therefore, the aim of this study is to improve the IG measure to select the best attribute in a dataset and plot a binary decision tree.en_US
dc.identifier.isbn9789550481088
dc.identifier.urihttp://www.erepo.lib.uwu.ac.lk/bitstream/handle/123456789/8240/14-CST-Modification%20of%20information%20gain%20measure%20to%20select%20the%20best%20group%20of%20attributes%20.pdf?sequence=1&isAllowed=y
dc.language.isoenen_US
dc.publisherUva Wellassa University of Sri Lankaen_US
dc.subjectScience and Technologyen_US
dc.subjectTechnologyen_US
dc.subjectInformation Technologyen_US
dc.subjectDatabaseen_US
dc.subjectSystemen_US
dc.titleModification of information gain measure to select the best group of attributes in a data set for a binary decision tree induceren_US
dc.title.alternativeResearch Symposium 2015en_US
dc.typeOtheren_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
14-CST-Modification of information gain measure to select the best group of attributes .pdf
Size:
422.33 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: