Modification of information gain measure to select the best group of attributes in a data set for a binary decision tree inducer
dc.contributor.author | Rathnayaka, N.S. | |
dc.contributor.author | Wijerathna, J. K. | |
dc.date.accessioned | 2022-01-28T04:15:12Z | |
dc.date.available | 2022-01-28T04:15:12Z | |
dc.date.issued | 2015 | |
dc.description.abstract | Classification is one of the frequently used techniques in data mining processes which can be applied to accurately predict the target class for each case in a data set. The Decision tree (DT) algorithms are one of the powerful classification and prediction methods which facilitate decision making in sequential decision making for a given dataset (Han & Kamber, 2006; Bramer, 2007). The major strengths of the DT algorithms are their ability to generate understandable rules, to handle both numerical and categorical attributes and also provide a clear indication of which attributes are most salient for prediction or classification (Kangaiammal, 2013). ID3 and C4.5 are multi splitting algorithms and developed by J. Ross Quinlan in1986 and 1993 respectively. That can be used to Entropy, information gain (IG) and Gain ratio as attribute selection measures. These measurements can be utilized to make the binary decision tree to reduce the complexity of the decision tree. If the algorithm identifies more than one attributes with equal IG in the data set, then it will select the initial attribute as a splitting node of a tree. This attribute may not be the best attribute for decision making when it is compared with the other attributes of equal IG. Therefore, the aim of this study is to improve the IG measure to select the best attribute in a dataset and plot a binary decision tree. | en_US |
dc.identifier.isbn | 9789550481088 | |
dc.identifier.uri | http://www.erepo.lib.uwu.ac.lk/bitstream/handle/123456789/8240/14-CST-Modification%20of%20information%20gain%20measure%20to%20select%20the%20best%20group%20of%20attributes%20.pdf?sequence=1&isAllowed=y | |
dc.language.iso | en | en_US |
dc.publisher | Uva Wellassa University of Sri Lanka | en_US |
dc.subject | Science and Technology | en_US |
dc.subject | Technology | en_US |
dc.subject | Information Technology | en_US |
dc.subject | Database | en_US |
dc.subject | System | en_US |
dc.title | Modification of information gain measure to select the best group of attributes in a data set for a binary decision tree inducer | en_US |
dc.title.alternative | Research Symposium 2015 | en_US |
dc.type | Other | en_US |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- 14-CST-Modification of information gain measure to select the best group of attributes .pdf
- Size:
- 422.33 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: