MINING SIMILAR PATTERN WITH ATTRIBUTE ORIENTED INDUCTION HIGH LEVEL EMERGING PATTERN (AOI-HEP) DATA MINING TECHNIQUE

Harco Leslie Hendric Spits Warnars, Nizirwan Anwar, Richard Randriatoamanana, Horacio Emilio Perez Sanchez

Abstract


AOI-HEP (Attribute Oriented Induction High Emerging Pattern) as new data mining technique has been success to mine frequent pattern and is extended to mine similar patterns. AOI-HEP is success to mine 3 and 1 similar patterns from IPUMS and breast cancer UCI machine learning datasets respectively. Meanwhile, the experiments showed that there was no finding similar patterns on adult and census UCI machine learning datasets. The experiments showed that finding AOI-HEP similar pattern in dataset is influenced by learning on chosen high level concept attribute in concept hierarchy and it is applied to AOI-HEP frequent pattern in previous research as well. The experiments chosed high level concept attributes such as workclass, clump thickness, means and marts for adult, breast cancer, census and IPUMS datasets respectively. In order to proof that the chosen high level concept attribute will influences the AOI-HEP similar pattern in dataset, then extended experiments were carried on and the finding were census dataset which had been none AOI-HEP similar pattern, had AOI-HEP similar pattern when learned on high level concept in marital attribute. Meanwhile, Breast cancer which had been had 1 AOI-HEP similar pattern, had none AOI-HEP similar pattern when learned on high level concept in attributes such as cell size, cell shape and bare nuclei. The 2 of 3 finding Similar patterns in IPUMS dataset have strong discriminant rule since having large growth rates such as 1.53% and 3.47%, and having large supports in target dataset such as 4.54% and 5.45 respectively. Moreover, there have small supports in contrasting dataset such as 2.96% and 1.57% respectively.         


Keywords


Similar pattern, Data Mining, AOI-HEP, High Level Emerging Pattern, Attribute Oriented Induction, discriminant rule

Full Text:

PDF

References


Warnars, H. L. H. S. 2012. Attribute Oriented Induction of High-level Emerging Patterns. IEEE International Symposium on Foundations and Frontiers of Data Mining in conjunction with IEEE International Conference on Granular Computing (IEEE GrC2012), Hangzhou, China, 11-13 August 2012.

Warnars, H. L. H. S. 2014. Mining Frequent Pattern with Attribute Oriented Induction High level Emerging Pattern (AOI-HEP). IEEE the 2nd International Conference on Information and Communication Technology (IEEE ICoICT 2014), Bandung, Indonesia, 28-30 May 2014. 144-149.

Warnars, H. L. H. S. 2014. Attribute Oriented Induction High Level Emerging Pattern (AOI-HEP) Future Research. IEEE the 8th International Conference on Information & Communication Technology and Systems (ICTS), Surabaya, Indonesia, 24-25 September 2014. 13-18.

Warnars, H. L. H. S. 2015. Mining Patterns with Attribute Oriented Induction, the International Conference on Database, Data Warehouse, Data Mining and Big Data (DDDMBD2015), Tangerang, Indonesia, 10-12 September 2015.

Warnars, H. L. H. S. 2014. Mining Frequent and Similar Patterns with Attribute Oriented Induction High Level Emerging Pattern (AOI-HEP) Data Mining Technique. International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS). 3(11): 266-276.

Warnars, H. L. H. S. 2016. Using Attribute Oriented Induction High level Emerging Pattern (AOI-HEP) to mine Frequent Pattern. International Journal of Electrical and Computer Engineering (IJECE). Desember 2016. 6(6).

Frank, A. and Asuncion, A. 2010. UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

Warnars, H. L. H. S. 2010. Measuring Interesting rules in characteristic rule. Proceeding of the 2nd International Conference on Soft Computing, Intelligent System and Information Technology (ICSIIT), Bali, Indonesia. 152-156.

Ramamohanarao, K., Bailey, J. and Fan, H. 2005. Efficient Mining of Contrast Patterns and Their Applications to Classification. Proceedings of the 3rd International Conference on Intelligent Sensing and Information Processing (ICISIP '05), IEEE Computer Society. 39-47.

Fan, H. and Ramamohanarao, K. 2003. A Bayesian Approach to Use Emerging Patterns for Classification. In Proceedings Of The 14th Australasian Database Conference (ADC '03). 39-48.

Dong, G. and Li, J. 1999. Efficient Mining of Emerging Patterns: Discovering Trends and Differences. Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 43-52.

Podraza, R. and Tomaszewski, K. 2005. KTDA: Emerging Patterns Based Data Analysis System. Proceedings of XXI Fall Meeting of Polish Information Processing Society. 213-221.

Dong, G. and Li, J. 2005. Mining Border Description of Emerging Patterns from Dataset Pairs. Journal of Knowledge and Information Systems. 8(2): 178-202.

Dong, G., Zhang, X., Wong, L. and Li, J. 1999. CAEP: Classification by Aggregating Emerging Patterns. Proceeding of the 2nd international Conference on Discovery Science S. Arikawa and K. Furukawa, Eds. Lecture Notes in Computer Science. Springer-Verlag, London. 1721: 30-42.

Han, J., Cai, Y. and Cercone, N. 1992. Knowledge Discovery in Databases: An Attributed Approach. Proceeding of the 18th International Conference on Very Large Data Bases. 547-559.

Cai, Y., Cercone, N. and Han, J. 1990. An Attribute-Oriented Approach for Learning Classification Rules From Relational Databases. Proceedings of 6th International Conference on Data Engineering. 281-288.

Li, J., Dong, G. and Ramamohanarao, K. 2000. Instance-based Classification by Emerging Patterns. Proceeding of the 4th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD’00). 191-200.




DOI: http://dx.doi.org/10.11113/jt.v79.11876

Refbacks

  • There are currently no refbacks.


Copyright © 2012 Penerbit UTM Press, Universiti Teknologi Malaysia.
Disclaimer : This website has been updated to the best of our knowledge to be accurate. However, Universiti Teknologi Malaysia shall not be liable for any loss or damage caused by the usage of any information obtained from this web site.
Best viewed: Mozilla Firefox 4.0 & Google Chrome at 1024 × 768 resolution.