From Data Mining to Knowledge Mining




Kaufman, Kenneth A.
Michalski, Ryszard S.

Journal Title

Journal ISSN

Volume Title



In view of the tremendous production of computer data worldwide, there is a strong need for new powerful tools that can automatically generate useful knowledge from a variety of data, and present it in human-oriented forms. In efforts to satisfy this need, researchers have been exploring ideas and methods developed in machine learning, statistical data analysis, data mining, text mining, data visualization, pattern recognition, etc. The first part of this chapter is a compendium of ideas on the applicability of symbolic machine learning and logical data analysis methods toward this goal. The second part outlines a multistrategy methodology for an emerging research direction, called knowledge mining, by which we mean the derivation of high-level concepts and descriptions from data through symbolic reasoning involving both data and relevant background knowledge. The effective use of background as well as previously created knowledge in reasoning about new data makes it possible for the knowledge mining system to derive useful new knowledge not only from large amounts of data, but also from limited and weakly relevant data.




Kaufman, K. and Michalski, R. S., "From Data Mining to Knowledge Mining," Handbook in Statistics, Vol. 24: Data Mining and Data Visualization, Rao, C.R., Solka, J.L. and Wegman, E.J. (Eds.), 47-75, Elsevier/North Holland, 2005.