Computer Network And The World Wide Web System

Introduction
The past thirty years have seen increasingly rapid advances in the field of databases. Moreover, the amount of data stored in electronic format has increased dramatically, and data continues to accumulate at a very quick rate. In addition, the volume of information in the world has been projected to double every two years. Health care and financial database systems, for example, are notable instances of the types of data that are being collected in dramatically increasing amounts. In fact, we live in a world where vast amounts of data are collected daily, and we cannot avoid interacting with data because we are living in the data age. There are terabytes or …

These necessities have prompted the conception of Data Mining, which has been moving us from the data age toward the coming information age. A considerable amount of literature has been published on Data Mining, and the aim of this survey is to present the ideas behind the processes, purposes, and techniques of Data Mining. [1][2]
1. What is Data Mining
In everyday life, the word ‘mining’ refers to a process that extracts a small set of valuable pieces from a great deal of raw material, as in mining gold from rocks or sand. According to [3], Data Mining, also known as Knowledge Discovery in Databases (KDD), is the process of extracting implicit, previously unknown, and potentially useful information from databases, using a number of different techniques such as clustering, data summarization, learning classification, finding dependency networks, analyzing changes, and detecting anomalies. Data Mining thus refers to a variety of techniques that can be used to analyze and observe databases in order to find relationships or to summarize the data in ways that can be put to use in areas such as decision making, prediction, and estimation. Doing so involves a sequence of processes [2], as shown in Figure (1.1).

(1) A petabyte is a unit of measurement for computer data storage; it is equal to a thousand terabytes, or 1 million gigabytes.

1. Data cleaning: that is the process where noise
