Data Preprocessing Shailaja K.P What Is Data

Author : luanne-stotts | Published Date : 2025-05-22



Transcript: Data Preprocessing Shailaja K.P What Is Data
Data Preprocessing
Shailaja K.P
8/2/2022

What Is Data Mining?

Many people treat data mining as a synonym for another popularly used term, knowledge discovery from data, or KDD, while others view data mining as merely an essential step in the process of knowledge discovery.

The knowledge discovery process is shown in the figure below as an iterative sequence.

The knowledge discovery process has the following steps:
1. Data cleaning (to remove noise and inconsistent data)
2. Data integration (where multiple data sources may be combined)
3. Data selection (where data relevant to the analysis task are retrieved from the database)
4. Data transformation (where data are transformed and consolidated into forms appropriate for mining by performing summary or aggregation operations)
5. Data mining (an essential process where intelligent methods are applied to extract data patterns)
6. Pattern evaluation (to identify the truly interesting patterns representing knowledge based on interestingness measures)
7. Knowledge presentation (where visualization and knowledge representation techniques are used to present mined knowledge to users)

Steps 1 through 4 are different forms of data preprocessing, where data are prepared for mining. The data mining step may interact with the user or a knowledge base. The interesting patterns are presented to the user and may be stored as new knowledge in the knowledge base.

“Data mining is the process of discovering interesting patterns and knowledge from large amounts of data. The data sources can include databases, data warehouses, the Web, other information repositories, or data that are streamed into the system dynamically.”

Data Preprocessing

Today’s real-world databases are highly susceptible to noisy, missing, and inconsistent data due to their typically huge size and their origin from multiple, heterogeneous sources. Low-quality data will lead to low-quality mining results.

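To make the four preprocessing steps of the knowledge discovery process (cleaning, integration, selection, transformation) more concrete, here is a minimal sketch in plain Python. The records, field names, and helper functions are all hypothetical and chosen only for illustration; the slides do not prescribe any particular implementation, and real pipelines would usually rely on a database or a data-frame library.

```python
from collections import defaultdict

# Hypothetical toy records from two sources (not taken from the slides).
sales = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 2, "amount": None},   # missing value
    {"customer_id": 1, "amount": 35.0},
    {"customer_id": 3, "amount": 15.0},
]
customers = [
    {"customer_id": 1, "city": "Pune"},
    {"customer_id": 3, "city": "Mysuru"},
]

def clean(records):
    """Step 1, data cleaning: drop records that contain a missing value."""
    return [r for r in records if all(v is not None for v in r.values())]

def integrate(left, right, key):
    """Step 2, data integration: inner-join two sources on a shared key."""
    index = {r[key]: r for r in right}
    return [{**row, **index[row[key]]} for row in left if row[key] in index]

def select(records, fields):
    """Step 3, data selection: keep only the task-relevant attributes."""
    return [{f: r[f] for f in fields} for r in records]

def transform(records, group_field, value_field):
    """Step 4, data transformation: consolidate by summing per group."""
    totals = defaultdict(float)
    for r in records:
        totals[r[group_field]] += r[value_field]
    return dict(totals)

merged = integrate(clean(sales), customers, "customer_id")
prepared = transform(select(merged, ["city", "amount"]), "city", "amount")
print(prepared)
```

Dropping incomplete records is only one possible cleaning strategy; filling in missing values is another, and the choice depends on the data and the mining task.
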
“How can the data be preprocessed in order to help improve the quality of the data and, consequently, of the mining results? How can the data be preprocessed so as to improve the efficiency and ease of the mining process?”

There are several data preprocessing techniques. Data cleaning can be applied to remove noise and correct inconsistencies in data. Data integration merges data from multiple sources into a coherent data store such as a data warehouse. Data reduction can reduce data size by aggregating, eliminating redundant features, or clustering. Data transformations (e.g., normalization) may be applied, where data are scaled to fall within
