DATA MINING LECTURE 10 Classification Basic
1 / 1

DATA MINING LECTURE 10 Classification Basic

Author : karlyn-bohler | Published Date : 2025-06-23

Description: DATA MINING LECTURE 10 Classification Basic Concepts Decision Trees Catching taxevasion Taxreturn data for year 2011 A new tax return for 2012 Is this a cheating tax return An instance of the classification problem learn a method for

Presentation Embed Code

Download Presentation

Download Presentation The PPT/PDF document "DATA MINING LECTURE 10 Classification Basic" is the property of its rightful owner. Permission is granted to download and print the materials on this website for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.

Transcript:DATA MINING LECTURE 10 Classification Basic:
DATA MINING LECTURE 10 Classification Basic Concepts Decision Trees Catching tax-evasion Tax-return data for year 2011 A new tax return for 2012 Is this a cheating tax return? An instance of the classification problem: learn a method for discriminating between records of different classes (cheaters vs non-cheaters) What is classification? Classification is the task of learning a target function f that maps attribute set x to one of the predefined class labels y One of the attributes is the class attribute In this case: Cheat Two class labels (or classes): Yes (1), No (0) Why classification? The target function f is known as a classification model Descriptive modeling: Explanatory tool to distinguish between objects of different classes (e.g., understand why people cheat on their taxes) Predictive modeling: Predict a class of a previously unseen record Examples of Classification Tasks Predicting tumor cells as benign or malignant Classifying credit card transactions as legitimate or fraudulent Categorizing news stories as finance, weather, entertainment, sports, etc Identifying spam email, spam web pages, adult content Understanding if a web query has commercial intent or not General approach to classification Training set consists of records with known class labels Training set is used to build a classification model A labeled test set of previously unseen data records is used to evaluate the quality of the model. The classification model is applied to new records with unknown class labels Illustrating Classification Task Evaluation of classification models Counts of test records that are correctly (or incorrectly) predicted by the classification model Confusion matrix Predicted Class Actual Class Classification Techniques Decision Tree based Methods Rule-based Methods Memory based reasoning Neural Networks Naïve Bayes and Bayesian Belief Networks Support Vector Machines Classification Techniques Decision Tree based Methods Rule-based Methods Memory based reasoning Neural Networks Naïve Bayes and Bayesian Belief Networks Support Vector Machines Decision Trees Decision tree A flow-chart-like tree structure Internal node denotes a test on an attribute Branch represents an outcome of the test Leaf nodes represent class labels or class distribution Example of a Decision Tree Refund MarSt TaxInc YES NO NO NO Yes No Married Single, Divorced < 80K > 80K Splitting Attributes Training Data Model: Decision Tree Test outcome Class labels Another Example of Decision Tree categorical categorical continuous class MarSt Refund TaxInc YES NO NO Yes No Married Single, Divorced < 80K > 80K There could be more than one tree that fits

Download Document

Here is the link to download the presentation.
"DATA MINING LECTURE 10 Classification Basic"The content belongs to its owner. You may download and print it for personal use, without modification, and keep all copyright notices. By downloading, you agree to these terms.

Related Presentations

Application of Data Mining in TV and Films Data Mining Concepts Ecological restoration of mining generated wastelands using Data Mining  Shakespeare Data Mining And Privacy Protection Introduction to Data Mining Data Mining  Shakespeare 1 Data Mining Chapter 1 Kirk Scott 1 Data Mining Chapter 1 Kirk Scott Introduction to Data Mining Data Mining Applications Core Methods in  Educational Data Mining EDUC691 Spring 2019 REVIEWED BROAD-BASED BLACK ECONOMIC EMPOWERMENT CHARTER FOR THE SOUTH AFRICAN MINING AND