Sachin singh
Download
1 / 16

- Sachin Singh - PowerPoint PPT Presentation


  • 470 Views
  • Updated On :

CS 551 Research Track Filtering and Comparing of Classification trees using XML - Sachin Singh Data Mining - Concepts Extracting meaningful knowledge from huge chunk of ‘raw’ data. Types Association Classification Temporal Classification Method Prediction model

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about '- Sachin Singh' - benjamin


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Sachin singh l.jpg

CS 551 Research Track

Filtering and Comparing of

Classification trees using XML

- Sachin Singh


Data mining concepts l.jpg
Data Mining - Concepts

  • Extracting meaningful knowledge from huge chunk of ‘raw’ data.

  • Types

    • Association

    • Classification

    • Temporal


Classification method l.jpg
Classification Method

  • Prediction model

  • The C4.5 Tree algorithm



Analysis of trees l.jpg
Analysis of Trees

  • Current work focuses largely on generation of trees

    • Efficient algorithms

    • Disk Resident gigantic data sources

    • Improving accuracy of the generated models

  • Motivation

    • Current research area – need for analysis


Areas of analysis l.jpg
Areas of Analysis

  • Two Sub Problems

    • Filtering Sub Problem

    • Comparison Sub Problem


Filtering sub problem l.jpg
Filtering Sub Problem

  • Typical data warehouses are huge !!

  • Generation of “Bushy” trees

  • Not all outcomes are significant

  • Need to filter trees based on the required outcomes


Filtering sub problem8 l.jpg
Filtering Sub Problem

Filtered Classification Tree

Full Classification Tree


Filtering sub problem9 l.jpg
Filtering Sub Problem

  • Advantages

    • Efficient querying. Faster results

    • Easy Managed

    • Useful for comparison sub problem


Comparison sub problem l.jpg
Comparison Sub Problem

  • Need to monitor changes in data trends by comparing the classification trees

  • Levels of changes identified

    • Change in test (partition) value

    • Change in the partitions

    • Change in node levels

    • Change in outcome(leaves)


Comparison sub problem11 l.jpg
Comparison Sub Problem

  • Issues

    • Structure of trees unpredictable

    • Comparing two trees with no standard structure


Solution l.jpg
Solution

  • XML Trees

    • Convert the tree structure in XML files

    • XML inherently tree structure

    • Take advantage of existing XML related technologies

    • Standard specs



Approach l.jpg
Approach

  • Devise Algorithms to solve filtering and comparison problems

  • Analyzing results of comparison in logical terms

  • Measuring efficiency of the algorithms through time and space complexities



Slide16 l.jpg

Suggestions Preferred !!

Over questions !!

Thank You !!


ad