

CS 551 Research Track

Filtering and Comparing of Classification Trees Using XML

- Sachin Singh

Data Mining - Concepts
  • Extracting meaningful knowledge from huge volumes of ‘raw’ data.
  • Types
    • Association
    • Classification
    • Temporal
Classification Method
  • Prediction model
  • The C4.5 Tree algorithm
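
As a rough illustration of how a C4.5-style tree chooses its tests, the sketch below computes the gain ratio of a categorical attribute, the split criterion C4.5 is known for. The function names, the list-of-dicts row layout, and the sample data are assumptions made for this example; they do not come from the presentation.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def gain_ratio(rows, attribute, target):
    """Gain ratio of splitting `rows` (a list of dicts) on a categorical attribute."""
    # Group the target labels by the value of the candidate attribute.
    partitions = {}
    for row in rows:
        partitions.setdefault(row[attribute], []).append(row[target])

    total = len(rows)
    # Weighted entropy that remains after the split.
    remainder = sum(len(p) / total * entropy(p) for p in partitions.values())
    info_gain = entropy([row[target] for row in rows]) - remainder
    # Split information penalises attributes with many distinct values.
    split_info = entropy([row[attribute] for row in rows])
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical sample data: "outlook" separates the classes perfectly.
sample_rows = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "sunny", "play": "no"},
    {"outlook": "rain", "play": "yes"},
    {"outlook": "rain", "play": "yes"},
]
sample_ratio = gain_ratio(sample_rows, "outlook", "play")
```

A full C4.5 implementation also handles continuous attributes, missing values, and pruning, none of which this sketch attempts.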
Analysis of Trees
  • Current work focuses largely on generation of trees
    • Efficient algorithms
    • Very large, disk-resident data sources
    • Improving accuracy of the generated models
  • Motivation
    • Current research area: the generated trees themselves need analysis
Areas of Analysis
  • Two Sub Problems
    • Filtering Sub Problem
    • Comparison Sub Problem
Filtering Sub Problem
  • Typical data warehouses are huge!
  • Trees generated from them are “bushy”
  • Not all outcomes are significant
  • Need to filter trees based on the required outcomes
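
One way to picture this filtering step is the minimal sketch below, which keeps only the branches that lead to a required outcome. The nested-dict tree layout (`test`, `branches`, `outcome`), the function name `filter_tree`, and the loan example are all hypothetical; the presentation does not specify its algorithm.

```python
def filter_tree(node, wanted):
    """Return a pruned copy of `node` whose paths all end in a wanted outcome.

    Returns None when no path under `node` reaches a wanted outcome.
    """
    if "outcome" in node:  # leaf node
        return dict(node) if node["outcome"] in wanted else None

    kept = {}
    for value, child in node["branches"].items():
        pruned = filter_tree(child, wanted)
        if pruned is not None:
            kept[value] = pruned
    if not kept:
        return None
    return {"test": node["test"], "branches": kept}


# Hypothetical full tree for a loan decision.
full_tree = {
    "test": "income",
    "branches": {
        "high": {"outcome": "approve"},
        "low": {
            "test": "history",
            "branches": {
                "good": {"outcome": "approve"},
                "bad": {"outcome": "reject"},
            },
        },
    },
}

# Keep only the paths that end in "reject".
reject_only = filter_tree(full_tree, {"reject"})
```

Here `reject_only` retains the single path income = low, history = bad, and the original tree is left unmodified.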
Filtering Sub Problem

[Figure: Full Classification Tree and Filtered Classification Tree]

Filtering Sub Problem
  • Advantages
    • Efficient querying, faster results
    • Easily managed
    • Useful for comparison sub problem
Comparison Sub Problem
  • Need to monitor changes in data trends by comparing the classification trees
  • Levels of changes identified
    • Change in test (partition) value
    • Change in the partitions
    • Change in node levels
    • Change in outcome (leaves)
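
The levels of change listed above can be illustrated with a small recursive comparison. This is only a sketch under assumptions: the nested-dict tree layout, the function name `compare_trees`, and the way each level maps onto a reported change kind (`test`, `partition`, `level`, `outcome`) are one possible interpretation, not the presenter's algorithm.

```python
def compare_trees(old, new, path="root"):
    """Return a list of (kind, path, old_value, new_value) differences."""
    old_is_leaf, new_is_leaf = "outcome" in old, "outcome" in new

    if old_is_leaf and new_is_leaf:
        if old["outcome"] != new["outcome"]:
            return [("outcome", path, old["outcome"], new["outcome"])]
        return []

    if old_is_leaf != new_is_leaf:
        # A leaf became a subtree (or vice versa): the node levels changed here.
        return [("level", path,
                 "leaf" if old_is_leaf else "subtree",
                 "leaf" if new_is_leaf else "subtree")]

    changes = []
    if old["test"] != new["test"]:
        changes.append(("test", path, old["test"], new["test"]))

    old_keys, new_keys = set(old["branches"]), set(new["branches"])
    if old_keys != new_keys:
        changes.append(("partition", path, sorted(old_keys), sorted(new_keys)))

    # Recurse only into the partitions both trees share.
    for key in sorted(old_keys & new_keys):
        changes += compare_trees(old["branches"][key], new["branches"][key],
                                 f"{path}/{key}")
    return changes


# Hypothetical trees built from two snapshots of the same data source.
old_tree = {
    "test": "age <= 30",
    "branches": {"yes": {"outcome": "low"}, "no": {"outcome": "high"}},
}
new_tree = {
    "test": "age <= 35",
    "branches": {
        "yes": {"outcome": "low"},
        "no": {
            "test": "income <= 50",
            "branches": {"yes": {"outcome": "high"}, "no": {"outcome": "low"}},
        },
    },
}
differences = compare_trees(old_tree, new_tree)
```

For these two trees the sketch reports a changed test value at the root and a level change under the `no` branch. It assumes both trees use the same branch labels, which is exactly the kind of standard structure that real trees may lack.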
Comparison Sub Problem
  • Issues
    • The structure of the trees is unpredictable
    • Two trees must be compared without any standard structure
Solution
  • XML Trees
    • Convert the tree structure into XML files
    • XML is inherently tree-structured
    • Take advantage of existing XML-related technologies
    • Standard specifications
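
To make the XML idea concrete, the sketch below converts a small tree into XML with Python's standard `xml.etree.ElementTree` module and then queries it with a path expression. The element and attribute names (`node`, `leaf`, `test`, `branch`, `outcome`) and the nested-dict input layout are assumptions chosen for illustration; the presentation does not define a schema.

```python
import xml.etree.ElementTree as ET


def tree_to_xml(node, branch=None):
    """Convert a nested-dict classification tree into an XML element."""
    if "outcome" in node:
        elem = ET.Element("leaf", outcome=node["outcome"])
    else:
        elem = ET.Element("node", test=node["test"])
        for value, child in node["branches"].items():
            elem.append(tree_to_xml(child, branch=value))
    if branch is not None:
        # Record which partition of the parent's test leads to this element.
        elem.set("branch", branch)
    return elem


# Hypothetical tree for a loan decision.
loan_tree = {
    "test": "income",
    "branches": {
        "high": {"outcome": "approve"},
        "low": {
            "test": "history",
            "branches": {
                "good": {"outcome": "approve"},
                "bad": {"outcome": "reject"},
            },
        },
    },
}

root = tree_to_xml(loan_tree)
xml_text = ET.tostring(root, encoding="unicode")

# Existing XML tooling can then query the tree, e.g. all "reject" leaves.
reject_leaves = root.findall(".//leaf[@outcome='reject']")
```

Once the tree is in this form, standard XML tooling such as path queries can stand in for hand-written traversal code.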
Approach
  • Devise algorithms to solve the filtering and comparison problems
  • Analyze the results of the comparison in logical terms
  • Measure the efficiency of the algorithms through their time and space complexities

Suggestions preferred over questions!

Thank you!
