1 / 13

Visualizing Results of Data Mining Source Code

Visualizing Results of Data Mining Source Code . by Mike McCallie. I want to combine Data Mining tools + Visualization tools I am motivated in using information in various forms to make informed decisions

alagan
Download Presentation

Visualizing Results of Data Mining Source Code

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Visualizing Results of Data Mining Source Code by Mike McCallie

  2. I want to combine Data Mining tools + Visualization tools • I am motivated in using information in various forms to make informed decisions • I believe inherit software structure (compliable source code) has an advantage over free-form text from a data mining perspective • I wish to “mine” data from source code and “build” visual models of code representation that are useful from a software engineer’s perspective Thoughts

  3. Exploring tools at Moose for Data exploration

  4. Exploring “Code City” for Visual Representation CodeCity is programmed in VisualWorks Smalltalk on top of the Moose platform, uses OpenGL for rendering Classes are represented as buildings in the city. Packages are depicted as the districts in which the buildings reside.

  5. Conceptual Model “Mining” Algorithms Data Output SourceCode Data Mining “Engine” Visualization “Engine” Visual Results

  6. Theoretical Discussion • Data mining and visualization investigation • 80’s and 90’s focus on program comprehension • What worked • What were dead-ends (as important as what worked IMHO) • Literature review on program comprehension • Gestalt principles were explored in previous class • Results of past empirical studies Thesis Approach – Part i

  7. Motivating Scenario • Problem that is not too big, but not too small • “Bob the programmer was given the assignment to add enhancement X to legacy system Y.” • Bob has ability to mine data from source code and visualize results • Question: What information is MOST relevant for Bob to succeed? (bound problem) Thesis Approach – Part 1

  8. Implementation • Moose tools for software analysis • Code City for software visualization • Source Code Analysis: • Public domain: Analyzing JHotDraw • Private domain: Analyzing 20+ year old legacy system at present employer Thesis Approach – Part 2

  9. JHotDraw Framework Classes Model Design Patterns Role-Model-Enhanced Class Model

  10. Thesis Approach – Part 3 • Empirical Study – Compare resultant artifacts JHotDraw Artifacts Data Mining “Engine” + Visualization “Engine” JHotDraw Source Code Compare to existing JHotDraw artifacts Legacy System Artifacts Legacy System Source Code Compare to existing Legacy System “expertise”

  11. Thesis Approach – Part 4 • Results and Conclusions… “Rule of Thumb” Mathematical Model “I am very curious how close to a workable mathematical model I can create based on the findings of my empirical study”

  12. Dr. ParvathiChundi • Dr. Bill Mahoney • Dr. Harvey Siy A big thank you to my thesis committee

  13. Questions • Comments • Concerns • Observations • Puns • Jokes • Limericks • etc. And thank you for your time as well…

More Related