Weather mining
Download
1 / 13

Weather Mining - PowerPoint PPT Presentation


  • 131 Views
  • Uploaded on

Weather Mining. Hayato Akatsuka. Objective. Cluster a region which shares similar climate. Input. Each weather station in the United States is an input Each station contains more than 50 parameters i.e. Latitude, Longitude, Elevation, Minimum Temperature, Maximum Temperature, so on….

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Weather Mining' - lane


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Weather mining l.jpg

Weather Mining

Hayato Akatsuka


Objective l.jpg
Objective

  • Cluster a region which shares similar climate.


Input l.jpg
Input

  • Each weather station in the United States is an input

  • Each station contains more than 50 parameters

    • i.e. Latitude, Longitude, Elevation, Minimum Temperature, Maximum Temperature, so on…


Stations l.jpg
Stations

  • 6000 ~ 19000 Stations


Overview l.jpg
Overview

Input (text file)

Station1 2005/01/01 MaxTemp MinTemp Lantitude Longitude Elevation ….Station2 2005/01/01 MaxTemp MinTemp Lantitude Longitude Elevation ….Station3 2005/01/01 MaxTemp MinTemp Lantitude Longitude Elevation …..

output(Image)

Clustering


Distance measure l.jpg
Distance Measure

  • Euclidean Distance

If you are interested in some particular parameters, adjust k accordingly


About clustering l.jpg
About Clustering

  • Day 1(Hierachical Clustering)

    • This is an initialization Stage.

    • Pick a number of clusters

    • Then, Perform Hierarchical Clustering

  • Day 2(Clustering variant)

    • For each input, cluster with the nearest centroid obtained from the previous day (Day 1 in this case).

    • Do not update centroid

    • Repeat until you cluster all the input for Day 2.

    • Recalculate centroid

  • Day 3

    • Repeat Day2 ….


Centroid calculation l.jpg
Centroid Calculation

  • For same cluster

2nd Day:

3rd Day:

4th Day:


Quick animation l.jpg
Quick Animation

Day2

Day1


Result l.jpg
Result

  • For simplicity, just use only 1 parameter (TMIN). Number of Clusters = 5


Comparison l.jpg
Comparison

Output

Hardiness Zone


Conclusion l.jpg
Conclusion

  • Well… there are not much different between a map I received from January and one from December.

  • Simply making a map out of annual data, instead of daily data, might be better.


Reference l.jpg
Reference

  • Hardiness Map http://www.arborday.org/treeinfo/zonelookup.cfm