Hemera kickoff october 5th 2010
This presentation is the property of its rightful owner.
Sponsored Links
1 / 6

Hemera KickOff October 5th, 2010 PowerPoint PPT Presentation


  • 86 Views
  • Uploaded on
  • Presentation posted in: General

Hemera KickOff October 5th, 2010. Working Group B5 Efficient management of very large volumes of information for data-intensive applications Gabriel Antoniu, Jean-Marc Pierson. Challenges. Tremendous volumes of data (up to Petabytes), increasing every year

Download Presentation

Hemera KickOff October 5th, 2010

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Hemera kickoff october 5th 2010

Hemera KickOffOctober 5th, 2010

Working Group B5

Efficient management of very large volumes of information for data-intensive applications

Gabriel Antoniu, Jean-Marc Pierson


Challenges

Challenges

  • Tremendous volumes of data (up to Petabytes), increasing every year

  • Cloud infrastructures enforce this trend

  • Large span of diverse applications

  • Different modalities of data: images, text, video, raw values

  • Distributed, heterogeneous, structured or not, semantically (en-)riched, confidential

  • Stored in DFS or DDB, Cloud storage services, Warehouses


Aim of the wg

Aim of the WG

  • Explore research issues related to high-level services for information management (search, mining, visualisation, processing)

  • For large volumes of distributed data

  • Taking into account

    • security, efficiency and heterogeneity

    • applications requirements

    • and the execution infrastructure (grids, clouds)


Issues to be addressed

Issues to be addressed

  • Low-level:

    • Fault-tolerance, caching, transport, security (encryption, confidentiality), consistency, location transparency

  • Intermediate-level:

    • Interoperability among storage systems

    • Data indexing

  • High-level:

    • Data mining, data classification, data assimilation, knowledge extraction, data visualization

    • Metadata management


Communities involved

Communities involved

  • Distributed applications

  • Distributed systems

    • clusters, grids, P2P, clouds

  • Fault-tolerant systems

  • Databases, data mining

  • Security

  • Numerical algorithms


Roadmap

Roadmap

  • Identify research teams

    • Active in the area of the WG

    • With experience in data-intensive applications on Aladdin-G5K

    • And new comers…

  • Organize workshops and possibly schools to share and disseminate experience and knowledge


  • Login