Hemera KickOff October 5th, 2010



Working Group B5

Efficient management of very large volumes of information for data-intensive applications

Gabriel Antoniu, Jean-Marc Pierson

Challenges
  • Tremendous volumes of data (up to Petabytes), increasing every year
  • Cloud infrastructures reinforce this trend
  • Large span of diverse applications
  • Different modalities of data: images, text, video, raw values
  • Distributed, heterogeneous, structured or not, semantically (en-)riched, confidential
  • Stored in DFS or DDB, Cloud storage services, Warehouses
Aim of the WG
  • Explore research issues related to high-level services for information management (search, mining, visualisation, processing)
  • For large volumes of distributed data
  • Taking into account
    • security, efficiency and heterogeneity
    • application requirements
    • and the execution infrastructure (grids, clouds)
Issues to be addressed
  • Low-level:
    • Fault-tolerance, caching, transport, security (encryption, confidentiality), consistency, location transparency
  • Intermediate-level:
    • Interoperability among storage systems
    • Data indexing
  • High-level:
    • Data mining, data classification, data assimilation, knowledge extraction, data visualization
    • Metadata management
Communities involved
  • Distributed applications
  • Distributed systems
    • clusters, grids, P2P, clouds
  • Fault-tolerant systems
  • Databases, data mining
  • Security
  • Numerical algorithms
Roadmap
  • Identify research teams
    • Active in the area of the WG
    • With experience in data-intensive applications on Aladdin-G5K
    • And newcomers…
  • Organize workshops and possibly schools to share and disseminate experience and knowledge