Big Events. Hans-Arno Jacobsen Middleware Systems Research Group MSRG.org. Big Event Data. Traditional Big Data Domain vs. Rest of Universe. There are other emerging domains with needs similar to Big Data Smart grids Smart cities ….
Middleware Systems Research Group
My first message: There are other relevant Big Data domains –
Big event data challenge
~2.3 TB per year and 1k panels
High frequency measurements required
Several metrics of interest, many spatially distributed measurement points
Source: National Oceanic & Atmospheric Administration (U.S.)
~ 0.5 TB per year
and 1k vehicles
Source: Auto21 Project, University of Winnipeg
~ 27.5 PB per year
and 1k homes
Source: UCI Machine Learning Repository
My second message: Detecting events in real-time in the sea of Big Data is just as important.
Event Stream Processing
Linearly ordered event sequences
Schema-based, single schema per stream
Stream tuples follow schema
More single-expression processing-based
Aggregation is a key requirement
Focused on processing queries/expressions over event streams
My final message: Big Data Benchmarking efforts should take this into account.