Transportation Secure Data Center
This presentation is the property of its rightful owner.
Sponsored Links
1 / 27

Transportation Secure Data Center PowerPoint PPT Presentation

  • Uploaded on
  • Presentation posted in: General

Transportation Secure Data Center. Elaine Murakami FHWA Office of Planning Washington, DC. Agenda. Motivations Different approach to traditional research centers Datasets currently available, examples of analyses Processing steps taken by NREL Data access using VMWare.

Download Presentation

Transportation Secure Data Center

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript

Transportation secure data center

Transportation Secure Data Center

Elaine Murakami

FHWA Office of Planning

Washington, DC



  • Motivations

  • Different approach to traditional research centers

  • Datasets currently available, examples of analyses

  • Processing steps taken by NREL

  • Data access using VMWare

Transportation secure data center tsdc

Transportation Secure Data Center (TSDC)

  • Goal: Securely archive and provide public access to detailed transportation data

    • Transportation research has numerous topics of interest, many of which can be explored using GPS devices placed in a vehicle or on a person

    • Publicly-available GPS-based data sets are rare

      • Data sets are expensive to collect, and difficult to interpret

      • Sharing transportation data places participants privacy at risk limiting its distribution

Comparisons to a traditional research data center

Comparisons to a traditional research data center

Benefits of the tsdc

Benefits of the TSDC

Some of the data available through the secure controlled access portal

Some of the Data Available through the Secure Controlled Access Portal

You are limited only by your imagination

You are limited only by your imagination

Seattle puget sound traffic choices fhwa value pricing project by psrc

Seattle - Puget Sound Traffic Choices (FHWA value pricing project, by PSRC

  • GPS recording at X per minute; insufficient for drive cycle processing, but still useful for spatial analyses

  • 447 vehicles

  • Sampling occurred between November 2004 & April 2006

  • 18 month samples


Average Speed

20-30 mph

30-40 mph

10-20 mph

0-10 mph

40- mph

Puget sound traffic choices w urbansim

Puget Sound Traffic Choices w/UrbanSim

Puget sound traffic choices average week location of vehicle

Puget Sound Traffic Choices: average week, location of vehicle

Using real world driving behavior to estimate energy efficiency

Using Real World Driving behavior to estimate energy efficiency

Atlanta atlanta regional commission arc

Atlanta – Atlanta Regional Commission (ARC)

  • 1653 vehicles

  • Sampling occurred between March 2011 & October 2011

  • 7 Day samples



Average Speed

20-30 mph

30-40 mph

10-20 mph

0-10 mph

40- mph

  • 797 persons

  • Sampling occurred between March 2011 & September 2011

  • 7 Day samples

Linking travel to roadway func class roadway electrification study at nrel

Linking travel to roadway func class: Roadway electrification study at NREL

DRAFT only

Lexington ky gps pilot 1995 1996

Lexington KY GPS Pilot 1995-1996

Security procedures

Security - Procedures

  • Establish MOU agreement with data provider

    • Receive data via mail or secure FTP

  • Load onto secure raw data handling server

    • Building badge access

    • On-site security force

    • Room key access

    • Limited to data center staff

  • Maintain data backups

    • Data mirrored on large storage array

    • Regular tape back-up

    • Fire/disaster protection for copies

NREL Data Center Storage Arrays

Data processing

Data Processing

  • Two groups of processing routines are available to handle data sets

  • Six questions to determine how to handle the study:

    • Is the vehicle GPS sample interval greater than 0.25Hz?

    • Is study data provided?

      • Yes - Ask the remaining questions

      • No - Continue with drive cycle processing

    • Is vehicle configuration indicated?

    • Is trip level data analysis available in the original study?

    • Does the study include a wearable GPS component?

    • If a wearable GPS component is included is trip level data available?

  • If vehicle GPS data is above 0.25 Hz it is always fed through drive cycle processing

  • The study data is handled separately but a link is maintained between NREL results and the original study results

Drive cycle processing calculations results

Drive Cycle Processing: Calculations - Results

  • Calculations

    • 250+ variables characterizing the vehicle operation over the sequence are generated for each sequence

  • Filtered point data are used to build trip lines based on the sequences identified

  • Calculation results are appended to the feature


30-45 mph

45-60 mph

15-30 mph

0-15 mph

60-75 mph


Line Drawn From Points – Order assigned using time

Additional tsdc processing

Additional TSDC Processing

  • EPA Vehicle Match – Links vehicle configuration data to the EPA database and adds vehicle class(type)

  • Person Database Update – Assigns an NREL identifier number to each person

  • Unfiltered Trip Processing - Uses original trip data (start/end times) to sequence raw point data

    • Applied to both wearable and vehicle trip data when available

    • Outputs statistics indicating the quality of the data, and builds a line representing the path of travel

    • Operates on the unfiltered data only

Maintaining the link

Maintaining the Link

  • A link is maintained between NREL results and the original study data

    • All studies use either a single column or at most 2 columns to indicate a vehicle or person

    • NREL assigns a single unique integer for all vehicles and records the original study’s vehicle identifiers as a single column in the vehicle tables (applies to persons as well)

Atlanta Example:

  • sampno - is the household identifier (800042)

  • vehno - is the unique vehicle identifier relative to the household (2)

  • Original vehicle identifier assigned as (800042_2) NREL vehicle identifier assigned (54)

Tsdc master database

TSDC Master Database

  • All study data is loaded in a single database

  • Smaller databases are created for each study and transferred to the TSDC access areas

  • Study data often includes:

    • Wearable GPS add-ons

    • Survey data

    • Results for the full study

Tsdc data access

TSDC – Data Access

  • The TSDC processes data sets and provides access through two areas

    • Cleansed Download Data Area: A website where anonymized versions of the processed and original data sets are available to the public

      • Spatial reference and personally identifying information (PII) are removed


    • Secure Portal for Controlled Access: Remote connection to a virtual machine at NREL where users can log on to work with full data sets after completing a simple application process

      • Controls within the secure environment prevent data removal (e.g., no local drive sharing or external internet connection)

      • Software tools provided for working with the data

      • Users may receive aggregated results from their analyses

Action steps

ACTION steps

  • Submit YOUR GPS data into the archive

  • Use data in the archive to understand what you can do with GPS data for your area BEFORE you spend money on a GPS travel survey

  • Use the archive to support transportation, land use, energy, and emissions research

Thank you for more information

Thank you! For more information:

  • Elaine Murakami, FHWA Office of Planning

    • [email protected]

    • 206-220-4460

  • Jeff Gonder, National Renewable Energy Lab

    • [email protected]

    • 303-275-4462

Extra slides

Extra slides

Chicago chicago metropolitan agency for planning

Chicago – Chicago Metropolitan Agency for Planning

  • 408 vehicles

  • Sampling occurred between March 2007 & November 2007

  • 7 Day samples


Average Speed


20-30 mph

30-40 mph

10-20 mph

0-10 mph

40- mph

  • 209 Persons

  • Sampling occurred between September 2007 & January 2008

  • 7 Day samples

Los angeles southern california association of governments

Los Angeles – Southern California Association of Governments


Average Speed

  • 626 vehicles

  • Sampling occurred between June 2001 & March 2002

  • 2 day samples

20-30 mph

30-40 mph

10-20 mph

0-10 mph

40- mph

  • Login