high performance computing on laptops with multicores gpus
Download
Skip this Video
Download Presentation
High Performance Computing On Laptops With Multicores & GPUs

Loading in 2 Seconds...

play fullscreen
1 / 18

High Performance Computing On Laptops With Multicores & GPUs - PowerPoint PPT Presentation


  • 137 Views
  • Uploaded on

High Performance Computing On Laptops With Multicores & GPUs. Sushil K. Prasad Computer Science [email protected] About me. Research Area: Parallel and Distributed Algorithms and Systems - over multicores , GPUs, clusters, sensors, handhelds, web services, …

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' High Performance Computing On Laptops With Multicores & GPUs ' - channing-vega


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
about me
About me

Research Area: Parallel and Distributed Algorithms and Systems

- over multicores, GPUs, clusters, sensors, handhelds, web services, …

Lab: Distributed and Mobile Systems (DiMoS)

at Ga. Tech campus, 5 PhD students, 2 M.S. students

  • IEEE TCPP Chair (elected)
  • 2 NSF grants – currently looking for PhD/MS/undergraduate students
    • Distributed Algorithms
    • High Performance Cloud Computing
gpus vs multicores
GPUs Vs Multicores
  • Combined power exceeds 180 GFLOPs
intel core 2 duo multicore
Intel Core-2 Duo Multicore
  • Difficult to parallelize
  • Memory hierarchy is a barrier:
    • 1 cycle core
    • 3 cycles L1 cache
    • 14 cycles L2
    • 250 cycles RAM
gpu graphics processing unit
GPU: Graphics Processing Unit
  • Nvidia 280 GTX
  • 240 cores
  • Extreme memory hierarchy
    • Registers
    • Local memory
    • Shared memory/8 cores
    • Off chip Global Memory
    • bottleneck bus to CPU
slide7

Nvidia 8800 GTX

  • Smith Waterman Seq Alignment, Fasta, and Blast
  • Database: SwissProt
  • Manavskiand Valle 2008
parallel data structures priority queues

2

6

7

5

5

6

3

8

8

1

7

9

14

10

9

65

34

38

19

21

12

12

15

16

13

14

23

25

Parallel Data Structures -Priority Queues
  • Large Scale Event Simulation
    • Immune System Simulation
    • VLSI Logic simulation
  • Branch and Bound
  • Task Scheduling
  • Challenge: Fine Grained Systems
  • Students: DineshAgarwal, Nick Mancuso
np hard distributed problems in networks
NP-Hard Distributed Problems in Networks

NSF Grant

  • Minimum Vertex/Target Cover
  • Minimum Triangle Packing
  • Optimum mobile sensor network target tracking
  • Minimum channel assignment in mobile ad-hoc networks
  • Students: John Daigle, ThamerSulaiman
middleware for mobile ad hoc applications

2. Lookup

1. Register

Middleware for Mobile Ad–hoc Applications

Mobile Support Station

Applications

Deviceware

Process Requests

3. p2p communication

Listener

Applications

Applications

Deviceware

Process Requests

Listener

Deviceware

Groupware

Process Requests

Listener

Listener

Process Requests

UM-Morris

Directory

bondflow distributed workflow over web services student janaka balasooriya
BondFlow: Distributed Workflow over Web Services(Student: JanakaBalasooriya)
  • Web service interface module
  • Proxy object generator module
  • Workflow configuration module
  • Execution module.
  • Mobile Web Services

Web Service Interface Module

Lookup for Web services

Web Services Registry (UDDI)

S

O

A

P

WS Locator

WSDL

WSDL Parser

Parsed WSDL

Workflow Execution Module

Proxy Object Generator Module

Web Bond Runtime

SOAP/ SyD

Workflow Configuration Module

JVM

p2p search based on bayesian decision and value of information voi student rasanjalee

A Priori Uncertainty

: U1

A Posterior Uncertainty

: U2

U1 –U2 = Information

P2P Search based on Bayesian Decision and Value of Information (VOI) – (Student: Rasanjalee)

The meaning of Uncertainty based Information

  • Peer Selection:
    • Sending/forwarding query at each node along query path

= series of decision making steps based on incomplete data

    • A decision step: query the node that will reduce the uncertainty of current belief most.
  • Experimental Results:

The reduction in uncertainty at each decision step

Current Belief

Decision step 1

.

.

.

.

Decision step n

middleware on distributed smart cameras
Middleware on Distributed Smart Cameras
  • Middleware on DSC networks
    • provide a high-level programming interface for applications.
    • simplify the development of distributed applications on DSC networks.
    • provide networking functionality as part of the middleware
  • Student: JayampathiSampat

cmucam3

about me1
About me

Research Area: Parallel and Distributed Algorithms and Systems

- over multicores, GPUs, clusters, sensors, handhelds, web services, …

Lab: Distributed and Mobile Systems (DiMoS)

at Ga. Tech campus, 5 PhD students, 2 M.S. students

  • IEEE TCPP Chair (elected)
  • 2 NSF grants – currently looking for PhD/MS/undergraduate students
    • Distributed Algorithms
    • High Performance Cloud Computing
ad