The 4th International Conference on Algorithms and Architecture for Parallel Processing

The 4th International Conference on Algorithms and Architecture for Parallel Processing ICA3PP 2000 Hong Kong December 11-13 2000

会议承办单位 • Deakin University, Australia • City University of Hong Kong, Hong Kong • The Croucher Foundation • IEEE Hong Kong Section xcyin@dislab.nju.edu.cn

1 概述 • 收到来自15个国家的127篇文章 • 每篇文章经至少3人审阅 • 录用60篇 (+4，Special session on High-Performance Data Management +9，poster papers) xcyin@dislab.nju.edu.cn

1 概述(续)——主题 • Architectures, Algorithms and Networks • Parallel Architectures and Parallel I/O Systems • Interconnection Networks and Routing • Parallel Algorithms • Distributed Scheduling and Load Balancing • Systems and Applications xcyin@dislab.nju.edu.cn

1 概述(续)——主题 • Systems and Applications • Tools and Environments for Parallel and Distributed Software Development • High-performance Scientific Computing • Parallel and Distributed Databases • Cluster Computing • Distributed and Parallel Operating Systems and Middleware • Fault-tolerant Computing • Parallel Processing on Web-based Systems xcyin@dislab.nju.edu.cn

1 概述(续)——Guest Speakers • On-line Algorithms for Management of Heterogeneous Resource in Scalable Computing Clusters • Amnon Barak,Hebrew Uni. of Jerusalem, Israel • Parallel Processing and Stochastic Search: An Application in Nonlinear Constrained Optimization • Benjamin Wah, Uni. of Illinois at Urbana-Champaign (President of IEEE CS) • Making Internet, A Parallel Processing Machine, Faster, Cheaper, and Better • Wei Zhao, Texas A&M Uni. xcyin@dislab.nju.edu.cn

2 回顾——并行与分布计算技术的发展 • PVP(或VPP)：Cray YMP 90、NEX SX 3、Fujitsu VP 2000等； • SMP：SGI Power Challenge、Sun SPARC Center 2000、曙光1号等； • MPP：Intel Paragon、IBM SP 2、Cray T3D、曙光1000、曙光2000等； • DSM：Sequent的NUMA-Q、HP的SPP、SGI的Origin系列 • Clusters • Computational Grid • Meta computing xcyin@dislab.nju.edu.cn

2 回顾(续)——结构特性比较 xcyin@dislab.nju.edu.cn

2 回顾(续)——访存模型 DSM xcyin@dislab.nju.edu.cn

3 Cluster——两个侧重点 • 一是如何减少结点机间的通信开销 • 一是使用新的高速网，如ATM、快速Ethernet、以及用户自行设计的专用互联网(如Myrinet) • 二是设计新的精简通信协议，减少传统通信协议的层次，以减少通信开销 • 二是有关计算机机群的工作环境 • 主要研究编程环境、任务调度、负载平衡以及全局资源的管理和使用等 xcyin@dislab.nju.edu.cn

3 Cluster——高效的通信系统 • 机群系统一般使用通用局域网连接 • 目前常用的局域网技术大体可以分成两类： • 一类是共享介质网络，最常见的是10 Mbps或100Mbps的Ethernet； • 另一类是开关网络，如155 Mbps/622 Mbps的ATM、640 Mbps/1.28Gbps的Myrinet和100 Mbps的交换式Ethernet • 目前，通信系统的研究方向主要是在减小往返延迟和提高链路带宽的利用率上，实现方法有精简协议处理、开发新的通信机制和减少系统开销 xcyin@dislab.nju.edu.cn

3 Cluster——并行程序设计环境 • 目前研制的机群系统大多支持PVM和MPI ，在支持语言、容错及工具等方面都不完善 • 需要相应支持工具，比如并行调试器、性能评测工具、并行化辅助工具，它们对程序得开发效率与运行效率都有重要得作用 xcyin@dislab.nju.edu.cn

3 Cluster——多种并行语言的支持 • 已有的机群系统大多支持FORTRAN、C和C++，实现方法： • 主要是使用原有顺序编译器链接并行库函数，比如PVM、MPI， • 加入预编译，比如Multi-thread C，MPC++ • 目前机群系统并行程序设计语言的研究： • 扩展原有顺序语言，提供广泛的并行语言支持，例如，清华大学可扩展机群系统的ADA、MPC++ • 提供全新的并行语言，比如Occam • 研究自动化并行编译方法，直接将顺序程序编译成并行代码，如UIUC的Polaris、Stanford的SUIF、复旦大学的AFT 、南大 xcyin@dislab.nju.edu.cn

3 Cluster——全局资源的管理与利用 • 有效地管理系统中的所有资源是机群系统的一个重要方面，常用的并行编程环境PVM、MPI等对这方面的支持都比较弱，仅提供统一的虚拟机 • 例：UC Berkeley的NOW项目中提出，在一般操作系统(Unix、Linux、Windows NT)之上建立一个全局Unix——GL Unix，以解决机群系统中的所有资源管理，包括组调度、资源分配和并行文件系统 xcyin@dislab.nju.edu.cn

3 Cluster——其他方面的研究 • 广播、多播等全局操作的高效实现 • DSM并行模型的支持 • 并行I/O的研究 xcyin@dislab.nju.edu.cn

3 Cluster——典型机群系统 xcyin@dislab.nju.edu.cn

Efficient deployment of shared memory models on clusters of PCs using the SMiLEing HAMSTER approach Martin Schulz (Germany) • SMiLE(Shared Memory in a LAN-like Environment) • HAMSTER(Hybrid-DSM based adaptive and modular shared memory architecture) xcyin@dislab.nju.edu.cn

Parallel Programming with Object Groups—The TACO Approach J Nolte et al (Japan) • TACO——Topologies and Collections • 分布对象组是一种协调并行活动的有效方法 • TACO是一种利用拓扑类和C++模板的重用在机群系统上进行分布式数据并行处理的纯模板库 xcyin@dislab.nju.edu.cn

A Reduced Communication Protocol for Network of Workstations Weimin Zheng et al • FMP(fast message passing protocol) • 关键技术 • 缓冲区管理 • DMA • 避免死锁 • 局部通信 xcyin@dislab.nju.edu.cn

A Software Development Methodology to Support Distributed Computing Clusters David Levine et al (USA) • PARSA(Prism Parallel Technologies, Inc研制的软件开发环境) • 本文提出对PARSA的改进，以使其适合于利用标准程序设计语言和库来开发分布式机群软件 xcyin@dislab.nju.edu.cn

Parallel and Distributed Knowledge Discovery on the Grid: A Reference Architecture M Cannataro and D Talia (Italy) • 介绍基于Grid的并行与分布式数据挖掘PDKD的概念 • 给出PDKD一个参考结构 xcyin@dislab.nju.edu.cn

INTERESTINGS(1) • 超立方体网络上的多点广播算法 • 洗牌交换网络的可重排性(rearrangeability) • 2D Meshes中容错多点传送Wormhole(虫蚀)路由 • 无全局反馈的并行无线路由 • Anycast服务的QoS路由算法 xcyin@dislab.nju.edu.cn

INTERESTINGS(2) • 求复杂矩阵特征值的动态方程的并行实现 • 微积分代数方程(IDAE)松弛方法加速收敛的并行处理 • 线路图查询的有效并行处理方法 • 基于PC机群的最短路径算法 • 分布式环境中个人身份识别的并行生物统计学计算 • 利用HPF求解时间依赖的Maxwell方程的高效可扩放并行实现 xcyin@dislab.nju.edu.cn

Q&C Method for Solving Large Problems in Fixed Size Processor Array X.C. Yin, L. Xie • 提出一种在固定大小的阵列上解决大问题的分解及调度的一般性方法，对于问题的并行算法以及给定大小的阵列，使用分治技术得到一个调度过程来调度所分解的计算和数据。 xcyin@dislab.nju.edu.cn

The 4th International Conference on Algorithms and Architecture for Parallel Processing

The 4th International Conference on Algorithms and Architecture for Parallel Processing

Presentation Transcript

Parallel Algorithms

Parallel Algorithms

INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING

Parallel Algorithms

Parallel Processing: Architecture Overview

4th INTERNATIONAL CONFERENCE ON MPGDs ( MPGD2015 ) @ TRIESTE

Parallel Algorithms

International Conference on Neural Information Processing

The 4th International Conference on Population and Geographies 2007.07.12

The 4th International Symposium on Organic and

4th International Congress on Endometriosis

Paradigms for Parallel Algorithms

4th International Conference on Retroviral Integration

Parallel Algorithms

Parallel Algorithms

4th BIENNIAL INTERNATIONAL CONFERENCE ON BUSINESS, BANKING AND FINANCE

4th International Conference on Flood Defence

4th International Scientific e-Conference

CSC 2224: Parallel Computer Architecture and Programming Parallel Processing, Multicores

INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING

Parallel Algorithms

Parallel Processing: Architecture Overview