200 likes | 383 Views
Understanding the role of the Power delivery network in 3-d stacked memory devices. Manjunath Shevgoor , Niladrish Chatterjee , Rajeev Balasubramonian , Al Davis University of Utah. Jung- Sik Kim DRAM Design Team, Samsung Electronics. Aniruddha N. Udipi ARM R&D. Background .
E N D
Understanding the role of the Power delivery network in 3-d stacked memory devices Manjunath Shevgoor, NiladrishChatterjee, Rajeev Balasubramonian, Al Davis University of Utah Jung-Sik Kim DRAM Design Team, Samsung Electronics Aniruddha N. Udipi ARM R&D
Background • Only part of the Supply Voltage reaches the circuit elements • This loss of Voltage over the Power Delivery Network (PDN) is called IR -Drop DIE 1 • 3D stacking increases current density • Top layer in a 9 high stack needs to go through 8 TSV layers • IR Drop violations can lead to correctness issues DIE 2 DIE 3 Wire Resistance Circuit Element B A 1.5V Voltage along Wire A-B GND 1.5V 1.2V
Addressing IR Drop • Reduce Resistance • Make wires wider • Add more VDD/VSS bumps • Increases Cost • Reduce current • Control Activity on chip • Decreases Performance • This paper tries to reduce the performance impact without increasing cost Relationship between pin count and package cost Source: Dong el al. Fabrication Cost Analysis and Cost-Aware Design Space Exploration for 3-D ICs
A big shift going forward • Current limiting constraints already exist • DDR3 uses tFAW and tRRD • Recent work on PCM (Hay et al.) using Power Tokens to limit PCM current draw • These solutions use Temporal Constraints, which are not optimal to address IR Drop in 3D DRAM • Quality of Power Delivery also depends on location on die • We propose Spatial Constraints to leverage this disparity
DRAM Layout – Spatial Dependence VDD on M1 on Layer 9 Y Coordinate VDD X Coordinate • We use an HMC like architecture for our evaluation • IR Drop worsens as distance from TSVs increase
IR Drop Profile Layer 2 Layer 3 Layer 4 Layer 5 Layer 6 Layer 7 Layer 8 Layer 9 • Figures illustrate IR Drop when all banks in the 3D stack are executing ACT • IR Drop worsens as the distance from the source increases
Iso-IR Drop Regions • IR Drop worsens as distance from TSVs increase • IR Drop also worsens as height from C4 bumps increase • We define activity Constraints on a region by region basis
DRAM Currents • Read Consumes the highest current of any DRAM command • To keep design complexity down, we define all other currents in terms of READS Source: Micron Data Sheet for 4Gb x16 part
Region based Read constraints • Different Regions have very different IR Drop characteristics • To not be constrained by the worst region, we determine max. number of Reads supported by each Region • IR Drop in any region is not determined by just the activity in that region • Memory controller complexity increases with the number of ‘Regions’
Region based Read constraints • To keep the memory controller simple, four kinds of constraints are created for Reads • Single Region Constraints- Assume all Reads happen in only Region • Two Region Constraints- Assume all Reads happen in only two adjacent regions • Four Region Constraints- Assume all Reads happen in either top four or bottom four dies • Die Stack wide constraint- Reads can be happening any where in the die stack
Read Based constraints • To limit controller complexity, we define ACT, PRE and Write constraints in terms of Read • The Read-Equivalent is the min. number of ACT/PRE/WR that cause the same IR-Drop as the Read with the least IR-Drop in that Region
Proposals1- Controlling Starvation • As long as Bottom Regions are serving more than 8 Reads, Top Regions can never service a Read • Requests mapped to Top regions suffer • Prioritize Requests that are older than N* Avg. Read Latency (N is empirically determined to be 1.2 in our simulations)
Case Study– Page Placement • Profile Applications to find out the most accessed pages • Map most accessed pages to the most IR Drop resistant regions (Bottom Regions) • The profile is divided into 8 sections. The 4 most accessed sections are mapped to Bottom regions • The rest are mapped to C_TOP, B_TOP, D_TOP, A_TOP, in that order
Modeling IR Drop • Power assigned to each block is assumed to be distributed evenly over the block • Current sources are used to model the power consumption • More details in the paper Source: Sani R. Nassif, Power Grid Analysis Benchmarks
Methodology • HMC based memory system • Simics coupled with augmented USIMM • SPEC CPU 2006 mp
Results • With All Constraints, (Real PDN) performance falls by 4.6X • With Starvation management, gap is reduced to 1.47X • Profiled Page Placement with Starvation Control is within 15.35% of unrealistic Ideal PDN
Future Work • We present a Case study, which explores the performance improvement of IR Drop aware page placement • A realistic page placement/migration scheme is required to leverage the disparity in IR Drop tolerance of different regions • Metrics other than number of page accesses might be more appropriate when identifying critical pages • Prioritizing requests to Bottom regions could help overcome the detrimental effects of the Top regions • Exploring the complexity – performance tradeoff for memory controllers
Conclusions • This paper presents the problem of problem IR Drop in 3D DRAM • We reduce the impact of worst case IR Drop by introducing Region based constraints • Memory controller complexity is limited by simplifying IR Drop constraints • By addressing both the spatial and the temporal aspects of the problem, we achieve performance that is very close to that of the Ideal PDN