Slide 1: Panthera: Holistic Memory Management for Big Data Processing over Hybrid Memories

Chenxi Wang, Huimin Cui, Ting Cao, John Zigman, Haris Volos, Onur Mutlu, Fang Lv, Xiaobing Feng, Guoqing Harry Xu
Slide 2: Big Data Workloads
- Big data workloads (e.g., Spark MLlib) are written in managed languages
- Current memory: DRAM
  - Accounts for 40% of total energy consumption
  - Capacity starts to hit its limit
Slide 3: Non-Volatile Memory (NVM)
- Byte-addressable memory material
- Pros:
  - Higher memory capacity and density
  - Lower price
  - Negligible background energy consumption
- Cons:
  - Increased read/write latency
  - Reduced bandwidth
Slide 4: Hybrid Memory: DRAM + Non-Volatile Memory (NVM)
- Current memory architecture: Cache → DRAM → SSD/HD
- Hybrid memory architecture: Cache → DRAM + Non-Volatile Memory → SSD/HD
- Divide data into hot, warm, and cold, and place it into the hybrid memory accordingly
Slide 5: Hybrid Memory Management for Big Data: Opportunities & Challenges
Slide 6: Current Solution for Hybrid Memory Management
- Divide the Java heap (young generation + old generation) into a DRAM area and an NVM area*
- Profile and migrate frequently accessed objects to DRAM*
- Problem: significant online profiling overhead!

[*] Write-rationing garbage collection for hybrid memories, Akram et al., PLDI'18
Slide 7: Data Characteristics in Big Data Systems
- Spark memory management (an application-level memory subsystem):
  - Execution memory: temporary RDDs
  - Storage memory: frequently used RDDs
  - Off-heap memory: fault-tolerance RDDs
- Spark distributed data collections (RDDs):
  - Different RDDs have different, clearly defined access patterns
  - Managed at a coarse granularity
Slide 8: Working with Big Data Characteristics
- Use the characteristics of RDDs to do coarse-grained data division
- Objects within one RDD share the same access pattern and lifetime => saves a lot of profiling overhead!
- ❌ Problem: the runtime only sees Java objects; it cannot see the RDD-level semantics (e.g., which objects belong to RDD #1 vs. RDD #2)
Slide 9: Design
Slide 10: Panthera: A Holistic Memory Management System for Big Data Systems
- Spans the whole stack: Spark applications → runtime (OpenJDK) → physical memory (DRAM + NVM)
- Data profiling: static inference plus coarse-grained dynamic analysis of RDDs (e.g., RDD #1, RDD #2)
- Maps Java heap space to physical NVM/DRAM
Slide11var links
=
ctx.textFile..persist()
for
(
i
<-
1 to
iters
){
.....
var contribs
=
links.join(..)..persist()
.....
}
Static
Inference
of
RDD
Memory
Tags
Infer
the
access
frequency
of
RDD
by
def-use
analysis
Mark Hot RDD
with DRAM tagMark Cold
RDD
with
NVM
tag
PageRank
Data
intensive
applications
Manage
data
in
coarse-grained
manner
i.e., RDDData
access pattern and lifetime are statically
observed
DRAMNVM
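To make the def-use idea concrete, here is a minimal standalone sketch (not Panthera's actual implementation; the loop weight and threshold are illustrative assumptions) that scores an RDD definition by how often it is used, counting uses inside loops once per expected iteration:

```java
// Hypothetical sketch of def-use-based hotness inference. The iteration
// weight and hot threshold are made-up values, not Panthera's real ones.
public class RddDefUseSketch {
    public enum Tag { DRAM, NVM }

    // Score a definition by its uses; a use inside a loop counts once per
    // expected iteration. A score at or above the threshold means "hot".
    public static Tag infer(int usesOutsideLoops, int usesInsideLoops,
                            int expectedIterations, int hotThreshold) {
        int score = usesOutsideLoops + usesInsideLoops * expectedIterations;
        return score >= hotThreshold ? Tag.DRAM : Tag.NVM;
    }

    public static void main(String[] args) {
        // "links" in PageRank is joined once per iteration -> hot -> DRAM
        System.out.println("links:    " + infer(1, 1, 10, 5));
        // "contribs" is created and consumed within one pass -> cold -> NVM
        System.out.println("contribs: " + infer(1, 0, 10, 5));
    }
}
```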
Slide 12: Pass DRAM/NVM Tags via GC
- It is not practical for a static analysis to find and mark all the objects within an RDD
- But the GC already traces all live objects from the roots
- So: utilize the GC to propagate the DRAM/NVM tags from RDD root objects to all reachable data objects
- => Zero online profiling overhead!
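The propagation step can be pictured as an ordinary GC graph trace that also stamps tags. This is a minimal sketch with a toy object model (`Obj` and `Tag` are illustrative stand-ins, not OpenJDK internals):

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Collections;
import java.util.Deque;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Set;

// Toy model of tag propagation piggybacked on a GC-style trace.
public class TagTraceSketch {
    public enum Tag { NONE, DRAM, NVM }

    public static class Obj {
        public Tag tag = Tag.NONE;
        public final List<Obj> fields = new ArrayList<>();
    }

    // Breadth-first trace from a tagged RDD root object: every reachable,
    // not-yet-tagged object inherits the tag of the object that reached it.
    public static void propagate(Obj root) {
        Set<Obj> seen = Collections.newSetFromMap(new IdentityHashMap<>());
        Deque<Obj> work = new ArrayDeque<>();
        seen.add(root);
        work.add(root);
        while (!work.isEmpty()) {
            Obj o = work.poll();
            for (Obj f : o.fields) {
                if (seen.add(f)) {
                    if (f.tag == Tag.NONE) f.tag = o.tag;
                    work.add(f);
                }
            }
        }
    }
}
```

Because the trace visits each live object exactly once anyway, stamping a tag adds essentially no extra work per object, which is what the slide's "zero online profiling overhead" claim rests on.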
Slide 13: Dynamic Profiling of RDD Method Invocations
- Low-overhead dynamic profiling mechanism: monitor the number of function invocations on RDD root objects
- Update the tags of the RDD root objects, and propagate them to the other objects, during major GC
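A minimal sketch of such a counter (the threshold and the String tags are illustrative assumptions; the real system keeps this state in the runtime):

```java
import java.util.concurrent.atomic.AtomicLong;

// Sketch of low-overhead dynamic profiling: count method invocations on an
// RDD root object, then re-tag it at each major GC. Threshold is made up.
public class RddInvocationProfile {
    private final AtomicLong invocations = new AtomicLong();
    private String tag = "NVM";

    // Cheap hot-path hook, called on every RDD method invocation.
    public void onInvocation() { invocations.incrementAndGet(); }

    // Called once per major GC: decide the tag for the next window and
    // reset the counter. The GC then propagates the new tag to the
    // objects reachable from this root.
    public void onMajorGc(long hotThreshold) {
        tag = invocations.getAndSet(0) >= hotThreshold ? "DRAM" : "NVM";
    }

    public String tag() { return tag; }
}
```

Folding the tag update into major GC is what keeps this "low overhead": the expensive part (re-tagging reachable objects) happens only when a full trace is running anyway.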
Slide 14: Data Placement in Panthera
- Placement is based on the DRAM/NVM tags
- Java heap: the young generation is mapped to DRAM; the old generation spans both DRAM and NVM
- The GC moves objects into the region matching their tag
Slide 15: Runtime Optimizations
- Utilize application-level semantics to do runtime optimizations:
  - Eager promotion of RDD data objects
  - Big-array optimization: alignment padding
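The alignment-padding idea can be sketched as rounding a big array's start address up to a boundary (the 4 KiB boundary below is an assumption, not a value from the talk) so the whole array lands in one physically mapped region:

```java
// Sketch of alignment padding for big arrays. The 4 KiB boundary is an
// illustrative assumption; the point is that padding the allocation start
// lets one large array be mapped wholly to DRAM or wholly to NVM.
public class AlignPadSketch {
    public static final long BOUNDARY = 4096; // must be a power of two

    // Round addr up to the next BOUNDARY multiple (no-op if aligned).
    public static long alignUp(long addr) {
        return (addr + BOUNDARY - 1) & ~(BOUNDARY - 1);
    }

    // Padding bytes inserted before the array so its data starts aligned.
    public static long paddingFor(long addr) {
        return alignUp(addr) - addr;
    }
}
```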
Slide 16: Evaluation
Slide 17: Our Hybrid Memory Emulator Supports Big Data Apps
- Emulator design follows Quartz*: the host CPU's local DRAM serves as DRAM, while a remote CPU's DRAM, accessed over QPI (QuickPath Interconnect) and throttled via the thermal register, serves as emulated NVM
- Runs full big data applications

Emulated memory parameters:

                      DRAM    NVM
    Latency (ns)       120    300
    Bandwidth (GB/s)    30     10

* Quartz: A Lightweight Performance Emulator for Persistent Memory Software, Volos et al., Middleware'15
Slide 18: Experiment Setup
- Baseline: DRAM only
- Comparisons: Panthera vs. Unmanaged
- Unmanaged: young generation mapped on DRAM; old generation mapped on DRAM and NVM, interleaved at a specific ratio (e.g., 1/3 DRAM), which is at least as good as Write-Rationing GC*

[*] Write-rationing garbage collection for hybrid memories, Akram et al., PLDI'18
Slide 19: Overall Results – Performance Overhead
- 64 GB heap, DRAM/Memory = 1/3: Panthera has only 4% performance overhead
- Average (normalized to DRAM only): Unmanaged 1.21, Panthera 1.04
Slide 20: Overall Results – Energy Consumption
- 64 GB heap, DRAM/Memory = 1/3: Panthera saves 32% energy consumption
- Average (normalized): 0.73
Slide 21: GC Performance
- 64 GB heap, DRAM/Memory = 1/3: Panthera has only 1% GC performance overhead
- Unmanaged has 59% GC performance overhead
Slide 22: Mutator (Computation) Performance
- 64 GB heap, DRAM/Memory = 1/3: Panthera has 3% mutator performance overhead
- Unmanaged has 6% mutator performance overhead
Slide 23: Conclusions
- Panthera: a holistic memory management system for hybrid memory, designed for big data applications
- Uses application-level semantics for coarse-grained data placement
- Reduces energy consumption significantly at a small time cost
Slide 24: Thanks! Q & A