top of page

GPU ๋น„์šฉ์ด ๋ถ€๋‹ด๋˜์‹œ๋‚˜์š”? AWS ๊ตฌ๋งค ์˜ต์…˜๋ณ„ ์ตœ๋Œ€ 72% ์ ˆ๊ฐํ•˜๋Š” 3๊ฐ€์ง€ ๋ฐฉ๋ฒ•

  • 2์›” 4์ผ
  • 5๋ถ„ ๋ถ„๋Ÿ‰

์ตœ์ข… ์ˆ˜์ •์ผ: 5์›” 15์ผ

GPU ๋น„์šฉ์ด ๋ถ€๋‹ด๋˜์‹œ๋‚˜์š”? ์ตœ๋Œ€ 72% ์ ˆ๊ฐํ•˜๋Š” 3๊ฐ€์ง€ ๋ฐฉ๋ฒ•. Savings Plans vs RI vs Capacity Blocks ๋น„๊ต. ์Šค๋งˆ์ผ์ƒคํฌ.

Written by ์ •์—ฐ์ฃผ | Edited by ์œคํšจ์ •

GPU ์ธ์Šคํ„ด์Šค ๋น„์šฉ์ด ๋„ˆ๋ฌด ๋ถ€๋‹ด์Šค๋Ÿฌ์šด๋ฐ, ์ ˆ๊ฐํ•  ์ˆ˜ ์žˆ๋Š” ๋ฐฉ๋ฒ• ์—†์„๊นŒ์š”?

p5.48xlarge ์ธ์Šคํ„ด์Šค๋ฅผ ์˜จ๋””๋งจ๋“œ๋กœ ์ผ์ฃผ์ผ๋งŒ ์จ๋„ ์•ฝ $3,600 ์ด์ƒ. ์˜ˆ์‚ฐ์ด ํ•œ์ •๋œ ๊ธฐ์—…์—๊ฒŒ GPU ๋น„์šฉ์€ ํ”„๋กœ์ ํŠธ์˜ ๊ฐ€์žฅ ํฐ ์žฅ๋ฒฝ์ž…๋‹ˆ๋‹ค. ์ด ๊ธ€์—์„œ๋Š” GPU ๋น„์šฉ์„ ์ตœ๋Œ€ 72%๊นŒ์ง€ ์ ˆ๊ฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ตฌ๋งค ์˜ต์…˜์„ ๋น„๊ตํ•˜๊ณ , ์ƒํ™ฉ๋ณ„ ์„ ํƒ ๊ฐ€์ด๋“œ๋ฅผ ์ •๋ฆฌํ–ˆ์Šต๋‹ˆ๋‹ค.


โœ… GPU ๋น„์šฉ, ์ œ๋Œ€๋กœ ๊ด€๋ฆฌํ•˜๊ณ  ๊ณ„์‹ ๊ฐ€์š”?

17๊ฐœ ํ•ญ๋ชฉ์œผ๋กœ GPU ๋ˆ„์ˆ˜๋ฅผ ์ ๊ฒ€ํ•ด๋ณด์„ธ์š” โ†’ [ ์ฒดํฌ๋ฆฌ์ŠคํŠธ ๋ฌด๋ฃŒ ๋‹ค์šด๋กœ๋“œ ]


1. GPU ๊ตฌ๋งค ์˜ต์…˜ 4๊ฐ€์ง€, ํ•œ ๋ˆˆ์— ๋ณด๊ธฐ

AWS์—์„œ๋Š” GPU ์ธ์Šคํ„ด์Šค์— ๋Œ€ํ•ด ์—ฌ๋Ÿฌ ๊ตฌ๋งค ์˜ต์…˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ๊ฐ ์˜ต์…˜์˜ ํŠน์ง•์— ๋Œ€ํ•ด์„œ ๋จผ์ € ์•Œ์•„๋ณด๋„๋ก ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค.

๊ตฌ๋งค ์˜ต์…˜

์ตœ๋Œ€ ํ• ์ธ์œจ

์•ฝ์ • ๊ธฐ๊ฐ„

์šฉ๋Ÿ‰ ๋ณด์žฅ

์ ํ•ฉํ•œ ์›Œํฌ๋กœ๋“œ

On-Demand

0%(๊ธฐ์ค€๊ฐ€)

์—†์Œ

์—†์Œ

๋‹จ๊ธฐ ํ…Œ์ŠคํŠธ, ์˜ˆ์ธก ๋ถˆ๊ฐ€ ์›Œํฌ๋กœ๋“œ

Savings Plans

์ตœ๋Œ€ 72%

1๋…„ ๋˜๋Š” 3๋…„

์—†์Œ

์žฅ๊ธฐ์ ์œผ๋กœ ์•ˆ์ •์ ์ธ ์‚ฌ์šฉ๋Ÿ‰

Reserved Instance (Zonal)

์ตœ๋Œ€ 72%

1๋…„ ๋˜๋Š” 3๋…„

๋ณด์žฅ

ํŠน์ • AZ์—์„œ ์žฅ๊ธฐ ์šด์˜

Capacity Blocks

๋™์  ๊ฐ€๊ฒฉ

1์ผ ~ 6๊ฐœ์›”

๋ณด์žฅ

๋Œ€๊ทœ๋ชจ ML ํ•™์Šต, ๋งˆ๊ฐ์ด ์žˆ๋Š” ํ”„๋กœ์ ํŠธ

๐Ÿ’ก์ฐธ๊ณ : Spot Instance๋Š” ์ตœ๋Œ€ 90% ํ• ์ธ์ด ๊ฐ€๋Šฅํ•˜์ง€๋งŒ, 2๋ถ„ ์ „ ์ค‘๋‹จ ๋ฆฌ์Šคํฌ์™€ ์šด์˜ ๋ณต์žก์„ฑ(์ฒดํฌ ํฌ์ธํŠธ ๊ตฌ์„ฑ, Spot Fleet ๊ด€๋ฆฌ ๋“ฑ)์œผ๋กœ ์ธํ•ด ์•ˆ์ •์ ์ธ GPU ์›Œํฌ๋กœ๋“œ์—์„œ๋Š” ๊ถŒ์žฅํ•˜์ง€ ์•Š์•„ ๋ณธ ๋ธ”๋กœ๊ทธ์—์„œ๋Š” ๋‹ค๋ฃจ์ง€ ์•Š์Šต๋‹ˆ๋‹ค.




2. ๊ฐ ์˜ต์…˜ ๋ณ„ ์ƒ์„ธ ๋ถ„์„

1) Savings Plans: ์ตœ๋Œ€ 72% ํ• ์ธ์œผ๋กœ ์œ ์—ฐํ•œ ์•ฝ์ •

Savings Plans๋Š” 1๋…„ ๋˜๋Š” 3๋…„ ๋™์•ˆ ์ผ์ • ๊ธˆ์•ก($/hour)์˜ ์ปดํ“จํŒ… ์‚ฌ์šฉ์„ ์•ฝ์ •ํ•˜๋Š” ๋Œ€์‹  ํ• ์ธ์„ ๋ฐ›๋Š” ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค.

์œ ํ˜•

ํ• ์ธ์œจ

์œ ์—ฐ์„ฑ

์ ์šฉ ๋ฒ”์œ„

Compute Savings Plans

์ตœ๋Œ€ 66%

๋†’์Œ

EC2, Fargate, Lambda ์ „์ฒด

EC2 Savings Plans

์ตœ๋Œ€ 72%

์ค‘๊ฐ„

ํŠน์ • ์ธ์Šคํ„ด์Šค ํŒจ๋ฐ€๋ฆฌ, ๋ฆฌ์ „

SageMaker Savings Plans

์ตœ๋Œ€ 64%

๋†’์Œ

SageMaker ์ „์šฉ

โ€ป 2025๋…„ 6์›” ์—…๋ฐ์ดํŠธ ๋œ ๋‚ด์šฉ:ย AWS๋Š” P4d, P4de, P5, P5en ์ธ์Šคํ„ด์Šค์— ๋Œ€ํ•ด ์˜จ๋””๋งจ๋“œ ๋ฐ Savings Plans ๊ฐ€๊ฒฉ์„ ์ตœ๋Œ€ 45% ์ธํ•˜ํ–ˆ์Šต๋‹ˆ๋‹ค. P6-B200 ์ธ์Šคํ„ด์Šค๋„ Savings Plans๋กœ ๊ตฌ๋งค ๊ฐ€๋Šฅํ•ด์กŒ์Šต๋‹ˆ๋‹ค.

| ์ถœ์ฒ˜: AWS Savings Plans , AWS GPU Price Reduction Announcement


[ ์žฅ์  ]

  • ์•ฝ์ • ๊ธˆ์•ก ์ดˆ๊ณผ ์‚ฌ์šฉ๋ถ„์€ ์˜จ๋””๋งจ๋“œ ์š”๊ธˆ ์ ์šฉ

  • Compute SP๋Š” ์ธ์Šคํ„ด์Šค ํŒจ๋ฐ€๋ฆฌ, ๋ฆฌ์ „, OS ๋ณ€๊ฒฝ ๊ฐ€๋Šฅ

  • ์ž๋™์œผ๋กœ ์ตœ์ ์˜ ํ• ์ธ์œจ ์ ์šฉ

[ ๋‹จ์  ]

  • ์‚ฌ์šฉ๋Ÿ‰์ด ์•ฝ์ •๋ณด๋‹ค ์ ์–ด๋„ ์•ฝ์ • ๊ธˆ์•ก ์ง€๋ถˆ

  • ๊ตฌ๋งค ํ›„ ๋ณ€๊ฒฝ/์ทจ์†Œ ๋ถˆ๊ฐ€

  • ์šฉ๋Ÿ‰์ด ๋ณด์žฅ๋˜์ง€ ์•Š์Œย (ํ• ์ธ๋งŒ ์ œ๊ณต)

2) Reserved Instance: Regional VS. Zonal์˜ ํ•ต์‹ฌ ์ฐจ์ด

Reserved Instance(RI)๋Š” 1๋…„ ๋˜๋Š” 3๋…„ ์•ฝ์ •์œผ๋กœ ์ตœ๋Œ€ 72% ํ• ์ธ์„ ๋ฐ›๋Š” ์ „ํ†ต์ ์ธ ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค. ๋งŽ์€ ๋ถ„๋“ค์ด ๋†“์น˜๋Š” ์ค‘์š”ํ•œ ์ฐจ์ด๊ฐ€ ์žˆ๋Š”๋ฐ์š”, Regional RI์™€ Zonal RI๋Š” ์šฉ๋Ÿ‰ ๋ณด์žฅ ์—ฌ๋ถ€๊ฐ€ ๋‹ค๋ฅด๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค.

๊ตฌ๋ถ„

Regional RI

Zonal RI

์šฉ๋Ÿ‰ ๋ณด์žฅ

X

O

ํ• ์ธ ์ ์šฉ ๋ฒ”์œ„

๋ฆฌ์ „ ๋‚ด ๋ชจ๋“  AZ

์ง€์ •ํ•œ AZ๋งŒ

์ธ์Šคํ„ด์Šค ์‚ฌ์ด์ฆˆ ์œ ์—ฐ์„ฑ

O, ๊ฐ™์€ ํŒจ๋ฐ€๋ฆฌ ๋‚ด ๊ฐ€๋Šฅ

X, ์ง€์ • ์‚ฌ์ด์ฆˆ๋งŒ

๊ฐ€๊ฒฉ

๋™์ผ

๋™์ผ

โ€ป AWS ๊ณต์‹ ๋ฌธ์„œ์— ๋”ฐ๋ฅด๋ฉด:ย 

  • "A regional Reserved Instance does notย reserve capacity."

  • "A zonal Reserved Instance reserves capacity in the specified Availability Zone."

์ฆ‰, Zonal RI๋ฅผ ๊ตฌ๋งคํ•˜๋ฉด ํ•ด๋‹น AZ์—์„œ ์šฉ๋Ÿ‰์ด ๋ณด์žฅ๋ฉ๋‹ˆ๋‹ค.


[ ์žฅ์  ]

  • ๋น„์šฉ ์˜ˆ์ธก ๊ฐ€๋Šฅ (์›”๋ณ„ ์ถœ๋ ์ž„ ์—†์ด ๊ณ ์ •)

  • Zonal RI๋Š” ์šฉ๋Ÿ‰ ๋ณด์žฅ์œผ๋กœ "GPU๊ฐ€ ์•ˆ ๋œจ๋Š” ์ƒํ™ฉ" ๋ฐฉ์ง€

  • ์ตœ๋Œ€ 72% ํ• ์ธ์œผ๋กœ ์žฅ๊ธฐ ์šด์˜ ์‹œ ๋น„์šฉ ํšจ์œจ์ 

[ ๋‹จ์  ]

  • ์ตœ์†Œ 1๋…„ ์•ฝ์ • ํ•„์ˆ˜ โ†’ ๋‹จ๊ธฐ ํ”„๋กœ์ ํŠธ์—๋Š” ๋ถ€๋‹ด

  • ์›ํ•˜๋Š” AZ์— ์ฟผํƒ€๊ฐ€ ์—†์œผ๋ฉด Zonal RI ๊ตฌ๋งค ์ž์ฒด๊ฐ€ ์–ด๋ ค์šธ ์ˆ˜ ์žˆ์Œ

  • ๋Œ€๊ทœ๋ชจ ๋ถ„์‚ฐ ํ•™์Šต์— ํ•„์š”ํ•œ UltraCluster ํ™˜๊ฒฝ ๋ฏธ์ง€์›

3) Capacity Blocks for ML: ์šฉ๋Ÿ‰ ๋ณด์žฅ + ๋‹จ๊ธฐ ์•ฝ์ •์˜ ์กฐํ•ฉ

EC2 Capacity Blocks for ML์€ ๋ฏธ๋ž˜ ํŠน์ • ๊ธฐ๊ฐ„ ๋™์•ˆ GPU ์šฉ๋Ÿ‰์„ ์˜ˆ์•ฝํ•˜๋Š” ์„œ๋น„์Šค์ž…๋‹ˆ๋‹ค. ๊ฐ€๊ฒฉ์€ ์ˆ˜์š”์™€ ๊ณต๊ธ‰์— ๋”ฐ๋ผ ๋™์ ์œผ๋กœ ๊ฒฐ์ •๋˜๋ฉฐ, ์šฉ๋Ÿ‰์ด 100% ๋ณด์žฅ๋ฉ๋‹ˆ๋‹ค.


[์ง€์› ์ธ์Šคํ„ด์Šค (2025๋…„ 1์›” ๊ธฐ์ค€)]

P6 ์‹œ๋ฆฌ์ฆˆ

P6e-GB200, P6-B300, P6-B200 (NVIDIA Blackwell GPU)

P5 ์‹œ๋ฆฌ์ฆˆ

P5en, P5e, P5 (NVIDIA H200/H100 GPU)

P4 ์‹œ๋ฆฌ์ฆˆ

P4d, P4de (NVIDIA A100 GPU)

Trainium

Trn2, Trn1

[ ์žฅ์  ]

  • H100 ๊ฐ™์€ ๊ณ ์„ฑ๋Šฅ GPU๋ฅผ ๋ฏธ๋ฆฌ ํ™•๋ณดํ•ด๋‘๊ณ  ์‹œ์ž‘ย ๊ฐ€๋Šฅ

  • 1๋…„ ์•ฝ์ • ์—†์ด 1์ผ~6๊ฐœ์›”ย ๋‹จ์œ„๋กœ ์œ ์—ฐํ•˜๊ฒŒ ์‚ฌ์šฉ

  • UltraCluster ๋ฐฐ์น˜๋กœ ๋Œ€๊ทœ๋ชจ ๋ถ„์‚ฐ ํ•™์Šต์— ์ตœ์ ํ™”๋œ ๋„คํŠธ์›Œํ‚น ์ œ๊ณต

  • ๋‹น์ผ๋ถ€ํ„ฐ ์ตœ๋Œ€ 8์ฃผ ์ „๊นŒ์ง€ย ์˜ˆ์•ฝ ๊ฐ€๋Šฅ โ†’ ์ค‘์š”ํ•œ ํ•™์Šต ์ผ์ • ํ™•์‹คํ•˜๊ฒŒ ๊ณ„ํš

[ ๋‹จ์  ]

  • ๊ฐ€๊ฒฉ์ด ์ˆ˜์š”/๊ณต๊ธ‰์— ๋”ฐ๋ผ ๋ณ€๋™ โ†’ ์ด์šฉ ์‹œ์ ์— ํ™•์ธ ํ•„์š”

  • ์žฅ๊ธฐ ์ƒ์‹œ ์›Œํฌ๋กœ๋“œ์—๋Š” RI๋‚˜ Savings Plans๋ณด๋‹ค ๋น„ํšจ์œจ์ 

  • ์ง€์› ์ธ์Šคํ„ด์Šค๊ฐ€ ML์šฉ ๊ณ ์„ฑ๋Šฅ GPU(P4d, P4de, P5, P6, Trn)๋กœ ์ œํ•œ์ 

  • ๊ธฐ์กด RI/SP์™€ ์‹ ์ฒญ ๋ฐฉ์‹์ด ๋‹ฌ๋ผ ์ฒ˜์Œ ์‚ฌ์šฉ ์‹œ ๋‹ค์†Œ ์ƒ์†Œํ•  ์ˆ˜ ์žˆ์Œ


๐Ÿ’ก Capacity Blocks ์‹ ์ฒญ์ด ์ฒ˜์Œ์ด๋ผ๋ฉด, ์Šค๋งˆ์ผ์ƒคํฌ์—์„œ ์˜ˆ์•ฝ ํƒ€์ด๋ฐ๋ถ€ํ„ฐ ์‹ ์ฒญ ์ ˆ์ฐจ๊นŒ์ง€ ์ง€์›ํ•ด ๋“œ๋ฆฝ๋‹ˆ๋‹ค


[ ๋น„์šฉ ์ ˆ๊ฐ ์˜ˆ์‹œ ]

p4d.24xlarge ์ธ์Šคํ„ด์Šค ๊ธฐ์ค€์œผ๋กœ, ์˜จ๋””๋งจ๋“œ ๊ฐ€๊ฒฉ์ด ์›” ์•ฝ $16,029์ธ ๋ฐ˜๋ฉด Capacity Blocks๋ฅผ ์ด์šฉํ•˜๋ฉด ์•ฝ $8,496 ์ˆ˜์ค€์œผ๋กœ ์•ฝ 47% ์ ˆ๊ฐ์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. (๋‹จ, ๊ฐ€๊ฒฉ์€ ์ˆ˜์š”/๊ณต๊ธ‰์— ๋”ฐ๋ผ ๋ณ€๋™๋ฉ๋‹ˆ๋‹ค.)


[ AWS ๊ณต์‹ ์‚ฌ์šฉ ์‚ฌ๋ก€ ]

๊ธฐ์—…

ํ™œ์šฉ ๋ฐฉ์‹

Arcee

SLM(Small Language Model) ํ•™์Šต, ์žฅ๊ธฐ ์•ฝ์ • ์—†์ด ์œ ์—ฐํ•œ GPU ํ™•๋ณด

Dashtoon

Stable Diffusion XL ๋“ฑ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ ํ•™์Šต, P5๋กœ P4d ๋Œ€๋น„ 3๋ฐฐ ์„ฑ๋Šฅ ๊ตฌํ˜„

Leonardo.Ai

์ƒ์„ฑํ˜• AI ๋ชจ๋ธ ํ•™์Šต ๋ฐ ์‹คํ—˜, ๋‹ค์–‘ํ•œ ์ธ์Šคํ„ด์Šค๋กœ ์œ ์—ฐํ•˜๊ฒŒ ์ „ํ™˜



3. ์šฉ๋Ÿ‰ ๋ณด์žฅ์ด ํ•„์š”ํ•˜๋‹ค๋ฉด? Zonal RI vs. Capacity Blocks

GPU ์ธ์Šคํ„ด์Šค ๊ตฌ๋งค ์‹œ Zonal RI์™€ Capacity Blocks ๋ชจ๋‘ ์šฉ๋Ÿ‰์„ ๋ณด์žฅํ•˜์ง€๋งŒ, ์ ํ•ฉํ•œ ์ƒํ™ฉ์ด ๋‹ค๋ฆ…๋‹ˆ๋‹ค.

Zonal RI vs Capacity Blocks ์„ ํƒ ๊ฐ€์ด๋“œ ํ”Œ๋กœ์šฐ์ฐจํŠธ. ์‹œ์ž‘์—์„œ '์šฉ๋Ÿ‰ ๋ณด์žฅ์ด ํ•„์š”ํ•œ๊ฐ€์š”?' ์งˆ๋ฌธ์œผ๋กœ ๋ถ„๊ธฐ. No๋ฉด Savings Plans ๊ฒ€ํ† , Yes๋ฉด '1๋…„ ์ด์ƒ ์‚ฌ์šฉ ์˜ˆ์ •์ธ๊ฐ€์š”?' ์งˆ๋ฌธ์œผ๋กœ ์ด๋™. Yes๋ฉด Zonal RI ์ถ”์ฒœ, No๋ฉด Capacity Blocks ์ถ”์ฒœ.

๊ตฌ๋ถ„

Zonal RI

Capacity Blocks

์šฉ๋Ÿ‰ ๋ณด์žฅ

O

O

์•ฝ์ • ๊ธฐ๊ฐ„

1๋…„ ๋˜๋Š” 3๋…„

1์ผ ~ 6๊ฐœ์›”

์ง€์› ์ธ์Šคํ„ด์Šค

๋Œ€๋ถ€๋ถ„์˜ EC2

P4d, P5, P6, Trn ๋“ฑ ML์šฉ ๊ณ ์„ฑ๋Šฅ๋งŒ

๋„คํŠธ์›Œํฌ

์ผ๋ฐ˜

UltraCluster (์ €์ง€์—ฐ, ๊ณ ๋Œ€์—ญํญ)

๊ฐ€๊ฒฉ ๊ตฌ์กฐ

๊ณ ์ • (์ตœ๋Œ€ 72% ํ• ์ธ)

๋™์  (์ˆ˜์š”/๊ณต๊ธ‰์— ๋”ฐ๋ผ ๋ณ€๋™)

์ด๋Ÿฐ ๊ฒฝ์šฐ์—” Zonal RI๋ฅผ ์„ ํƒํ•˜์„ธ์š”!

  • GPU๊ฐ€ 365์ผ ๊ฑฐ์˜ ํ•ญ์ƒ ์ผœ์ ธ ์žˆ์Œ

  • ํŠน์ • AZ์—์„œ ์žฅ๊ธฐ ์šด์˜์ด ํ™•์ •๋จ

  • ๋น„์šฉ์„ ๋งค๋‹ฌ ์˜ˆ์ธก ๊ฐ€๋Šฅํ•˜๊ฒŒ ๊ณ ์ •ํ•˜๊ณ  ์‹ถ์Œ

์ด๋Ÿฐ ๊ฒฝ์šฐ์—” Capacity Blocks๋ฅผ ์„ ํƒํ•˜์„ธ์š”!

  • 1์ฃผ~์ˆ˜๊ฐœ์›”์งœ๋ฆฌย ๋งˆ๊ฐ์ด ์žˆ๋Š” ํ•™์Šต/ํŒŒ์ธํŠœ๋‹ ํ”„๋กœ์ ํŠธ

  • H100/H200 ๋“ฑ ๊ณ ์„ฑ๋Šฅ GPU๋ฅผ ๋ฐ˜๋“œ์‹œ ํ™•๋ณดํ•ด์•ผ ํ•จ

  • ๋Œ€๊ทœ๋ชจ ๋ถ„์‚ฐ ํ•™์Šต์œผ๋กœ ์ €์ง€์—ฐ ๋„คํŠธ์›Œํ‚น์ด ํ•„์š”ํ•จ



4. GPU ์ธ์Šคํ„ด์Šค ํƒ€์ž… ์„ ํƒ ๊ฐ€์ด๋“œ

GPU ์ธ์Šคํ„ด์Šค๋ฅผ ์„ ํƒํ•  ๋•Œ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ๊ฒƒ์€ GPU ๋ฉ”๋ชจ๋ฆฌ(VRAM)์™€ ์›Œํฌ๋กœ๋“œ ํŠน์„ฑ์ž…๋‹ˆ๋‹ค.

1) ์ถ”๋ก /๊ฒฝ๋Ÿ‰ ํ•™์Šต์šฉ - G ์‹œ๋ฆฌ์ฆˆ

์ธ์Šคํ„ด์Šค

GPU

GPU ๋ฉ”๋ชจ๋ฆฌ

์ฃผ์š” ์šฉ๋„

G5

NVIDIA A10G

24GB/GPU

ML ์ถ”๋ก , ๊ทธ๋ž˜ํ”ฝ ๋ Œ๋”๋ง

G6

NVIDIA L4

24GB/GPU

์ถ”๋ก , ๋น„๋””์˜ค ์ฒ˜๋ฆฌ

G6e

NVIDIA L40S

48GB/GPU

LLM ์ถ”๋ก (13B ํŒŒ๋ผ๋ฏธํ„ฐ), ๋Œ€๊ทœ๋ชจ ๊ทธ๋ž˜ํ”ฝ

2) ๋Œ€๊ทœ๋ชจ ํ•™์Šต์šฉ - P ์‹œ๋ฆฌ์ฆˆ

์ธ์Šคํ„ด์Šค

GPU

GPU ๋ฉ”๋ชจ๋ฆฌ

์ฃผ์š” ์šฉ๋„

P4d

NVIDIA A100

40GB/GPU

๋Œ€๊ทœ๋ชจ ML ํ•™์Šต

P4de

NVIDIA A100

80GB/GPU

์ดˆ๋Œ€ํ˜• ๋ชจ๋ธ ํ•™์Šต

P5

NVIDIA H100

80GB/GPU

์ตœ์‹  LLM ํ•™์Šต

P6-B200

NVIDIA Blackwell

192GB/GPU

์ตœ์ฒจ๋‹จ AI ์›Œํฌ๋กœ๋“œ

โ€ป ์ฃผ์˜:ย ๊ฐ€๊ฒฉ์€ ์ธ์Šคํ„ด์Šค ์‚ฌ์ด์ฆˆ์— ๋”ฐ๋ผ ๋‹ค๋ฅด๋ฏ€๋กœ ์›Œํฌ๋กœ๋“œ์— ๋งž๋Š” ๋น„๊ต๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค!



5. ์ด๋Ÿฐ ์‹ค์ˆ˜, ํ•˜๊ณ  ๊ณ„์‹œ์ง„ ์•Š๋‚˜์š”?

AWS MSP ํŒŒํŠธ๋„ˆ๋กœ์„œ ๊ณ ๊ฐ์‚ฌ GPU ์›Œํฌ๋กœ๋“œ๋ฅผ ์ง€์›ํ•˜๋ฉด์„œ ๋ฐœ๊ฒฌํ•œ ํ”ํ•œ ์‹ค์ˆ˜๋“ค์„ ๊ณต์œ ํ•ฉ๋‹ˆ๋‹ค.

์‹ค์ˆ˜ 1: GPU ๋ฉ”๋ชจ๋ฆฌ ์šฉ๋Ÿ‰ ๋ฏธํ™•์ธ

๋ฌธ์ œ: ๋ชจ๋ธ ํ•™์Šต ์ค‘ CUDA Out of Memory ์—๋Ÿฌ ๋ฐœ์ƒ

ํ•ด๊ฒฐ: ์ธ์Šคํ„ด์Šค ์„ ํƒ ์ „ ๋ชจ๋ธ์˜ GPU ๋ฉ”๋ชจ๋ฆฌ ์š”๊ตฌ์‚ฌํ•ญ์„ ํ™•์ธํ•˜์„ธ์š”.

  • G5/G6: 24GB/GPU โ†’ 7B ํŒŒ๋ผ๋ฏธํ„ฐ ๋ชจ๋ธ๊นŒ์ง€ ์ ํ•ฉ

  • G6e: 48GB/GPU โ†’ 13B ํŒŒ๋ผ๋ฏธํ„ฐ ๋ชจ๋ธ๊นŒ์ง€ ์ ํ•ฉ

  • P5: 80GB/GPU โ†’ 70B+ ํŒŒ๋ผ๋ฏธํ„ฐ ๋ชจ๋ธ ํ•™์Šต ๊ฐ€๋Šฅ


์‹ค์ˆ˜ 2: ์ธ์Šคํ„ด์Šค ๋‹น GPU ๊ฐœ์ˆ˜ ๋ฏธํ™•์ธ

๋ฌธ์ œ:ย g5.12xlarge๋ฅผ ์ฃผ๋ฌธํ–ˆ๋Š”๋ฐ ์˜ˆ์ƒ๋ณด๋‹ค ๋น„์šฉ์ด ๋†’์Œ

ํ•ด๊ฒฐ:ย ์ธ์Šคํ„ด์Šค ์‚ฌ์ด์ฆˆ๋ณ„ GPU ๊ฐœ์ˆ˜๋ฅผ ํ™•์ธํ•˜์„ธ์š”.

์ธ์Šคํ„ด์Šค

GPU ๊ฐœ์ˆ˜

์ด GPU ๋ฉ”๋ชจ๋ฆฌ

g5.xlarge

1

24GB

g5.4xlarge

1

24GB

g5.12xlarge

4

96GB

g5.48xlarge

8

192GB

์‹ค์ˆ˜ 3: ๋ฆฌ์ „ ์„ ํƒ ๋ฏธ๊ณ ๋ ค

๋ฌธ์ œ:ย ์›ํ•˜๋Š” GPU ์ธ์Šคํ„ด์Šค์˜ ์ฟผํƒ€๋ฅผ ๋ฐ›์„ ์ˆ˜ ์—†์Œ

ํ•ด๊ฒฐ:ย 

  • ์„œ์šธ ๋ฆฌ์ „(ap-northeast-2)์€ GPU ํƒ€์ž…์ด ์ œํ•œ์ 

  • ์ง€์—ฐ ์‹œ๊ฐ„์ด ๋œ ์ค‘์š”ํ•œ ํ•™์Šต ์›Œํฌ๋กœ๋“œ๋Š” ๋ฏธ๊ตญ ๋ฆฌ์ „(us-east-1, us-east-2) ํ™œ์šฉ ๊ถŒ์žฅ

  • Capacity Blocks๋Š” ํ˜„์žฌ ์ฃผ๋กœ ๋ฏธ๊ตญ ๋ฆฌ์ „์—์„œ ์‚ฌ์šฉ ๊ฐ€๋Šฅ.


์‹ค์ˆ˜ 4: Regional RI ์™€ Zonal RI ํ˜ผ๋™

๋ฌธ์ œ:ย RI๋ฅผ ๊ตฌ๋งคํ–ˆ๋Š”๋ฐ ์šฉ๋Ÿ‰์ด ๋ณด์žฅ๋˜์ง€ ์•Š์Œ

ํ•ด๊ฒฐ: ์šฉ๋Ÿ‰ ๋ณด์žฅ์ด ํ•„์š”ํ•˜๋ฉด ๋ฐ˜๋“œ์‹œ Zonal RI(ํŠน์ • AZ ์ง€์ •)๋กœ ๊ตฌ๋งคํ•˜์„ธ์š”. Regional RI๋Š” ํ• ์ธ๋งŒ ์ œ๊ณตํ•˜๊ณ  ์šฉ๋Ÿ‰์€ ๋ณด์žฅํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.



6. ์šฐ๋ฆฌ ํšŒ์‚ฌ ์ƒํ™ฉ์— ๋งž๋Š” ์„ ํƒ์€?

AWS GPU ์ƒํ™ฉ๋ณ„ ๊ตฌ๋งค ์ „๋žต ์š”์•ฝ. ์Šคํƒ€ํŠธ์—… ์ดˆ๊ธฐ ๋ชจ๋ธ์€ Savings Plans ์ถ”์ฒœ(50~66% ์ ˆ๊ฐ), ํ”„๋กœ๋•์…˜ ์ถ”๋ก  ์„œ๋น„์Šค๋Š” Savings Plans ์ถ”์ฒœ(50~72% ์ ˆ๊ฐ), ๋‹จ๊ธฐ LLM ํ•™์Šต์€ Capacity Blocks ์ถ”์ฒœ(40~50% ์ ˆ๊ฐ), ์žฅ๊ธฐ GPU ์šด์˜์€ Zonal RI ์ถ”์ฒœ(์ตœ๋Œ€ 72% ์ ˆ๊ฐ).

์‹œ๋‚˜๋ฆฌ์˜ค A: ์Šคํƒ€ํŠธ์—…์˜ ์ดˆ๊ธฐ ๋ชจ๋ธ ์‹คํ—˜

  • ์ถ”์ฒœ: Savings Plans (Compute SP) + G5/G6 ์†Œ๊ทœ๋ชจ๋กœ ์‹œ์ž‘

  • ์ด์œ : ์œ ์—ฐํ•œ ์•ฝ์ •์œผ๋กœ ๋น„์šฉ ์ ˆ๊ฐ๊ณผ ์•ˆ์ •์  ์šด์˜ ๊ฐ€๋Šฅ

  • ์˜ˆ์ƒ ์ ˆ๊ฐ: 50~66%

์‹œ๋‚˜๋ฆฌ์˜ค B: ํ”„๋กœ๋•์…˜ ์ถ”๋ก  ์„œ๋น„์Šค

  • ์ถ”์ฒœ: Savings Plans (Compute SP ๋˜๋Š” EC2 Instance SP)

  • ์ด์œ : ์•ˆ์ •์ ์ธ ์‚ฌ์šฉ๋Ÿ‰, ์žฅ๊ธฐ ๋น„์šฉ ์ตœ์ ํ™”

  • ์˜ˆ์ƒ ์ ˆ๊ฐ: 50~72%

์‹œ๋‚˜๋ฆฌ์˜ค C: ๋Œ€๊ทœ๋ชจ LLM ํŒŒ์ธํŠœ๋‹ (ํ”„๋กœ์ ํŠธ ์„ฑ)

  • ์ถ”์ฒœ: Capacity Blocks (P5 ๋˜๋Š” P4d)

  • ์ด์œ : ์šฉ๋Ÿ‰ ๋ณด์žฅ, ๋งˆ๊ฐ ์žˆ๋Š” ํ”„๋กœ์ ํŠธ, 1๋…„ ์•ฝ์ • ๋ถ€๋‹ด ์—†์Œ

  • ์˜ˆ์ƒ ์ ˆ๊ฐ: ์ˆ˜์š”/๊ณต๊ธ‰์— ๋”ฐ๋ผ ๋ณ€๋™ (์•ฝ 40-50% ์ˆ˜์ค€)

์‹œ๋‚˜๋ฆฌ์˜ค D: ํŠน์ • AZ์—์„œ ์žฅ๊ธฐ GPU ์šด์˜

  • ์ถ”์ฒœ: Zonal Reserved Instance

  • ์ด์œ : ์šฉ๋Ÿ‰ ๋ณด์žฅ, ๊ณ ์ •๋œ ํ• ์ธ์œจ

  • ์˜ˆ์ƒ ์ ˆ๊ฐ: ์ตœ๋Œ€ 72%



GPU ๋น„์šฉ, ์ฒด๊ณ„์ ์œผ๋กœ ์ ๊ฒ€ํ•˜๊ณ  ๊ณ„์‹ ๊ฐ€์š”?

์ง€๊ธˆ๊นŒ์ง€ AWS GPU ์ธ์Šคํ„ด์Šค์˜ ์ฃผ์š” ๊ตฌ๋งค ์˜ต์…˜(Savings Plans, Reserved Instance, Capacity Blocks)์„ ๋น„๊ตํ•˜๊ณ , ์ƒํ™ฉ๋ณ„ ์ตœ์ ์˜ ์„ ํƒ ๊ธฐ์ค€์„ ์‚ดํŽด๋ดค์Šต๋‹ˆ๋‹ค.


ํ•˜์ง€๋งŒ ์‹ค์ œ๋กœ ์šฐ๋ฆฌ ํŒ€์ด GPU ๋น„์šฉ์„ ์ž˜ ๊ด€๋ฆฌํ•˜๊ณ  ์žˆ๋Š”์ง€, ๋†“์น˜๊ณ  ์žˆ๋Š” ๋ถ€๋ถ„์€ ์—†๋Š”์ง€ ์ ๊ฒ€ํ•ด๋ณด์‹  ์  ์žˆ์œผ์‹ ๊ฐ€์š”?


์Šค๋งˆ์ผ์ƒคํฌ์—์„œ๋Š” AWS MSP ํŒŒํŠธ๋„ˆ๋กœ์„œ ๋‹ค์–‘ํ•œ ๊ณ ๊ฐ์‚ฌ์˜ GPU ์›Œํฌ๋กœ๋“œ๋ฅผ ์ตœ์ ํ™”ํ•ด์˜จ ๊ฒฝํ—˜์„ ๋ฐ”ํƒ•์œผ๋กœ, GPU ๋น„์šฉ ์ตœ์ ํ™” ์…€ํ”„ ์ฒดํฌ๋ฆฌ์ŠคํŠธ๋ฅผ ์ค€๋น„ํ–ˆ์Šต๋‹ˆ๋‹ค.



โœ… ์ฒดํฌ๋ฆฌ์ŠคํŠธ์—์„œ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋Š” ๋‚ด์šฉ:
โ˜ ์ธ์Šคํ„ด์Šค ์„ ํƒ ์ ๊ฒ€ (GPU ๋ฉ”๋ชจ๋ฆฌ, G vs P ์‹œ๋ฆฌ์ฆˆ ๋“ฑ)
โ˜ ๊ตฌ๋งค ์˜ต์…˜ ์ ๊ฒ€ (Savings Plans, Capacity Blocks ํ™œ์šฉ ์—ฌ๋ถ€)
โ˜ ์šด์˜ ์ตœ์ ํ™” ์ ๊ฒ€
โ˜ ๋ชจ๋‹ˆํ„ฐ๋ง ๋ฐ ๋น„์šฉ ๊ด€๋ฆฌ
 

SmileShark Logo

์Šค๋งˆ์ผ์ƒคํฌ๋Š” 
AWS ํ”„๋ฆฌ๋ฏธ์–ด ํŒŒํŠธ๋„ˆ์ด๋ฉฐ
AI MSP ์„ธ์ƒ์„ ๋งŒ๋“ค์–ด ๊ฐ€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค

์ž์‚ฐ 2premier tier.png
ISMS-P.webp

์ธ์ฆ๋ฒ”์œ„  |  ํด๋ผ์šฐ๋“œ ๋ฉ”๋‹ˆ์ง€๋“œ ์„œ๋น„์Šค ์šด์˜           Cloud MSP

โ€‹์œ ํšจ๊ธฐ๊ฐ„  |  2026-03-05 ~ 2029-03-04     (์‹ฌ์‚ฌ ๋ฐ›์ง€ ์•Š์€ ๋ฌผ๋ฆฌ์  ์ธํ”„๋ผ ๋ฐ SaaS ์„œ๋น„์Šค ์ธํ”„๋ผ ์˜์—ญ ์ œ์™ธ)

์Šค๋งˆ์ผ์ƒคํฌ ์ฃผ์‹ํšŒ์‚ฌ  |  ์‚ฌ์—…์ž๋ฒˆํ˜ธ : 198-87-01516  |  ๋Œ€ํ‘œ์ด์‚ฌ : ์žฅ์ง„ํ™˜      ์„œ์šธ ๊ฐ•๋‚จ๊ตฌ ํ…Œํ—ค๋ž€๋กœ44๊ธธ 5, 8์ธต (๋Œ€์•„๋นŒ๋”ฉ)  |  ๋Œ€ํ‘œ์ „ํ™” : 070-5001-2205 

 |  ์ด๋ฉ”์ผ : contact@smileshark.kr

  • LinkedIn
  • Youtube
  • Facebook
ยฉCopyright
bottom of page