On December 2, we announced the general availability of Amazon Elastic Compute Cloud (Amazon EC2) P5en instances, powered by NVIDIA H200 Tensor Core GPUs and custom 4th generation Intel Xeon Scalable processors with an all-core turbo frequency of 3.2 GHz (max core turbo frequency of 3.8 GHz), available only on AWS. These processors offer 50 percent higher memory bandwidth and up to four times the throughput between CPU and GPU with PCIe Gen5, which boosts performance for machine learning (ML) training and inference workloads.

P5en, with up to 3200 Gbps of third-generation Elastic Fabric Adapter (EFAv3) networking using Nitro v5, shows up to 35 percent lower latency than P5, which uses the previous generation of EFA and Nitro. This improves collective communications performance for distributed training workloads in applications such as deep learning, generative AI, real-time data processing, and high-performance computing (HPC).

Here are the specs for P5en instances:

| Instance size | vCPUs | Memory (GiB) | GPUs (H200) | Network bandwidth (Gbps) | GPU peer-to-peer (GB/s) | Instance storage (TB) | EBS bandwidth (Gbps) |
|---|---|---|---|---|---|---|---|
| p5en.48xlarge | 192 | 2048 | 8 | 3200 | 900 | 8 x 3.84 | 100 |
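The table mixes gigabits per second (network and EBS bandwidth) with gigabytes per second (GPU peer-to-peer), so a quick conversion can make the figures easier to compare. The sketch below is illustrative arithmetic only: the helper names are hypothetical, and it assumes 8 bits per byte and decimal units.

```shell
#!/usr/bin/env bash
# Illustrative unit conversions for the p5en.48xlarge figures in the table.
# Helper names are hypothetical; awk is used because bash has no float math.

# Convert gigabits per second to gigabytes per second (8 bits per byte).
gbps_to_gigabytes_per_sec() {
  awk -v gbps="$1" 'BEGIN { printf "%.1f\n", gbps / 8 }'
}

# Total capacity of N identical drives, in the same unit as the drive size.
total_storage() {
  awk -v n="$1" -v size="$2" 'BEGIN { printf "%.2f\n", n * size }'
}

echo "Network bandwidth: $(gbps_to_gigabytes_per_sec 3200) GB/s"  # 400.0
echo "EBS bandwidth:     $(gbps_to_gigabytes_per_sec 100) GB/s"   # 12.5
echo "Instance storage:  $(total_storage 8 3.84) TB"              # 30.72
```

The storage total of roughly 30 TB matches the local NVMe capacity quoted elsewhere in this post.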
On September 9, we introduced Amazon EC2 P5e instances, powered by eight NVIDIA H200 GPUs with 1128 GB of high-bandwidth GPU memory, 3rd Gen AMD EPYC processors, 2 TiB of system memory, and 30 TB of local NVMe storage. These instances support GPUDirect RDMA and provide up to 3200 Gbps of aggregate network bandwidth with EFAv2, which lowers latency and enables efficient scale-out performance by bypassing the CPU for internode communication.

With P5en instances, you can increase the overall efficiency of a wide range of GPU-accelerated applications by further reducing inference and network latency. Compared with P5 instances, P5en instances increase local storage performance by up to two times and Amazon Elastic Block Store (Amazon EBS) bandwidth by up to 25 percent, which further improves inference latency for those of you who use local storage to cache model weights.

Transferring data between CPUs and GPUs can be time-consuming, especially for large datasets or workloads that require frequent data exchanges. With PCIe Gen 5 providing up to four times the bandwidth between CPU and GPU compared with P5 and P5e instances, you can further improve latency for model training, fine-tuning, and inference for complex large language models (LLMs) and multimodal foundation models (FMs), as well as memory-intensive HPC applications such as simulations, pharmaceutical discovery, weather forecasting, and financial modeling.

Getting started with Amazon EC2 P5en instances

You can use EC2 P5en instances, available in the US East (Ohio), US West (Oregon), and Asia Pacific (Tokyo) AWS Regions, through EC2 Capacity Blocks for ML, On-Demand, and Savings Plan purchase options.
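Before choosing a purchase option, it can help to confirm which Availability Zones in those Regions offer the instance type. The following is a minimal sketch, assuming the standard Region codes for the three Regions named above (us-east-2, us-west-2, ap-northeast-1). The `offering_query` helper is hypothetical and only prints the `aws ec2 describe-instance-type-offerings` command for each Region rather than running it.

```shell
#!/usr/bin/env bash
# Assumed Region codes for US East (Ohio), US West (Oregon), Asia Pacific (Tokyo).
P5EN_REGIONS=(us-east-2 us-west-2 ap-northeast-1)

# Hypothetical helper: print (do not run) the AWS CLI command that lists the
# Availability Zones offering an instance type in the given Region.
offering_query() {
  local region="$1" type="${2:-p5en.48xlarge}"
  printf "aws ec2 describe-instance-type-offerings --region %s --location-type availability-zone --filters Name=instance-type,Values=%s --query 'InstanceTypeOfferings[].Location' --output text\n" \
    "$region" "$type"
}

# Dry run: show one query per Region.
for region in "${P5EN_REGIONS[@]}"; do
  offering_query "$region"
done
```

Once AWS credentials are configured, each printed line can be copied into a terminal and run as is.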
I'd like to show how to use P5en instances with a Capacity Reservation as one option. To reserve an EC2 Capacity Block, choose [Capacity Reservations] in the Amazon EC2 console in the US East (Ohio) AWS Region.

Select [Purchase Capacity Blocks for ML], choose your total capacity, and specify how long you need the EC2 Capacity Block for p5en.48xlarge instances. You can reserve an EC2 Capacity Block for a total of 1-14, 21, or 28 days, and you can purchase it up to 8 weeks in advance.

When you select [Find Capacity Blocks], the lowest-priced available offering that meets your specifications within the date range you specified is returned. After reviewing the EC2 Capacity Block details, tags, and total price information, choose [Purchase].

Your EC2 Capacity Block is now scheduled. The total price of an EC2 Capacity Block is charged up front, and the price doesn't change after purchase. The payment is billed to your account within 12 hours after you purchase the EC2 Capacity Block. To learn more, see Capacity Blocks for ML in the Amazon EC2 User Guide.
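The console steps above can also be approximated from the AWS CLI. The sketch below is a hedged illustration, not the procedure from this post: the helper names and the example dates are hypothetical, and the script only prints an `aws ec2 describe-capacity-block-offerings` command after checking the duration against the values listed above (1-14, 21, or 28 days).

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed only for durations this post lists as reservable.
is_valid_block_days() {
  [[ "$1" =~ ^[0-9]+$ ]] || return 1
  local days=$((10#$1))   # force base 10 so "08" is not read as octal
  (( (days >= 1 && days <= 14) || days == 21 || days == 28 ))
}

# Hypothetical helper: print (do not run) a search for Capacity Block offerings.
# Arguments: instance count, duration in days, start and end of the date range.
capacity_block_search() {
  local count="$1" days="$2" start="$3" end="$4"
  if ! is_valid_block_days "$days"; then
    echo "unsupported duration: $days days" >&2
    return 1
  fi
  local hours=$((10#$days * 24))   # the CLI takes the duration in hours
  printf 'aws ec2 describe-capacity-block-offerings --region us-east-2 --instance-type p5en.48xlarge --instance-count %d --capacity-duration-hours %d --start-date-range %s --end-date-range %s\n' \
    "$count" "$hours" "$start" "$end"
}

# Example with placeholder dates: 16 instances for 2 days.
capacity_block_search 16 2 2025-01-10T00:00:00Z 2025-01-17T00:00:00Z
```

An offering ID returned by that search can then be passed to `aws ec2 purchase-capacity-block --capacity-block-offering-id <id> --instance-platform Linux/UNIX`. As noted above, the purchase is charged up front, so review the offering before running it.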
To run instances within your purchased Capacity Block, you can use the AWS Management Console, AWS Command Line Interface (AWS CLI), or AWS SDKs.

Here is a sample AWS CLI command that runs 16 P5en instances to maximize the benefits of EFAv3. This configuration provides up to 3200 Gbps of EFA networking bandwidth and up to 800 Gbps of IP networking bandwidth with eight private IP addresses:

    $ aws ec2 run-instances --image-id ami-abc12345 \
      --instance-type p5en.48xlarge \
      --count 16 \
      --key-name MyKeyPair \
      --instance-market-options MarketType='capacity-block' \
      --capacity-reservation-specification CapacityReservationTarget={CapacityReservationId=cr-a1234567} \
      --network-interfaces "NetworkCardIndex=0,DeviceIndex=0,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=1,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=2,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=3,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=4,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=5,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=6,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=7,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=8,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=9,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=10,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=11,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=12,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=13,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=14,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=15,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=16,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=17,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=18,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=19,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=20,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=21,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=22,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=23,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=24,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=25,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=26,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=27,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=28,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa" \
        "NetworkCardIndex=29,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=30,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only" \
        "NetworkCardIndex=31,DeviceIndex=1,Groups=security_group_id,SubnetId=subnet_id,InterfaceType=efa-only"
    ...

When launching P5en instances, you can use AWS Deep Learning AMIs (DLAMI), which support EC2 P5en instances. DLAMI provides ML practitioners and researchers with the infrastructure and tools to quickly build scalable, secure, distributed ML applications in preconfigured environments.
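The `--network-interfaces` list in the sample command above follows a regular pattern: 32 network cards, where card 0 uses `DeviceIndex=0` and every other card uses `DeviceIndex=1`, and every fourth card is `efa` while the rest are `efa-only`. As a minimal sketch, the list could be generated rather than typed by hand. The function name is hypothetical, and `security_group_id` and `subnet_id` are the same placeholders used in the command.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print one --network-interfaces spec per network card,
# reproducing the pattern of the sample command (not an official AWS tool).
p5en_network_interfaces() {
  local sg="$1" subnet="$2" cards="${3:-32}"
  local i device type
  for ((i = 0; i < cards; i++)); do
    # Only the first network card uses device index 0.
    if ((i == 0)); then device=0; else device=1; fi
    # Every fourth card carries an IP address (efa); the others are efa-only.
    if ((i % 4 == 0)); then type=efa; else type=efa-only; fi
    printf 'NetworkCardIndex=%d,DeviceIndex=%d,Groups=%s,SubnetId=%s,InterfaceType=%s\n' \
      "$i" "$device" "$sg" "$subnet" "$type"
  done
}

# Collect the specs into an array, one element per interface.
specs=()
while IFS= read -r line; do
  specs+=("$line")
done < <(p5en_network_interfaces security_group_id subnet_id)

echo "${#specs[@]} interfaces generated; first: ${specs[0]}"
```

The array can then be passed as `--network-interfaces "${specs[@]}"` in place of the hand-written list. The eight `efa` entries correspond to the eight private IP addresses mentioned above.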
You can run containerized ML applications on P5en instances with AWS Deep Learning Containers, using libraries for Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS).

For fast access to large datasets, you can use up to 30 TB of local NVMe SSD storage or virtually unlimited, cost-effective storage with Amazon Simple Storage Service (Amazon S3). You can also use Amazon FSx for Lustre file systems with P5en instances to access data at the hundreds of GB/s of throughput and millions of input/output operations per second (IOPS) required for large-scale deep learning and HPC workloads.

Now available

Amazon EC2 P5en instances are available today in the US East (Ohio), US West (Oregon), and Asia Pacific (Tokyo) AWS Regions and the US East (Atlanta) Local Zone us-east-1-atl-2a through EC2 Capacity Blocks for ML, On-Demand, and Savings Plan purchase options. For more information, visit the Amazon EC2 pricing page.

Give Amazon EC2 P5en instances a try in the Amazon EC2 console. To learn more, see the Amazon EC2 P5 instance page, and send feedback to AWS re:Post for EC2 or through your usual AWS Support contacts.

- Channy

The original article is here.