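Chip Giants Battle for AI Supremacy!

Published: 2023-04-17 22:09
Source: EETOP

AI experts and industry executives, including Elon Musk, recently published an open letter calling for a six-month pause on the development of AI systems more powerful than OpenAI's recent GPT-4. But the hardware companies that power innovations like ChatGPT, and that are vying for AI supremacy, show no sign of slowing down.

Some of the industry's largest computing hardware companies, including NVIDIA, Qualcomm, and Google, have recently made public claims of top-tier device performance.

A previous generation of Google's TPU powering a server room

In this article, we look at some of these latest announcements to assess the companies' claims and better understand the competitive landscape of the AI hardware industry.

Qualcomm Ranks Near the Top in Power Efficiency

This week, Qualcomm announced that its latest MLPerf v3.0 submission leads the power-efficiency category.

Qualcomm's Cloud AI 100. Image courtesy of Qualcomm

The company ran several tests on its Qualcomm Cloud AI 100, including a newly introduced PCIe Lite accelerator. According to Qualcomm, the Cloud AI 100 is designed with a configurable thermal design power (TDP) of 35-55 W and is built for both low power and high performance.

Qualcomm achieved a peak offline ResNet-50 performance of more than 430K inferences per second, surpassing its own previous records for peak offline performance, power efficiency, and latency across all categories. The submission also claims a power efficiency of 241 inferences per second per watt. Qualcomm attributes these gains to software optimizations, such as improvements to its AI compiler, DCVS algorithms, and memory usage.

Google Claims Leadership in Supercomputing

Google also made a major claim of its own this week: the company says its Cloud TPU v4 delivers industry-leading efficiency for large-scale machine learning.

The Tensor Processing Unit (TPU) v4 is Google's fifth-generation domain-specific architecture (DSA) and its third supercomputer designed for training large-scale machine learning models. In a recent ISCA paper, Google engineers describe the TPU v4 system in more detail. Three major features of TPU v4 are its optical circuit switches, hardware support for the embeddings in DLRMs (deep learning recommendation models), and support for all-to-all communication patterns.

A TPU v4 pod (one-eighth section). Image courtesy of Google Cloud

At a high level, TPU v4 delivers exascale machine learning performance with 4,096 chips interconnected through reconfigurable optical circuit switches (OCS). The job of the OCS is to dynamically reconfigure the interconnect topology to improve scale, availability, utilization, power, and performance. This makes it easier to route around failed components and raises performance by changing the topology of the supercomputer's interconnect on the fly. The result is faster ML models. Each TPU v4 also includes SparseCores, dataflow processors that accelerate models that rely on embeddings.

In terms of performance, TPU v4 outperforms TPU v3 by 2.1x on a per-chip basis and improves performance per watt by 2.7x, with a mean power of 200 W.

NVIDIA Still Leads for Now

Although Qualcomm and Google have recently released AI benchmark results, NVIDIA still holds the largest market share of operational AI hardware. In fact, Reuters recently reported that NVIDIA holds 80% of the market for graphics processing units (GPUs), the chips that provide the computing power behind OpenAI's ChatGPT chatbot. AMD follows NVIDIA with about 20% of the market, making it the second-largest player in GPUs.

Although all the major software shops currently use NVIDIA's A100 processors, Google claims its latest-generation TPU is faster and more energy-efficient than the A100, arguing that the most popular option is not always the best-performing one.

Google's reported MLPerf Training 2.0 performance for BERT (top) and ResNet (bottom) compared with the A100 GPU. Image courtesy of arXiv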
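
As a rough sanity check on the TPU v4 figures quoted in this article (2.1x per-chip performance and 2.7x performance per watt relative to TPU v3), the short Python sketch below works out the power ratio those two numbers would imply. It assumes both ratios were measured on the same workloads and at the same level (chip versus whole system), which the article does not state, so the result is illustrative only. The function name `relative_power` is made up for this example.

```python
def relative_power(perf_ratio: float, perf_per_watt_ratio: float) -> float:
    """Power of the new chip relative to the old one.

    Since perf_per_watt = perf / power, dividing the performance ratio
    by the performance-per-watt ratio leaves the power ratio.
    Assumption: both ratios describe the same workloads and scope.
    """
    if perf_per_watt_ratio <= 0:
        raise ValueError("perf_per_watt_ratio must be positive")
    return perf_ratio / perf_per_watt_ratio


if __name__ == "__main__":
    # Figures quoted in the article for TPU v4 versus TPU v3.
    ratio = relative_power(perf_ratio=2.1, perf_per_watt_ratio=2.7)
    print(f"Implied TPU v4 / TPU v3 power ratio: {ratio:.2f}")
```

Under those assumptions, the quoted figures would mean TPU v4 draws roughly three-quarters of the power of TPU v3 while doing about twice the work per chip.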