• Apple version - Book door-to-door service - Truly reckless! Russian troops used a TM-62 as a satchel charge and nearly blew themselves up

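This is a moment in which AI's underlying logic is being rebuilt. For years the Transformer architecture has been stuck in an expensive paradox: we spend the most advanced GPU compute on making AI models "memorize by rote" static knowledge that a dictionary lookup could supply. The paper "Conditional Memory via Scalable Lookup", released in the early hours of today by DeepSeek's Liang Wenfeng team and their Peking University collaborators, breaks that deadlock. It proposes a new Engram module that opens a second axis of sparsity, "conditional memory", alongside the established "conditional computation" of MoE. This is more than a technical patch; it is a supply-side reform of a model's "brain capacity". Its claim: once "memory" is separated from "computation", so that what should be memorized goes to a "dictionary" and what should be computed goes to the brain, reasoning ability improves by a counterintuitively large margin. DeepSeek plans to release V4 officially around the Spring Festival in February, and this moment may well be the eve of DeepSeek V4.

Prologue: the "wasted work" of six neural network layers

The story begins with an "MRI scan" that the DeepSeek team ran on the internals of a Transformer. Inside the black box, when a large model sees the phrase "Diana, Princess of Wales", a puzzling and very expensive bout of internal friction takes place. The researchers found that the model uses a full six layers just to recognize this fixed entity:

• Layers 1-2: the model is still working out that "Wales" is probably a country.
• Layer 3: it realizes this is a geographic concept in Europe.
• Layer 4: it starts to piece together that "Princess of Wales" appears to be a title.
• Layer 5: it associates the phrase with "the wife of the Prince of Wales".
• Layer 6: only here does it finally confirm that the phrase refers to the famous "Princess Diana".

To an architect who cares about efficiency, this is a reckless waste of compute. "Princess Diana" is an objectively existing, static entity whose meaning does not change with context. To extract a fact that a dictionary lookup could return, the Transformer spends six layers of expensive matrix operations "rebuilding" the concept. It is like a once-in-a-generation genius who, before tackling any calculus problem, first has to spend half an hour writing out the multiplication table from memory.

This "implicit memory" mechanism forces the model to waste precious parameter capacity and network depth on simple pattern matching. In this 33-page paper, DeepSeek asks a pointed question: why not give the large model a "super dictionary" it can consult at any time?

Chapter 1: Rebuilding the architecture: the brute-force elegance of the Engram module

To solve this problem, DeepSeek proposes a new module called "Engram" (conditional memory). If MoE (mixture of experts) divides the "brain" into regions and lets different experts handle different kinds of thinking (conditional computation), then Engram attaches a huge external "hippocampus" to the brain, dedicated to storing static knowledge (conditional memory).

1. Reviving the "N-gram": an answer from old wisdom

Engram's core inspiration comes from an "ancient artifact" of NLP (natural language processing): the N-gram. Before deep learning took over, language was modeled by counting "the probability that N words occur together". DeepSeek gives this classic idea a modern rework:

• Traditional Transformer: knowledge is spread across the neurons' weights, and extracting it requires passing through complex linear-layer computation, which is costly.
• Engram module: a huge, scalable embedding table. When the model reads a fixed collocation (an N-gram) such as "Zhang Zhongjing" or "the Four Great Inventions", it does not need the cerebral cortex to reason; it uses a hash index to "look up" the corresponding vector in the memory table.

The time complexity of this lookup is O(1). However large the knowledge base grows (even to 100 billion parameters), lookup time stays nearly constant and very short.

2. Three technical moats

If table lookup is this good, why did nobody do it before? Because three obstacles stand in the way: storage explosion, polysemy conflicts, and parameter allocation. DeepSeek offers a textbook-grade solution to each.

A. Vocabulary compression: aggressive deduplication

The number of possible word combinations is astronomical. DeepSeek first applies a step of "lossless compression". At the tokenizer level, it normalizes tokens that mean the same thing but are written differently. For example, "Apple" (capitalized) and "apple" (lowercase) usually refer to the same thing. Merging such tokens through a mapping shrinks the effective vocabulary by 23%. This saves space and also raises the density of stored knowledge.

B. Multi-head hashing: handling "hash collisions"

It is impossible to store every N-gram. Engram uses "multi-head hashing": several hash functions map the unbounded set of N-grams into a finite number of memory slots. Hash collisions still occur (two different N-grams mapped to the same slot), but with the "multi-head" design the model can assemble the correct information from several candidate results, which greatly improves robustness.

C. Context gating: giving memory a "referee"

This is the most elegant part. A lookup table is static; language is not. Take the word "apple": in "eating an apple" it is a fruit, while in "Apple launch event" it is a technology company. A raw table lookup can therefore introduce noise. DeepSeek designs "context-aware gating":

• Query: the hidden state of the current context.
• Key/Value: the static vector returned by the table lookup.

The gate acts like a referee. If the retrieved "static knowledge" does not fit the current "context", the referee pushes the weight down (the gate value tends toward 0) and the model ignores the noise. If it fits perfectly (for example, "Treatise on Cold Damage and Miscellaneous Diseases" followed by "Zhang Zhongjing"), the referee opens the gate (the gate value tends toward 1) and the knowledge is injected directly into the model.

Chapter 2: The golden ratio, or discovering the "U-shaped curve" of AI models

With the architecture designed, the next question is how to split the inheritance. Suppose GPU memory is limited and the total parameter budget is fixed. How many parameters should go to MoE "experts" (responsible for computation), and how many to the Engram "dictionary" (responsible for memory)? This is a classic resource-allocation trade-off. The DeepSeek team ran a large-scale ablation, sweeping the allocation ratio from 0% to 100%, and the results trace a clean "U-shaped scaling-law curve". The plot reveals an underlying rule of model design:

• Left extreme (pure Engram): if all parameters go to the dictionary, the loss is high. The model becomes a "bookworm" that can recite but cannot reason.
• Right extreme (pure MoE): if all parameters go to the experts, the loss is also high. The experts are forced to spend their effort on reciting (memorizing static knowledge) and have no capacity left for real work.
• Golden point (ρ ≈ 75%-80%): when roughly 20%-25% of the sparse parameter budget goes to Engram and the rest to MoE, the validation loss reaches its minimum.

This finding has real practical value: for large models with tens of billions of parameters, simply stacking more compute units (MoE experts) already shows diminishing returns, and a dedicated static memory module is needed to reach a "storage-compute balance".

Chapter 3: A counterintuitive jump: why does "looking things up" raise "math scores"?

If Engram only gave the model a "better memory", the paper would not be enough to shake the community. After all, RAG (retrieval-augmented generation) can also address knowledge problems. What really struck the field were the unexpected gains in the experimental results. DeepSeek built three models for comparison, holding the number of activated parameters (3.8B) and the amount of training data (262B tokens) exactly the same:

• Dense-4B: a traditional dense model.
• MoE-27B: a pure MoE model (72 experts).
• Engram-27B: a hybrid model (55 experts + 5.7B Engram parameters).

The results were surprising.

1. Expected: knowledge tasks lead the board

On MMLU (general knowledge) the Engram model gains 3.4 points; on CMMLU (Chinese knowledge) it gains 4.0 points. This is easy to understand: with an external dictionary, knowledge improves and hallucinations decrease.

2. Unexpected: logic, code, and math all rise sharply

In principle, "looking things up" has nothing to do with "solving math problems". Yet on BBH (general reasoning), Engram-27B beats the pure MoE baseline with the same parameter count by a full 5.0 points.

• MATH (mathematics): +2.4 points.
• HumanEval (code generation): +3.0 points.
• ARC-Challenge (complex reasoning): +3.7 points.

3. In depth: the Effective Depth theory

Why? How can a "rote memorization" module raise intelligence? The DeepSeek team used LogitLens and CKA (centered kernel alignment) to "dissect" the model's internals, and found something striking. Remember "Princess Diana" from the prologue? In the pure MoE model, the first several layers are busy "piecing concepts together". In the Engram model, because an Engram module is inserted as early as layer 2, retrieval of static knowledge finishes at a very early stage. The early layers that used to be spent on "rote memorization" are therefore freed up, which is equivalent to giving the model extra effective depth.

The freed layers and attention heads no longer need to handle trivial local dependencies (such as recognizing who "Zhang Zhongjing" is), so they can devote themselves to more complex global reasoning, long-range logic construction, and code generation. The essence of Engram is not to "replace" reasoning, but to "offload" the chores so the brain can focus on higher-level thinking.

Chapter 4: An engineering feat: breaking Nvidia's "memory hegemony"

For Wall Street investors and the operators of compute centers, the most attractive part of this paper is not the score but the cost. In the AI era, the most expensive resource is not compute (FLOPs) but GPU memory (HBM). The Nvidia H100 is expensive largely because of its scarce HBM. Engram brings a disruptive property: a thorough separation of storage from compute.

1. MoE's pain point: a memory devourer

In a traditional MoE model, routing is dynamic. The model must first compute the features of the current token, and only after finishing one layer does it know which expert the next layer should call. This means all experts must sit on standby in expensive GPU memory at all times, ready on demand.

2. Engram's breakthrough: deterministic foresight

Engram's lookup logic is deterministic. Once the input text is fixed (for example, "A New Axis of Sparsity"), its N-gram indices are fixed too. There is no need to wait for the model to finish the previous layer: the moment a token enters the model, we already know which row of which table it needs.

3. The CPU strikes back: fitting the large model into RAM sticks

This property pays a large engineering dividend:

• Offload: Engram tables with tens of billions or even hundreds of billions of parameters can be placed directly in cheap, plentiful, easily expanded CPU memory (DRAM), or even on NVMe SSDs.
• Prefetching: while the GPU is busy computing the previous Transformer layer, the CPU uses the PCIe link to asynchronously "prefetch" the memory data the next layer needs and push it to the GPU. Latency is hidden and the work proceeds in parallel.

DeepSeek's measurements show that even with a 100B-parameter Engram table held in CPU memory, throughput drops by less than 3% compared with pure GPU inference. For anyone anxious about not being able to buy HBM, this is a very welcome conclusion. It means the "memory capacity" of future large models can keep growing at low cost without being throttled by Nvidia's GPU memory.

Chapter 5: A win for long context: a leap on the NIAH test

Beyond general reasoning, Engram's performance on long context also demonstrates the value of this "division of labor". In long-context processing, the window of the attention mechanism is limited. If attention is consumed by large amounts of local information (such as fixed short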
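The lookup path described in Chapter 1 (token normalization, multi-head hashing into a fixed number of slots, then a context-aware gate) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not DeepSeek's implementation: the table sizes, the BLAKE2 hash, the concatenation of per-head vectors, and the sigmoid-of-scaled-dot-product gate are all choices made here for clarity, and every name (`normalize`, `slot_index`, `lookup`, `gate`, `inject`) is hypothetical.

```python
import hashlib
import math
import random

DIM = 8                      # width of the hidden state (illustrative)
NUM_HEADS = 4                # independent hash functions ("heads")
HEAD_DIM = DIM // NUM_HEADS  # each head contributes a slice of the memory vector
SLOTS = 1024                 # slots per head; far fewer than the possible N-grams

rng = random.Random(0)
# One small embedding table per head (random here; learned in a real model).
tables = [
    [[rng.gauss(0.0, 1.0) for _ in range(HEAD_DIM)] for _ in range(SLOTS)]
    for _ in range(NUM_HEADS)
]


def normalize(token):
    """Toy stand-in for tokenizer-level normalization ('Apple' -> 'apple')."""
    return token.strip().lower()


def slot_index(ngram, head):
    """Deterministically hash an N-gram (tuple of strings) to a slot for one head."""
    key = f"{head}|" + "\x1f".join(ngram)
    digest = hashlib.blake2b(key.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big") % SLOTS


def lookup(tokens):
    """O(1) retrieval: one table read per head, concatenated into a DIM-wide vector."""
    ngram = tuple(normalize(t) for t in tokens)
    memory = []
    for head in range(NUM_HEADS):
        memory.extend(tables[head][slot_index(ngram, head)])
    return memory


def gate(hidden, memory):
    """Assumed gate: sigmoid of a scaled dot product between context and memory."""
    score = sum(h * m for h, m in zip(hidden, memory)) / math.sqrt(len(hidden))
    if score >= 0:  # numerically stable sigmoid
        return 1.0 / (1.0 + math.exp(-score))
    e = math.exp(score)
    return e / (1.0 + e)


def inject(hidden, memory):
    """Add the retrieved vector to the hidden state, scaled by the gate value."""
    g = gate(hidden, memory)
    return [h + g * m for h, m in zip(hidden, memory)]


memory = lookup(["Zhang", "Zhongjing"])
matching_context = [10.0 * m for m in memory]   # context that agrees with the memory
clashing_context = [-10.0 * m for m in memory]  # context that contradicts it
print(gate(matching_context, memory), gate(clashing_context, memory))
```

Because the slot indices depend only on the input tokens, they can be computed before any layer runs, which is the property Chapter 4 relies on for prefetching. With a matching context the gate value sits above 0.5 and the memory is injected; with a clashing context it falls below 0.5 and the retrieved vector is mostly suppressed. Two different N-grams can still collide in one head, but they are unlikely to collide in all heads at once.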

    • Updated: 2026-01-13 23:51
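    Usage guidelines for 302 redirects in spider pools

    For a professional webmaster in the SEO industry, it is important to understand the principles and uses of spider pool programs. A spider pool is a tool that simulates search-engine spiders crawling web pages: it can simulate multiple spiders visiting a site at the same time and collect the information on that site. In SEO work, a spider pool program helps webmasters see how search engines access their site so they can optimize accordingly.

    How a spider pool program works

    A spider pool program works mainly by simulating multiple spiders visiting a site at the same time and collecting the information on it. In practice, a webmaster can configure the program to simulate the spiders of different search engines, such as Google and Bing, to see how each engine accesses the site. From the collected data, the webmaster can analyze the site's search rankings and which pages have been indexed, and use that to guide SEO work.

    Uses of a spider pool program

    Spider pool programs have a wide range of uses in SEO. First, they let the webmaster observe how search-engine spiders access the site and detect cases where the site has been blocked or demoted. Second, they can monitor the site's indexing status and reveal pages that have not been indexed or were missed. Finally, they can track the rankings of the site's keywords so the optimization strategy can be adjusted in real time.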



    (Editors: 陈奕裕, 邓伟翔)
