ylai@lemmy.mlEnglish · 1 year agoInside the Matrix: Visualizing Matrix Multiplication, Attention and Beyondplus-squarepytorch.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkInside the Matrix: Visualizing Matrix Multiplication, Attention and Beyondplus-squarepytorch.orgylai@lemmy.mlEnglish · 1 year agomessage-square0fedilink
CanadaPlus@lemmy.sdf.orgEnglish · 1 year agoWhat is the state of the art on putting text samples into a latent space?message-squaremessage-square2fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1message-squareWhat is the state of the art on putting text samples into a latent space?CanadaPlus@lemmy.sdf.orgEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoAttention Is All You Needplus-squarelemmy.intai.techexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkAttention Is All You Needplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoAttention Is Off By Oneplus-squarewww.evanmiller.orgexternal-linkmessage-square0fedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkAttention Is Off By Oneplus-squarewww.evanmiller.orgmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Modelsplus-squarelemmy.intai.techexternal-linkmessage-square1fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Modelsplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square1fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoPersonality Traits in Large Language Modelsplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imagePersonality Traits in Large Language Modelsplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLarge Language Models as General Pattern Machinesplus-squarelemmy.intai.techexternal-linkmessage-square2fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkLarge Language Models as General Pattern Machinesplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoLarge Language Models as Tool Makersplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageLarge Language Models as Tool Makersplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLanguage models can explain neurons in language modelsplus-squareopenai.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkLanguage models can explain neurons in language modelsplus-squareopenai.commanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
taters@lemmy.intai.techEnglish · edit-21 year agoCurious Replay for Model-based Adaptationplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1imageCurious Replay for Model-based Adaptationplus-squarelemmy.intai.techtaters@lemmy.intai.techEnglish · edit-21 year agomessage-square0fedilink
taters@lemmy.intai.techEnglish · edit-21 year agoThe imperative for regulatory oversight of large language models (or generative AI) in healthcareplus-squarewww.nature.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkThe imperative for regulatory oversight of large language models (or generative AI) in healthcareplus-squarewww.nature.comtaters@lemmy.intai.techEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoMicrosoft Announces: LongNet - Scaling LLM Transformers to 1,000,000,000 Tokens & Context Lengthplus-squaremessage-squaremessage-square2fedilinkarrow-up17arrow-down10
arrow-up17arrow-down1message-squareMicrosoft Announces: LongNet - Scaling LLM Transformers to 1,000,000,000 Tokens & Context Lengthplus-squaremanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLarge Language Models Enable Few-Shot Clusteringplus-squarelemmy.intai.techexternal-linkmessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkLarge Language Models Enable Few-Shot Clusteringplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoPreference Ranking Optimization for Human Alignmentplus-squarelemmy.intai.techexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkPreference Ranking Optimization for Human Alignmentplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoPushing the Limits of Machine Design Automated CPU Design with AIplus-squarelemmy.intai.techexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkPushing the Limits of Machine Design Automated CPU Design with AIplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoIs ChatGPT A Good Translator? Yes With GPT-4 As The Engineplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up17arrow-down10
arrow-up17arrow-down1imageIs ChatGPT A Good Translator? Yes With GPT-4 As The Engineplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoSequenceMatch - Imitation Learning for Autoregressive Sequence Modelling with Backtrackingplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1imageSequenceMatch - Imitation Learning for Autoregressive Sequence Modelling with Backtrackingplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoThe RefinedWeb Dataset for Falcon LLM - Outperforming Curated Corpora with Web Data, and Web Data Onlyplus-squarelemmy.intai.techexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkThe RefinedWeb Dataset for Falcon LLM - Outperforming Curated Corpora with Web Data, and Web Data Onlyplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoOn the Coverage of Cognitive mmWave Networks with Directional Sensing and Communicationplus-squaremessage-squaremessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1message-squareOn the Coverage of Cognitive mmWave Networks with Directional Sensing and Communicationplus-squaremanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoGenerate Anything Anywhere in Any Scene.plus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1imageGenerate Anything Anywhere in Any Scene.plus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink