ByteOnBikes@slrpnk.net to Reddit@lemmy.world · 3 days ago
AI bots on Reddit reaching the front page? I absolutely believe it (image, slrpnk.net) · 87 comments
shittydwarf@lemmy.dbzer0.com · 3 days ago (edited)
The truly valuable data is the stuff that was created prior to LLMs; anything after that is tainted by slop. Any verifiable human data would be worth more, which is why they are simultaneously trying to erode any and all privacy.
gandalf_der_12te@discuss.tchncs.de · 2 days ago
I’m not sure about that. It implies that only humans are able to produce high-quality output, but that seems wrong to me. First of all, not everything that humans produce is high quality; rather the opposite. Second, as AI develops, I think it will be entirely possible for AI to generate good-quality output in the future.
morrowind@lemmy.ml · 2 days ago
Microsoft’s Phi-4 is trained primarily on synthetic data (generated by other AIs). It’s not a future thing; it’s been happening for years.