FOSS infrastructure is under attack by AI companies (thelibre.news)
Posted by simple@lemm.ee to Technology@lemmy.world · English · 20 hours ago · 42 comments
Cross-posted to: linux@programming.dev, opensource@jlai.lu, technology@lemmy.world, opensource@lemmy.ml, opensource@programming.dev, hackernews@lemmy.bestiver.se
wjs018@piefed.social · 18 hours ago
The theory that the lead maintainer had (he is an actual software developer; I just dabble) is that it might be a type of reinforcement learning:
1. Get your LLM to create what it thinks are valid bug reports/issues.
2. Monitor the outcome of those issues (closed immediately, discussion, eventual pull request).
3. Use those outcomes to assign how “good” or “bad” that generated issue was.
4. Use that scoring as a way to feed back into the model to influence it to create more “good” issues.
If this is what’s happening, then it’s essentially offloading your LLM’s reinforcement learning scoring to open source maintainers.
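To make the loop described above concrete, here is a rough Python sketch of the scoring step only. Everything in it is an assumption for illustration: the outcome names mirror the three outcomes mentioned in the comment, while the reward values, the `IssueResult` type, and the `variant` label are hypothetical. Nothing here is known to be what any AI company actually does.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical reward values. The comment only says outcomes are rated
# "good" or "bad"; these numbers are made up for the example.
OUTCOME_REWARD = {
    "closed_immediately": -1.0,  # maintainers rejected the issue outright
    "discussion": 0.5,           # the issue was plausible enough to discuss
    "pull_request": 1.0,         # the issue eventually led to a fix
}


@dataclass
class IssueResult:
    variant: str  # hypothetical label for the model/prompt variant that wrote the issue
    outcome: str  # how the maintainers responded


def score(result: IssueResult) -> float:
    """Map a maintainer response to a scalar reward."""
    return OUTCOME_REWARD[result.outcome]


def mean_reward_by_variant(results: list[IssueResult]) -> dict[str, float]:
    """Average the rewards per variant; a training loop could favor high scorers."""
    totals: dict[str, float] = defaultdict(float)
    counts: dict[str, int] = defaultdict(int)
    for r in results:
        totals[r.variant] += score(r)
        counts[r.variant] += 1
    return {v: totals[v] / counts[v] for v in totals}


# Made-up observations standing in for steps 1 and 2 of the list above.
results = [
    IssueResult("a", "closed_immediately"),
    IssueResult("a", "discussion"),
    IssueResult("b", "pull_request"),
    IssueResult("b", "discussion"),
]
rewards = mean_reward_by_variant(results)
```

The point of the sketch is that the only expensive part, deciding whether an issue was any good, is done by the maintainers for free; the rest is simple bookkeeping.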
HubertManne@piefed.social · 18 hours ago
That’s wild. I don’t have much hope for LLMs if this is how they are being trained, and I would not be surprised, given how poorly they work. Too much quantity over quality in training.