what bugs me the most is thinking that if lemmy became as relevant as reddit, servers to spam all over the threadiverse would be created nonstop
does anyone have some resources into solutions for spam in stuff like email? i’d like to check if having some giants gatekeepers like microsoft and google is inevitable
(Just thinking out loud here) I wonder if it’s possible to pull posts via the API, format them into a text file in email format, run them through SpamAssassin, and then use the results to trigger an automod action?
You’d need to curate a good list of ham/spam posts to train it on, but the built-in heuristics may catch some low-hanging fruit out of the box.
If anyone has any thoughts or has tried this, lemme know. I’ve been kicking around the idea for a few months but haven’t gone any further than that.
honestly i don’t like automated systems based on content of the post rather than number
too easy to become biased against non native talkers and occasional promotion is healthy in a community
p.s. i’m also thinkign out loud, the web is too empty of actual discussions like the one we having right now, not everything has to be published only when finished :D (even tho on commercial social media it’s actually the opposite of having too much noise lol)
If you haven’t seen walls of pill spam from, typically, Kbin, then thank your instance admins. Picking those out in email is not quite a solved problem, but is routine and accurate enough that I only see that kind of spam once or twice a year. That’s the content I’m targeting.
Plus, I estimate about 10-15% of the people I work with (and/or interact with professionally) are non-native English speakers. I never have to dig their emails out of spam.