When querying for “legal” and “law” on lemmyverse.net, it’s starkly clear that many communities are missing from the database. I get far more results when running the same queries directly on specific instances. So what’s the problem? Is it no longer crawling?

  • FrostyTrichs@walledgarden.xyz
    1 month ago

    The crawler has been wonky for a week or more. If you check the recent posts at !trendingcommunities@feddit.nl, you’ll see the list has been very unpredictable lately because the bot isn’t reporting some of the bigger instances. At the same time, there seems to be a federation bug that no one has solved yet. It was/is affecting .world communities federating to lemmy.ml, and it’s been causing communities on walledgarden and some other instances to fall in and out of sync with each other.

    • activistPnk@slrpnk.netOP
      1 month ago

      The law communities I was expecting to find are old and on small instances. When I searched on just the domain portion of a small instance, Lemmyverse found only about 10% of the communities on that node.

      You would expect LV to not forget a community once it is in the DB. But it must also have a removal mechanism for communities that are deleted. It’s as if the removal detection is getting false positives for removals.
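
      A minimal sketch of the kind of safeguard that would avoid such false positives, assuming (not confirmed from the Lemmyverse source) that removal is driven by a community failing to show up in a crawl. The names `CommunityIndex`, `record_crawl`, and `MISS_LIMIT` are hypothetical and only illustrate the idea: a community is dropped only after several consecutive missed crawls, not after a single one.

```python
from dataclasses import dataclass, field

# Hypothetical threshold: consecutive missed crawls tolerated before removal.
MISS_LIMIT = 3


@dataclass
class CommunityIndex:
    # Maps community name -> number of consecutive crawls that missed it.
    misses: dict[str, int] = field(default_factory=dict)

    def record_crawl(self, seen: set[str]) -> list[str]:
        """Update the index from one crawl and return the names removed."""
        for name in seen:
            self.misses[name] = 0  # seen (again), so reset its miss count
        removed = []
        for name in list(self.misses):
            if name in seen:
                continue
            self.misses[name] += 1
            if self.misses[name] >= MISS_LIMIT:
                # Only now is the community treated as really gone.
                del self.misses[name]
                removed.append(name)
        return sorted(removed)
```

      With a threshold like this, one flaky crawl of a small instance would not wipe its communities from the DB; they would only disappear after repeated misses.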

      • FrostyTrichs@walledgarden.xyz
        1 month ago

        I understand the confusion.

        Maybe I should’ve been clearer that Lemmyverse as a whole has been unreliable, not just the crawler. Entire instances, communities, and mixes of the two have disappeared and reappeared between scans. It also hasn’t been reliably picking up changes to instances it already knows about.

        TLDR: Lemmyverse has something wrong with it.