• dohpaz42@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    1
    ·
    5 months ago

    It’s normal to denormalize data in a relational database. Having a lot of joins can be expensive and non-performant. So it makes sense to use a common structure like JSON for storing the demoralized data. It’s concise, and still human readable and human writable.

    Why should I spin up a NoSQL solution when 99% of my data is relational?

    • GBU_28@lemm.ee
      link
      fedilink
      English
      arrow-up
      19
      ·
      5 months ago

      As a data engineer, I focus on moralizing my data, reforming it so it is ready to rejoin society

    • SpaceCowboy@lemmy.ca
      link
      fedilink
      arrow-up
      11
      arrow-down
      1
      ·
      5 months ago

      Having a lot of joins can be expensive and non-performant.

      Only if you don’t know how to do indexing properly. Normalized data is more performant (less duplication of data, less memory and bandwidth is being used) if you know how to index.

      It may have been true decades ago that denormalized tables were more performant, I don’t know. But today it’s far more common that the phrase “denormalized tables are more performant” is something that’s said by someone that sucks at indexing and/or is just being lazy.

      But I do put JSON into tables sometimes when the data is going to be very inconsistent between different items and there’s no need to index any of the values in there. Like if different vendors provide different kinds of information about their products, I need to store it somewhere, so just serialize it and put it in there to be read by a program that has abstraction layers to deal with it. It’s never going to perform well if I do a query on it, but if all that’s needed is to display details on one item at a time, it’s fine.

      • krippix@feddit.org
        link
        fedilink
        arrow-up
        1
        ·
        5 months ago

        I am currently trying to get deeper into database topics, could you maybe point me somewhere I can read up on that topic a bit more?