• gigachad@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 months ago

    I guess it’s more of a critique of how bad CSV is for storing large data than pandas being inefficient

    • zaphod
      link
      fedilink
      English
      arrow-up
      13
      ·
      2 months ago

      CSV is not optimal, but then someone shows up and gives you 60GB of JSON instead of 600MB of CSV.

      • naught@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        3
        ·
        2 months ago

        Or they dump their entire 6gb SQL database, customer info and all, into a SQL file that you have to load into a mariadb docker container when you just needed a subset that you were going to turn into csv anyway ☺️