fossilesque@mander.xyzM to Science Memes@mander.xyzEnglish · 8 months agoPandasmander.xyzexternal-linkmessage-square3fedilinkarrow-up10arrow-down10
arrow-up10arrow-down1external-linkPandasmander.xyzfossilesque@mander.xyzM to Science Memes@mander.xyzEnglish · 8 months agomessage-square3fedilink
minus-squareKausta@lemm.eelinkfedilinkEnglisharrow-up0·8 months agoYou havent seen anything until you need to put a 4.2gb gzipped csv into a pandas dataframe, which works without any issues I should note.
minus-squareQuizzaciousOtter@lemm.eelinkfedilinkEnglisharrow-up0·8 months agoI really don’t think that’s a lot either. Nowadays we routinely process terabytes of data.
minus-squareKausta@lemm.eelinkfedilinkEnglisharrow-up1·8 months agoYeah, it was just a simple example. Although using just pandas (without something like dask) for loading terabytes of data at once into a single dataframe may not be the best idea, even with enough memory.
You havent seen anything until you need to put a 4.2gb gzipped csv into a pandas dataframe, which works without any issues I should note.
I really don’t think that’s a lot either. Nowadays we routinely process terabytes of data.
Yeah, it was just a simple example. Although using just pandas (without something like dask) for loading terabytes of data at once into a single dataframe may not be the best idea, even with enough memory.