Do all the things like ++ or -- rants, post your own rants, comment on others' rants and build your customized dev avatarSign Up
From the creators of devRant, Pipeless lets you power real-time personalized recommendations and activity feeds using a simple APILearn More
netikras3352027dDB table backup?
Have my ++!
Holy fuck. My condolences xD
PaperTrail1034127dData from a government agency?
One agency we worked with had a similar no header CSV. Which was fine because the columns were supposed to be Name, Address, and so on.
Easy right? Nope. Sometimes an address was the first column, a zip code, a phone number. Almost random.
We complained the data was completely unusable and their response "The data is accurate, you are the first company to complain"
Our requests to fix the data (because we know the CSV is likely generated by a 1950 COBOL program) was replied with "Any changes to our internal processes will have to be directed by a congressional decree. We recommend you contact your state senator."
Yes, moron. A state senator will make it his first priority to make sure names are in the Name field.
kobenz306926dmake it 29 files and let it burn
myss455326dSounds like an inside job for GPT
Signed up on March. First post. Has not upvoted a single rant. What is this account?
Good first rant though.
rantydev06006826dWhen I was young and naive, I thought people didn't knew better than to do what they do.
But know, it's almost always malicious compliance and it's done to make some checkbox tick, but that's all.
tosensei567726dyou'd rather have the same data as a 5GB .xlsx?
go count your blessings.
JsonBoa242224dI've seen shitheads pipe out logs from parallel processes as header-less CSV rows, and gather the files in a very hadoop-resembling reduce operation.
Then the reduced files are dumped and rotated on time buckets.
Since no row yielded can be guaranteed to be the first, each file had no header.
Seems like a no-brainer, right? Document it up and provide the header-only first file.
Buuut... the fuckers forgot that production code is _alive_, man.
And those many parallel processes were not workers of the same job, they were microservices piping out similarly-formatted logs as CSV rows.
Soon enough there were changes to a microservice or another, and exotically formatted log rows started getting mixed together in the same file.
If at least there were a version number column per row, we could do a second map-reduce and gather the similarly versioned/formatted data...
gemsy323dYeah, I got a 7.5 GB CSV from a government agency, and at least it has a header. But every now and then, I come across a new kind of "NaN". Sometimes it's "---", then it's an empty column, then a "nan". What the fuck, guys?
kiki3070720dAsk chatgpt to infer column names
grayfox35* Selects text to copy * * Ctrl + C to copy * * Selects text to be replaced with copied text * * Ctrl + C a...
arthurdent25"Are you running android?" "No Samsung" "So your OS is android" "No Samsung"
linuxxx28"could I get admin privileges to reboot this server?" Sounds valid enough, right? OH YEAH SURE, YOU'RE A T...