Ranter
Join devRant
Do all the things like
++ or -- rants, post your own rants, comment on others' rants and build your customized dev avatar
Sign Up
Pipeless API
From the creators of devRant, Pipeless lets you power real-time personalized recommendations and activity feeds using a simple API
Learn More
Comments
-
Data from a government agency?
One agency we worked with had a similar no header CSV. Which was fine because the columns were supposed to be Name, Address, and so on.
Easy right? Nope. Sometimes an address was the first column, a zip code, a phone number. Almost random.
We complained the data was completely unusable and their response "The data is accurate, you are the first company to complain"
Our requests to fix the data (because we know the CSV is likely generated by a 1950 COBOL program) was replied with "Any changes to our internal processes will have to be directed by a congressional decree. We recommend you contact your state senator."
Yes, moron. A state senator will make it his first priority to make sure names are in the Name field. -
@Demolishun > "how do you word a contract to say data must be parse-able?"
Can't say we ever had an issue with other companies.
We were legally required to use this data as-is, so there wasn't anything we could really do. -
Signed up on March. First post. Has not upvoted a single rant. What is this account?
Good first rant though. -
JsonBoa29761yI've seen shitheads pipe out logs from parallel processes as header-less CSV rows, and gather the files in a very hadoop-resembling reduce operation.
Then the reduced files are dumped and rotated on time buckets.
Since no row yielded can be guaranteed to be the first, each file had no header.
Seems like a no-brainer, right? Document it up and provide the header-only first file.
Buuut... the fuckers forgot that production code is _alive_, man.
And those many parallel processes were not workers of the same job, they were microservices piping out similarly-formatted logs as CSV rows.
Soon enough there were changes to a microservice or another, and exotically formatted log rows started getting mixed together in the same file.
If at least there were a version number column per row, we could do a second map-reduce and gather the similarly versioned/formatted data...
Fuck everything. -
gemsy291yYeah, I got a 7.5 GB CSV from a government agency, and at least it has a header. But every now and then, I come across a new kind of "NaN". Sometimes it's "---", then it's an empty column, then a "nan". What the fuck, guys?
-
MoboTheHobo3026334dBro welcome to my world where I deal as our company dev with consultants. For them this is ok, Im going Bonkers everday of this shitty workflow 🤣🤣
TL;DR alot of people in the IT field are just here for the high salary 😜
Related Rants
-
grayfox36* Selects text to copy * * Ctrl + C to copy * * Selects text to be replaced with copied text * * Ctrl + C a...
-
arthurdent25"Are you running android?" "No Samsung" "So your OS is android" "No Samsung"
-
linuxxx27"could I get admin privileges to reboot this server?" Sounds valid enough, right? OH YEAH SURE, YOU'RE A T...
A 520MB CSV file with 29 columns and no headers.
If it's not an impertinent question, why in the name of Satan's magnificent testicles would anyone do that?
I hope their pig dies.
devrant
pigs
stupidity
csv