this post was submitted on 04 Jul 2025
136 points (98.6% liked)

Programmer Humor

24736 readers
1788 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] wise_pancake@lemmy.ca 3 points 23 hours ago (1 children)

I’d probably just use line delimited JSON or CSV for this use case. It plays nicely with cat and other standard tools and basically all the yaml is doing is wrapping raw json and adding extra parse time/complexity.

In the end consider converting this to parquet for analysis, you probably won’t get much from compression or row-group clustering, but you will get benefits from the column store format when reading the data.

[–] qaz@lemmy.world 4 points 22 hours ago* (last edited 22 hours ago) (1 children)

Thanks for the advice, but this is just the format of some eyetracking software I had to use not something I develop myself

[–] wise_pancake@lemmy.ca 4 points 21 hours ago

Ah, well, such is software dependencies.