Hacker News

Thanks for the upvotes.

Here is a more technical deep dive comparing the three main open formats - https://www.onehouse.ai/blog/apache-hudi-vs-delta-lake-vs-ap...



I talk to customers basically all day about table formats. Only one customer has really brought up Hudi in a meaningful way. IMO, Hudi is basically out of contention for 95%+ of people looking at table formats.


For a Spark shop, Delta is the default choice. If you deploy to AWS, Glue encourages you to go with Iceberg. What makes people use Hudi?


That comparison blog seems biased toward Hudi.


Biased in what way? The authors provide solid arguments for why they think Hudi is a good tool.


I think it is also wrong about capabilities. For example, Redshift should be able to read Iceberg via Redshift Spectrum.

https://docs.aws.amazon.com/redshift/latest/dg/querying-iceb...


Biased in that the authors seem to favor Hudi, and the arguments for it seem to rest on that preference rather than on an objective presentation of all relevant factors.



