So an AI company announces a new model that is so dangerous that they cannot release it. And, as usual, a raft of folks who I guess are trying to be influencers on LinkedIn uncritically parrot this...
All parsers are not made equal, looking at you CSV
Like Excel spreadsheets, data engineers will probably be ingesting CSV files until the heat death of the universe. It’s Death, Taxes, and problematic data formats. While not limited to CSV...
Python libraries to consider – Tenacity
Per the README, “Tenacity is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to just about anything.” I find this...
Free data tools to consider
YData Profiling YData Profiling is data profiler with a FOSS component and a paid upgrade. It is easy to use and powerful – It is a solid choice if you are working in the Spark ecosystem with...