The RDBMS in this case accepts 9999-12-31 as a valid date; pandas does not. That is where the issue came in, and switching to PySpark meant we needed no date manipulation to handle the data supplied by the upstream system.
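For anyone wondering why pandas chokes on that value, here is a rough sketch using only the standard library. It assumes pandas' traditional default of nanosecond-resolution timestamps stored in a signed 64-bit integer (newer pandas releases can use coarser resolutions, so behaviour may differ by version). The names `INT64_MAX_NS`, `ns_ceiling` and `magic_date` are just illustrative.

```python
from datetime import date, datetime, timedelta

# Largest signed 64-bit integer, read as nanoseconds since 1970-01-01.
# Assumption: this mirrors how a nanosecond-resolution timestamp is stored.
INT64_MAX_NS = 2**63 - 1

# datetime only resolves microseconds, so drop the last three digits.
ns_ceiling = datetime(1970, 1, 1) + timedelta(microseconds=INT64_MAX_NS // 1000)

# The "magic" end-of-time date supplied by the upstream.
magic_date = date(9999, 12, 31)

print(ns_ceiling)                      # 2262-04-11 23:47:16.854775
print(magic_date > ns_ceiling.date())  # True
```

So anything past April 2262 simply does not fit in that representation, whereas Python's own `date` type (and the database) handles 9999-12-31 without complaint.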
Magic dates suck, but they exist in the wild. There are also valid cases where data is not tied to the lifetimes of humans currently writing code.
The range of the date type in PostgreSQL is 4713 BC to 5874897 AD:
https://www.postgresql.org/docs/current/datatype-datetime.ht...