When running a flow in INCREMENTAL mode with a DATE column as the replication key, subsequent runs produce duplicate rows, since the comparison is performed using the >= operator.
Thank you @edgarrmondragon for pointing this out! For now, the solution would be to configure an idempotent sink on the other side to handle the duplicate rows.
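To illustrate what an idempotent sink means here, below is a minimal sketch, not taken from any Meltano target. It assumes `col_a` uniquely identifies a row; the `upsert` helper and the in-memory `table` dict are hypothetical stand-ins for a real target's upsert logic. The first run's contents are illustrative. Re-delivering row `a2` on the second run overwrites the existing entry instead of adding a second copy.

```python
def upsert(target, records, key="col_a"):
    """Write records into target keyed by `key`, overwriting existing entries."""
    for record in records:
        target[record[key]] = record
    return target


# Hypothetical batches: the second run re-emits a2 because of the >= comparison.
first_run = [{"col_a": "a2", "col_b": "b2", "key_col": "2024-01-02"}]
second_run = [
    {"col_a": "a2", "col_b": "b2", "key_col": "2024-01-02"},
    {"col_a": "a3", "col_b": "b3", "key_col": "2024-12-01"},
]

table = {}
upsert(table, first_run)
upsert(table, second_run)

# Three records were delivered in total, but only two distinct rows are stored.
print(sorted(table))
```

With a sink like this, the repeated row is harmless as long as the target has a usable key to upsert on.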
Closing the issue and the PR since it is tracked elsewhere.
When running a flow in `INCREMENTAL` mode with a DATE column as `replication-key`, subsequent runs produce duplicate rows, since the comparison is performed using the `>=` operator.
https://github.com/MeltanoLabs/tap-postgres/blob/main/tap_postgres/client.py#L242
The correct operator to use is `>`, regardless of the replication key type.
Example steps
State column: `key_col`
No state existing, full table load, output data is:
| col_a | col_b | key_col |
|-------|-------|---------|
| a3 | b3 | 2024-12-01 |
State value: `2024-01-02`
⚠ Duplicate row `a2` in output:
| col_a | col_b | key_col |
|-------|-------|---------|
| a2 | b2 | 2024-01-02 |
| a3 | b3 | 2024-12-01 |
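The difference between the two operators can be reproduced with a small sketch. This is not the tap's actual code: `incremental_select` is a hypothetical helper that filters in memory rather than building a SQL query, and the rows and state value are the ones from the example above.

```python
import operator
from datetime import date

# Rows from the example output: (col_a, col_b, key_col).
ROWS = [
    ("a2", "b2", date(2024, 1, 2)),
    ("a3", "b3", date(2024, 12, 1)),
]


def incremental_select(rows, bookmark, compare):
    """Return rows whose replication key satisfies compare(key, bookmark)."""
    return [row for row in rows if compare(row[2], bookmark)]


bookmark = date(2024, 1, 2)  # state value saved by the previous run

inclusive = incremental_select(ROWS, bookmark, operator.ge)  # key_col >= bookmark
exclusive = incremental_select(ROWS, bookmark, operator.gt)  # key_col > bookmark

print([row[0] for row in inclusive])  # ['a2', 'a3']
print([row[0] for row in exclusive])  # ['a3']
```

With `>=`, the row whose key equals the saved state value (`a2`) is selected again; with `>`, only `a3` is returned.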